gSpan does not output all frequent subgraphs

Posted by: **szevin**

Date: January 17, 2022 08:45PM

Hello,

Given a graph dataset of two identical graphs:

t # 0

v 0 2

v 1 2

v 2 2

v 3 2

v 4 2

e 0 1 0

e 1 2 3

e 1 4 3

e 2 3 3

e 3 4 3

t # 1

v 0 2

v 1 2

v 2 2

v 3 2

v 4 2

e 0 1 0

e 1 2 3

e 1 4 3

e 2 3 3

e 3 4 3
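
For readers unfamiliar with the input format: a `t # N` line starts graph N, `v id label` declares a vertex, and `e v1 v2 label` declares an edge. Below is a minimal sketch of a reader for this format, useful for checking what a dataset contains before mining it. The names `GraphDatasetSketch`, `Graph`, and `parse` are hypothetical and are not part of SPMF; the sketch only assumes the three line types shown above.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class GraphDatasetSketch {

    /** One graph of the dataset (hypothetical helper type, not an SPMF class). */
    static final class Graph {
        final int id;
        final Map<Integer, Integer> vertexLabels = new HashMap<>(); // vertex id -> label
        final List<int[]> edges = new ArrayList<>();                // {v1, v2, edge label}

        Graph(int id) {
            this.id = id;
        }
    }

    /** Parses text made of "t # id", "v id label" and "e v1 v2 label" lines. */
    static List<Graph> parse(String text) {
        List<Graph> graphs = new ArrayList<>();
        Graph current = null;
        for (String raw : text.split("\n")) {
            String line = raw.trim();
            if (line.isEmpty()) {
                continue; // blank lines between records are ignored
            }
            String[] p = line.split("\\s+");
            if (p[0].equals("t")) {
                current = new Graph(Integer.parseInt(p[2]));
                graphs.add(current);
            } else if (current == null) {
                throw new IllegalArgumentException("Record before first 't' line: " + line);
            } else if (p[0].equals("v")) {
                current.vertexLabels.put(Integer.parseInt(p[1]), Integer.parseInt(p[2]));
            } else if (p[0].equals("e")) {
                current.edges.add(new int[] {
                    Integer.parseInt(p[1]), Integer.parseInt(p[2]), Integer.parseInt(p[3])
                });
            }
            // any other line type is skipped in this sketch
        }
        return graphs;
    }

    public static void main(String[] args) {
        // First graph of the dataset from this post.
        String dataset = String.join("\n",
            "t # 0",
            "v 0 2", "v 1 2", "v 2 2", "v 3 2", "v 4 2",
            "e 0 1 0", "e 1 2 3", "e 1 4 3", "e 2 3 3", "e 3 4 3");
        for (Graph g : parse(dataset)) {
            System.out.println("graph " + g.id + ": " + g.vertexLabels.size()
                + " vertices, " + g.edges.size() + " edges");
        }
    }
}
```

Each graph in the dataset above has 5 vertices and 5 edges, so any frequent subgraph reported at support 2 can have at most 5 edges.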

cgspan does not output the maximal frequent subgraph

t # 0 * 2

v 0 2

v 1 2

v 2 2

v 3 2

v 4 2

e 0 1 0

e 1 2 3

e 1 4 3

e 2 3 3

e 3 4 3

x 0 1

Thanks,

Shaul

Posted by: **webmasterphilfv**

Date: January 18, 2022 12:46AM

Thanks for reporting the bug.

The solution is to deactivate the SKIP strategy in the code of gSpan and TKG.

/** skip strategy */

private static final boolean SKIP_STRATEGY = false;

We already discussed this by email, but I am posting the solution here in case other people are looking for it.

The bug will be fixed in the next release of SPMF.

