The Data Mining Forum                             open-source data mining software data mining conferences Data Science for Social and Behavioral Analytics DSSBA 2022 data science journal
IMPORTANT: This is the old Data Mining forum.
I keep it online so that you can read the old messages.

Please post your new messages in the new forum: https://forum2.philippe-fournier-viger.com/index.php
 
gspan does not output all frequent subgraphs
Posted by: szevin
Date: January 17, 2022 08:45PM

Hello,

Given graph dataset of two identical graphs:
t # 0
v 0 2
v 1 2
v 2 2
v 3 2
v 4 2
e 0 1 0
e 1 2 3
e 1 4 3
e 2 3 3
e 3 4 3
t # 1
v 0 2
v 1 2
v 2 2
v 3 2
v 4 2
e 0 1 0
e 1 2 3
e 1 4 3
e 2 3 3
e 3 4 3

cgspan does not output the maximal frequent subgraph

t # 0 * 2
v 0 2
v 1 2
v 2 2
v 3 2
v 4 2
e 0 1 0
e 1 2 3
e 1 4 3
e 2 3 3
e 3 4 3
x 0 1

Thanks,
Shaul

Options: ReplyQuote
Re: gspan does not output all frequent subgraphs
Date: January 18, 2022 12:46AM

Thanks for reporting the bug.

The solution is to deactivate the SKIP strategy in the code of gSpan and TKG.

/** skip strategy */
private static final boolean SKIP_STRATEGY = false;



We already discussed about that by email. But I post the solution here just in case some other people are looking for it.

The bug will be fixed in the next release of SPMF.winking smiley

Options: ReplyQuote


This forum is powered by Phorum and provided by P. Fournier-Viger (© 2012).
Terms of use.