Hi all,
I have a question about my input file. For some reason, i can't seem to get any decent output.
It a huge input file, so i tried to mimic what it looks like:
@CONVERTED_FROM_TEXT
@ITEM=98=20205AutoStopopen
@ITEM=5=10012Safetyguardstopstate
@ITEM=163=40538MaxTimeExpired
@ITEM=114=20314Enable2supervisionfault
@ITEM=126=20481SCOVRactive
@ITEM=85=20074Notallowedcommand
...
etc
@ITEM=-1=|
211 211 211 211 211 211 211 39 24 25 50 211 39 50 25 24 -2
39 50 25 24 39 5 98 3 24 4 50 25 39 25 50 24 39 24 50 25 39 5 98 3 25 50 24 -2
58 59 60 67 66 58 59 60 39 98 5 3 4 25 50 24 460 460 460 460 460 -2
....
etc etc
I have 491 integers, all mapped to these string values. Some things that i spot that maybe are resulting in 0 sequences found:
- the int --> strings are not in order at the top of my dataset
- i have to splits in my sequence that are used as a -1 like:
1 -1 1 2 3 -1 1 3 -1 4 -1 3 6 -1 -2
But i don't have those splits in my dataset. Each line is one huge sequence.
- I am using the wrong algorithm (PrefixSpan, CM_SPAM)
My goal is to find closed sequential patterns. What am i doing wrong?