Re: TKS Heap Space Memory Error
Date: March 19, 2019 06:26AM
Hi,
Using a minimum pattern length of 12 is probably one of the reasons why it uses so many resources.
Generally, the stricter the constraints, the faster the algorithm. I would suggest that you:
- Use the maximum length constraint instead, and set it to a small value such as 3 or 4. If it runs, you can then increase it to larger values. But if you set minlength = 12, the search space will be huge. The minimum length constraint does not help to reduce the search space, and makes it worse, while the maximum length constraint can greatly reduce the size of the search space.
- Consider adding other constraints, such as a maximum gap. This can also help to reduce the number of possibilities.
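To give a rough idea of why the maximum length matters so much, here is a small Python sketch. It is not SPMF code, and it assumes a simplified setting where each pattern is just an ordered list of single items (no itemsets, no pruning by support), so the numbers are only a crude upper bound. The function name `count_candidates` and the value of 100 distinct items are made up for illustration.

```python
def count_candidates(num_items, max_length):
    """Upper bound on the number of candidate patterns of length 1..max_length.

    Simplifying assumption: a pattern is an ordered list of single items,
    so there are num_items ** k possible patterns of length k.
    """
    return sum(num_items ** k for k in range(1, max_length + 1))


if __name__ == "__main__":
    num_items = 100  # hypothetical number of distinct items in the dataset
    for max_length in (3, 4, 12):
        print(max_length, count_candidates(num_items, max_length))
```

The count grows exponentially with the maximum length. Also, to reach patterns of length 12, the search has to go through the shorter patterns first, which is why a minimum length by itself does not shrink the search space.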
Actually, even if a dataset contains only 60k sequences, if the sequences are very long and similar, the search space can still be very big!
Besides the above suggestions, another possibility is to do some preprocessing on the data to remove irrelevant items, or to apply other transformations that make the data simpler.
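As an example of such preprocessing, here is a minimal Python sketch. It assumes that "irrelevant" means "appears in very few sequences", which is only one possible criterion, and that each sequence is a plain list of integer items (not the SPMF file format). The function name `remove_rare_items` and the threshold are hypothetical.

```python
from collections import Counter


def remove_rare_items(sequences, min_sequences):
    """Drop items that occur in fewer than min_sequences sequences.

    Assumption: each sequence is a plain list of items. Sequences that
    become empty after filtering are removed as well.
    """
    # Count, for each item, how many sequences contain it at least once.
    support = Counter()
    for seq in sequences:
        support.update(set(seq))

    keep = {item for item, count in support.items() if count >= min_sequences}
    filtered = [[item for item in seq if item in keep] for seq in sequences]
    return [seq for seq in filtered if seq]


if __name__ == "__main__":
    data = [[1, 2, 3], [1, 3, 4], [1, 2, 5]]
    # Items 4 and 5 each appear in only one sequence, so they are dropped.
    print(remove_rare_items(data, min_sequences=2))
```

Shorter sequences with fewer distinct items usually mean fewer candidate patterns to explore, but what counts as irrelevant depends on your data and your goal.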
Best regards