Re: Where to start ?
Date: April 10, 2014 08:22AM
"i don't know what to use as database input format for converting"
I'm not sure what you mean by this. If you mean that you want to convert from CSV_integer to SPMF format for a transaction database, then yes, there is a tool that is offered for that in SPMF (see the documentation).
"What is the min processor capacity required for mining very large databases?"
It depends which algorithms you are using, what parameters you are using for the algorithm and what kind of data you have as input.
Usually, when parameters are set lower, the number of patterns can increase exponentially in some cases. If your data has very long transactions or sequences or as few items repeated multiple times, or many similar transactions/sequences it will also increase the number of patterns and thus the memory and execution time required. Also, some algorithm are faster than some others.
So the answer to this question depends on (1) the algorithm, (2) the data, (3) the parameters.