The Data Mining Forum                             open-source data mining software data mining conferences Data Science for Social and Behavioral Analytics DSSBA 2022 data science journal
IMPORTANT: This is the old Data Mining forum.
I keep it online so that you can read the old messages.

Please post your new messages in the new forum:
Posted by: qinqinzhou
Date: November 28, 2021 05:34AM



Options: ReplyQuote
Re: 数据库的转换
Posted by: qinqinzhou
Date: November 28, 2021 04:06PM


Options: ReplyQuote
Re: 数据库的转换
Date: November 28, 2021 09:16PM


Thanks for using SPMF.

This data is suitable for SPMF. However, as you have noticed the data must be transformed to be used by SPMF.

To transform the data, there is some tools available in SPMF. However, it is impossible to have a tool for all possible types of data. If the type of data is not supported, you may write a simple program or script to convert the data by yourself.

I see that your data is like a table. If your data is in an Excel file, you may first export it to a CSV file.

Then, after that you could modify the format by hand or using your own script for example...

What format you should use? It depends on what you want to do.

If you want to apply a frequent itemset mining algorithm, you could encode the data like that:

1 3 4
2 3 5
1 2 3 5
2 5
1 2 3 5

Here this is the format required by the Apriori algorithm (see documentation: )

This format means that the first line contains 1, 3 and 4, which are Apple, Tomato and Milk.

The second line contains the items 2, 3 and 5, which means Orange, Tomato and Bread.

For your data, the meaning of 1,2,3,4,5... would be different. It would be the Chinese terms.

Hope that this give you some helps!

Best regards,


Edited 1 time(s). Last edit at 11/28/2021 09:17PM by webmasterphilfv.

Options: ReplyQuote

This forum is powered by Phorum and provided by P. Fournier-Viger (© 2012).
Terms of use.