The Data Mining Forum                             open-source data mining software data mining conferences Data Science for Social and Behavioral Analytics DSSBA 2022 data science journal
IMPORTANT: This is the old Data Mining forum.
I keep it online so that you can read the old messages.

Please post your new messages in the new forum: https://forum2.philippe-fournier-viger.com/index.php
 

Pages: PreviousFirst...56789...LastNext
Current Page: 7 of 67
Results 181 - 210 of 2010
3 years ago
webmasterphilfv
The list has been updated.
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, I will try to do it for you in the next days. Best regards, Philippe
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, You could use SPMF-PY to call the Charm implementation of SPMF from your Python code. https://github.com/LoLei/spmf-py Best regards, Philippe
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi Irfan, Glad it is useful. I think it could be modified for rare class association rules. However, there are several definition of what is "rare". If "rare" just means to have a support lower than some threshold maxsup, then I think it would not be hard to do. But if you use other definitions of what is a rare rule, maybe it is more complicated. But in any case,
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
You are welcome!
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, I will try to explain this optimization with a simple example. Let's say that we have a database containing only three transactions: Transaction 1: A B C D E F Transaction 2: A B C D E F Transaction 3: A B C Then, the algorithm will star to search for the closed itemsets. At some point the algorithm will find {A,B,C} and will then want to extend that itemset further to find
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Yes, that EUCP structure is a matrix that stores the TWU of all pairs of items (two items). It would be possible to build a structure to precalculate the TWU of itemsets containing more than 2 items, but perhaps that it would require too much memory so that is why we did not try it. But although, the EUCP structure only stores the TWU of two items, it can be used to reduce the search sp
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi all, It is my pleasure to announce that I am co-editing an upcoming Springer book called "Tracking and Preventing Diseases with Artificial Intelligence". If you have some research related to this topic, please consider submitting a chapter! The deadline is the 13th November 2020 (extended). More details about the book can be found here below: Call for Book Chapters on “Tra
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
CALL FOR CHAPTERS Periodic Pattern Mining: Theory, Algorithms and Applications Editors: Rage Uday Kiran, Philippe Fournier-Viger, Anirban Mondal, José María Luna Jerry Chun-Wei Lin An edited book to be published by Springer in 2021 Chapter proposal deadline: 1st October 2020 Proposal acceptance date: 10th October 2020 Full chapter submission deadline: 15th January 2021
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Ok! Good. Hope you get some good results. :-)
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Sorry for the long delay to answer. My schedule has been very busy this week. I tried to answer you before but I lost the message and had to write again. Happy that the software is useful. And welcome to the forum. I think that there are many algorithms that could be applied and it depends a bit on what you do and also how you prepare the data. I think that the basic data that you
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi all, I just want to remind you that there is still about two months for our UDML 2020 workshop at ICDM2020 about Utility Mining and Learning! It is a good place to submit your pattern mining papers, and also machine learning papers! http://www.philippe-fournier-viger.com/utility_mining_workshop_2020/ Best regards, Philippe
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Yes, it is a decent conference. Maybe not in the top 10 for data mining, but still an OK conference. If you live in Australia or around especially, I think it is worth considering it.
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Thanks for visiting the forum. Yes, it is certainly a topic that could be done. But personally, I dont like too much the topic of erasable itemset mining. Compared to some other topics, I did not see much application related to this. That is just my opinion. Best regards,
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi! Glad you have found the solution as I am not a Python user. So that is good that you have found the solution by yourself, and also happy that you share it. Could you help me to write a small tutorial explaining how to use SPMF from Python to run an algorithm? Then, I could put it on the official webpage of SPMF and I could also add you as a contributior. I think that many people wou
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Maybe you have not followed the correct input format. Please see the documentation. It is important to use the -1 and -2 in the input file. If there is no -1 and if there is other problems in the input file, then it is possible that you get strange results. Also, it is possible that you get no result because the minsup threshold is too high. Then you should decrease it. Best regards,
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, I see. Then, I think it can be appropriate for sequential pattern mining. In sequential pattern mining the input is a sequence database (a set of sequences) and the goal is to find subsequences that appear frequently in these sequences. In your case, the sequence database could contain a set of sequences of clicks performed by different experts. Then by applying a sequential pattern
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Good evening, Ok, I dont know JIRA so maybe you can tell me more about this. Your data contains only one sequence ? (for one user?) What is your goal? What do you want to do using the sequential patterns? Depending if you have a single sequence or many sequences, then maybe you could use some different algorithms. Best regards,
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi again, You are welcome! Yes, that equation is another solution to that problem of how to set the minsup! Best regards, Philippe
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, In that paper, for the experiments, the minsup threshold was varied so as to show the performance gap between the algorithms. I started with a high minsup threshold and lower it down until it became very slow and i could see clearly the difference between the algorithms in terms of performance. Generally, the minsup threshold will be different on each dataset. On some datasets, a minsu
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Great happy to know that it works now! Regards
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, If you are using SPMF, you need to make sure that your file follows the correct input format. The input format is explained in the documentation. For example, for Apriori, see the example and explanation here: http://www.philippe-fournier-viger.com/spmf/Apriori.php Briefly, it could be like this: 1 3 4 2 3 5 1 2 3 5 2 5 1 2 3 5 or like that if you use the GUI: @CONVERTED
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ The 18th Australasian Data Mining Conference 2020 (AusDM'20) Canberra, Australia, 1-4 December 2020 Homepage: https://ausdm20.ausdm.org/ +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both pr
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
De: "Sabeur Aridhi" <sabeur.aridhi@loria.fr> À: "liste-egc" <liste-egc@polytech.univ-nantes.fr>, "liste-proml" <liste-proml@lists.lri.fr>, "madics-all" <madics-all@univ-grenoble-alpes.fr> Envoyé: Samedi 9 Mai 2020 01:46:21 Objet: CFP: Workshop on Advances in Managing and Mining Large Evolving Graphs (LEG) =================
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Ok glad to know that you have found the problem Best regards, Philippe
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
My e-mail is : philfv8 AT yahoo DOT com
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, I see. There could be various reasons for this problem. A possible reason could be that the input format is not correct... Maybe you may want to double check this, especially that there is a -2 at the end of each line and that items cannot appear more than once in each itemset. But if this is not the problem, to investigate further, you may send me the input file and tell me the paramet
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, 1) If you have an input file where other separators are used instead of -1 and -2 and you want to use SPMF, then you could just open the file in a text editor and replace the separators by -1 and -2 using the "find and replace" function. Another way is to write a small program using any programming language to convert your file. 2) Now, if you want to see a different type of
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi again, Glad it helps. How to choose the thresholds values depends on your data. For some datasets, there could be no patterns with a support of 0.1, while for another datasets, there could be millions of patterns having a support of 0.99. So there is not really a way of knowing what is a good support value on your data without testing it. You could first apply an algorithm like FP
Forum: The Data Mining / Big Data Forum
Pages: PreviousFirst...56789...LastNext
Current Page: 7 of 67

This forum is powered by Phorum and provided by P. Fournier-Viger (© 2012).
Terms of use.