The Data Mining Forum                             open-source data mining software data mining conferences Data Science for Social and Behavioral Analytics DSSBA 2022 data science journal
IMPORTANT: This is the old Data Mining forum.
I keep it online so that you can read the old messages.

Please post your new messages in the new forum: https://forum2.philippe-fournier-viger.com/index.php
 

Pages: PreviousFirst...34567...LastNext
Current Page: 5 of 67
Results 121 - 150 of 2010
3 years ago
webmasterphilfv
Hi, I was a bit busy. I still did not finish for the new version. Thanks for reminding me about the feature. I will try to see if there is a way to integrate it. But at the same time this week is very busy! Let's see.. I will post a message here also when the new version is released. There will be maybe 5 new algorithms. But I need to do the integration, some testing and update the websites
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Yes, you could see a text document as a sequence of events (words) and find the frequent episodes. This could tell you about how people write some frequent sequences of words in a document. Then after that you could use these patterns to do different things maybe.
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, High dimensional data just means that you have many dimensions (features/attributes/properties) to decribe your data. In real-life, we sometimes have the low-dimensional data. For example, you could have data about customers in a store, where each customer only have a few attributes like : name, birthdate, phone number, e-mail Or you could have data with many dimensions. For example,
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Thanks for your interest. We are currently preparing the datasets and code for release. The code of CLH-Miner and TKC for top-k cross level high utility itemset mining will be released in about 10 days. I can send you the datasets earlier, if you send me an e-mail to philfv8 AT yahoo DOT COM Best regards Philippe
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi Taras, Glad SPMF is useful. Good question. I am the founder of SPMF but I am not a user of R... So it is hard for me to provide help for the unofficial wrapper for R or to debug it. But I have searched a bit, and it seems that such error comes from a problem related to perhaps the input file format? like maybe some character that is forbidden is in the file... Actually, the error app
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, thanks for sharing your idea. Yes, that is another possibility. I think that I have to think about it for a while to see what is the best way to make it easy to use and yet keep the user interface simple to use and intuitive. Also, I need to find some time to modify the code... I may take a little while because this week I have many urgent things to do. Maybe the week after, I can find the
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
127. Re: EFIM
frank Wrote: ------------------------------------------------------- > Hi > thank you for help.in line 4 of algorithm 2 in > efim close when we want to check backward > extention of item beta we scan database alfa-D or > beta-D? > Regards Hi Frank, I will explain a bit. We have an itemset BETA and we want to check if it is closed. Thus we will check if it has a ba
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
128. Re: A bug
Hi, I see. You can send me your dataset to philfv8 AT yahoo DOT com and tell me what is the minimum support value, and I can test it to see what is the problem. If there was a bug, I guess it would be in the NegFin algorithm, since other algorithms like FPGrowth have been quite extensively tested and used by many people but we never know. I can check to find out what is the problem.
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
I am not sure what has been done recently for pattern mining in streams. I did not read on this topic for a while.
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Actually, machine learning is quite broad and even pattern mining can be viewed as machine learning. But I think that you are referring perhaps to neural networks? I know that some papers such as those by Dang Nguyen have used pattern mining in conjunction with some neural networks, if I remember. But I am not aware of papers that have used neural networks to find the patterns. But I g
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
I see. GUI should be not very hard to do... now i am a bit overloaded but maybe in 1 week or so, I can have time to check and try to do it. Otherwise maybe a bit later I will let you know if I do it. Philippe
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
132. Re: EFIM
Hi, Glad the article has been helpful to understnad the algorithm. I currently do not have detailed example for the EFIM-Closed algorithm and these days I am overloaded with work so I cannot really prepare one. But there is also a PPT presentation about this algorithm: http://www.philippe-fournier-viger.com/EFIM_and_EFIM-Closed_high_utility_mining.pdf It contains a few slides about EF
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Yes, I see. Maybe not too complicated to do... Do you mean from the command line interface or from the GUI ? If from the GUI, I think it would require the following changes: - change the dialog for choosing files to let the user choose more than one file - if more than one file are chosen, then apply the same algorithm on each file, one after the other. Maybe the names could be
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Thanks for the suggestion. I will consider it. I am not sure about how to best do it, but I will think about it. I also want to add some features to let the user run multiple algorithms one after the other. I think it would go in the same direction as your idea. But I need some time to find a way to do it and do it. Thanks for the suggestion
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, explaining the whole algorithm would take a lot of time. So I recommend to read this chapter from the book of Tan & Kumar: https://www-users.cs.umn.edu/~kumar001/dmbook/ch5_association_analysis.pdf It is free to read and has some example about FPGrowth. I think it can help you to understand it.
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
An horizontal database: Transaction1: item1 item3 item4 Transaction2: item1 item2 Transaction3: item2 item4 The corresponding vertical database: Item1: Transaction1 Transaction2 Item2: Transaction2 Item3: Transaction1 Item4: Transaction1 Transaction2 Both are equivalent. In a horizontal database, the rows are transactions. In a vertical database, the rows are items
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Yes, there is still a lot of thing can be done. Between the frequent itemset mining and high utility mining there are also several differences, so some strategies for frequent itemset mining in stream may not work directly for high utility itemset mining and may need to be adapted. This can lead to some new contributions. Besides, a good idea is also to make some new problems by combining s
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi Ashish, SCI is more restrictive than SCIE. SCI is an index for journal that was proposed many years ago. Then SCIE was proposed to include more journals. You can think of SCIE as an extension of SCI. That is why SCIE means SCI Expanded. So basically SCI can be viewed as more selective than SCIE. SCI includes some older journals that have a more well-established reputation than some new
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi Muhammad, Nice to receive your message. ESWA is a good journal. Wish your paper will have success there. HUIM is a popular topic. There are several papers about this topic in journals likes Knowledge-based systems, Information Sciences, Applied Intelligence, etc. About stream mining, I participated to a recent paper on this topic for high utility itemset mining: Duong, H., Ramampiar
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi Neni, Thanks for your message. I am not sure at the moment. The student who did this paper has graduated a year ago. The FIFA dataset is on the SPMF website as you know. By looking at the code, I think he used these two parameters to group transactions: // whether convert the transaction database to a sequential database or not boolean needGroup = false; // if
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
I see. So from what I understand, the client wants a tool to analyze text data. But there are many things that can be done with text data. If you know the goal (e.g. sentiment analysis, translation, etc.), maybe it could help to recommend some tool.
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi, Yes,it would certainly be possible to apply sequential pattern mining in video games. A sequence of actions done by a video game character or the player can be viewed as a sequence of events. Thus sequential pattern mining can be applied. Philippe
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi Dorothy, Which languages are you interested in? Best regards,
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Hi Jan, I think it depends on your data. I do not know any models specifically designed for that. But you may want to try different classification models and see which one works best. It would depend on the characteristics of your data. Related to this, there are some models for imbalanced data. Imbalanced means that there are much more records of a class A than of a class B for example. But
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Welcome to submit your papers! For any information, you can ask me.
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Special Session on Intelligent Knowledge Engineering in Decision Making Systems (IKEDS2021) https://ieaaie2021.wordpress.com/special-session-2-ikeds/ held at 34th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE 2021) Overview Intelligent Knowledge Engineering (analytics and mining) has become an emerging research t
Forum: The Data Mining / Big Data Forum
3 years ago
webmasterphilfv
Special Session on AI in Healthcare (AIH2021) https://ieaaie2021.wordpress.com/special-session-7-aih/ 34th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE 2021) Overview Healthcare has always been an important area, for both the governments and the population, where the technology has been playing important roles in t
Forum: The Data Mining / Big Data Forum
Pages: PreviousFirst...34567...LastNext
Current Page: 5 of 67

This forum is powered by Phorum and provided by P. Fournier-Viger (© 2012).
Terms of use.