The Data Mining Forum
IMPORTANT: This is the old Data Mining forum.
I keep it online so that you can read the old messages.

Please post your new messages in the new forum: https://forum2.philippe-fournier-viger.com/index.php
 
Threshold raising strategies - LIU-LB
Posted by: solmaz
Date: February 01, 2020 02:19PM

Hello, Prof. Philippe Fournier-Viger.

Have you read the paper "Mining top-k high utility itemsets with effective threshold raising strategies" by Krishnamoorthy (2019)?

In Section 4.2.2, why is the maximum number of breakpoints set to 3 (q = 3, contiguous)?

If an itemset has more breakpoints than that, what should we do?

Re: Threshold raising strategies - LIU-LB
Posted by: srikumar
Date: February 01, 2020 09:34PM

It is a heuristic, and you can certainly use higher values of q. However, there is also a downside to using higher values of q, as mentioned in Section 4.2.2 (below Definition 18) of the paper. Please also refer to the example given in Figure 3, where the utilities of subsets are estimated using fdaec and daec. One can observe that the utility estimate of fac is 15 (actual 57) and the estimate of fc is -30 (actual 22). As you remove more items, the estimate is likely to fall dramatically, so larger values of q might mostly lead to wasted computation. One can also set a dynamic threshold (not considered in the paper) and stop generating subsets when the estimate becomes zero or negative.
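To make the dynamic stopping idea concrete, here is a rough Python sketch. It is not code from the paper. It assumes the estimate for a subset is the utility of the contiguous sequence minus the utilities of the removed items, and that only items between the first and last item are removed. The function name lb_estimates and all utility values are hypothetical and do not correspond to Figure 3.

```python
from itertools import combinations

def lb_estimates(sequence, seq_utility, item_utility, q=3):
    """Estimate utilities of subsets obtained by removing up to q interior items.

    Assumption: estimate = seq_utility - sum of utilities of removed items.
    Generation stops at the first level where no estimate is positive, because
    removing even more items can only lower the estimate further (utilities > 0).
    """
    interior = sequence[1:-1]  # assumption: first and last items are always kept
    results = {}
    for k in range(1, min(q, len(interior)) + 1):
        found_positive = False
        for removed in combinations(interior, k):
            estimate = seq_utility - sum(item_utility[x] for x in removed)
            if estimate > 0:
                kept = "".join(x for x in sequence if x not in removed)
                results[kept] = estimate
                found_positive = True
        if not found_positive:
            break  # dynamic stop: every larger removal set is also non-positive
    return results

# Hypothetical numbers, chosen only for illustration.
estimates = lb_estimates("fdaec", 90, {"d": 30, "a": 40, "e": 35})
print(estimates)
```

With these made-up values, removing one or two items still gives positive estimates, but removing all three interior items gives a negative one, so fc is never reported and a larger q would add no useful candidates.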

Hope this clarifies.

Re: Threshold raising strategies - LIU-LB
Posted by: solmaz
Date: February 02, 2020 08:47AM

Thank you. This issue is clear now.



This forum is powered by Phorum and provided by P. Fournier-Viger (© 2012).