Questions about Rare Patterns
(from the Pattern Mining Course)
Click on a question to see the answer.
What are the problems with this definition?
There are generally too many infrequent itemsets. And some infrequent itemsets may not even appear in the data ( they may have a support of 0).
An itemset X is a minimal rare itemset if sup(X) < minsup and all the proper subsets of X are frequent itemsets.
An itemset X is a perfectly rare itemset if sup(X)โฅ minsup, sup(X)<maxsup and for any non empty subset ๐ โ ๐, sup(Y) โค maxsup.
Generally, frequent itemset mining algorithms start from single items and combine them to find larger itemsets.
As itemsets become larger, the support can decrease.
Searching for minimal rare itemsets is challenging because to reach the minimal rare itemsets, we must pass through ยป the frequent itemsets. In other words, we cannot start by directly searching for minimal rare itemsets without considering how to pass through the frequent itemsets.
Contrarily to Apriori, AprioriInverse has two parameters : minsup and maxsup.
The main difference between Apriori and AprioriInverse from an algorithmic perspective, is that AprioriInverse will initially discards all items that have a support that is greater than maxsup. Then, AprioriInverse will search for frequent itemsets using the same procedure as Apriori.