SPMF: A Java Open-Source Data Mining Library

SPMF Documentation

This section provides examples of how to use the SPMF open-source data mining library to perform various data mining tasks.

If you have any questions or want to report a bug, you can check the FAQ, post in the forum, or contact me. You can also look at the articles referenced on the algorithms page of this website to learn more about each algorithm.

List of examples

Itemset Mining (Frequent Itemsets, Rare Itemsets, etc.)

Example 1 : Mining Frequent Itemsets by Using the Apriori Algorithm
Example 2 : Mining Frequent Itemsets by Using the AprioriTID Algorithm
Example 3 : Mining Frequent Itemsets by Using the FP-Growth Algorithm
Example 4 : Mining Frequent Itemsets by Using the Relim Algorithm
Example 5 : Mining Frequent Itemsets by Using the Eclat / dEclat Algorithm
Example 6 : Mining Frequent Itemsets by Using the H-Mine Algorithm
Example 7 : Mining Frequent Itemsets by Using the FIN Algorithm
Example 8 : Mining Frequent Itemsets by Using the DFIN Algorithm
Example 9 : Mining Frequent Itemsets by Using the NegFIN Algorithm
Example 10 : Mining Frequent Itemsets by Using the PrePost / PrePost+ Algorithm
Example 11 : Mining Frequent Itemsets by Using the LCMFreq Algorithm
Example 12 : Mining Frequent Closed Itemsets Using the AprioriClose Algorithm
Example 13 : Mining Frequent Closed Itemsets Using the DCI_Closed Algorithm
Example 14 : Mining Frequent Closed Itemsets Using the Charm / dCharm Algorithm
Example 15 : Mining Frequent Closed Itemsets Using the LCM Algorithm
Example 16 : Mining Frequent Closed Itemsets Using the FPClose Algorithm
Example 17 : Mining Frequent Closed Itemsets Using the NAFCP Algorithm
Example 18 : Mining Frequent Maximal Itemsets Using the FPMax Algorithm
Example 19 : Mining Frequent Maximal Itemsets Using the Charm-MFI Algorithm
Example 20 : Mining Frequent Generator Itemsets Using the DefMe Algorithm
Example 21 : Mining Frequent Itemsets and Identifying the Generators Using the Pascal Algorithm
Example 22 : Mining Frequent Closed Itemsets and Minimal Generators Using the Zart Algorithm
Example 23 : Mining Minimal Rare Itemsets Using the AprioriRare Algorithm
Example 24 : Mining Perfectly Rare Itemsets Using the AprioriInverse Algorithm
Example 25 : Mining Rare Correlated Itemsets Using the CORI Algorithm
Example 26 : Mining Rare Itemsets Using the RP-Growth Algorithm
Example 27 : Mining Closed Itemsets from a Data Stream Using the CloStream Algorithm (source code version only)
Example 28 : Mining Recent Frequent Itemsets from a Data Stream Using the estDec Algorithm (source code version only)
Example 29 : Mining Recent Frequent Itemsets from a Data Stream Using the estDec+ Algorithm (source code version only)
Example 30 : Mining Frequent Itemsets from Uncertain Data with the UApriori Algorithm
Example 31 : Mining Erasable Itemsets from a Product Database with the VME algorithm
Example 32 : Building, updating incrementally and using an Itemset-Tree to generate targeted frequent itemsets and association rules (source code version only)
Example 33 : Building, updating incrementally and using a Memory-Efficient Itemset-Tree to generate targeted frequent itemsets and association rules (source code version only)
Example 34 : Mining Frequent Itemsets with Multiple Support Thresholds Using the MSApriori Algorithm
Example 35 : Mining Frequent Itemsets with Multiple Support Thresholds Using the CFPGrowth++ Algorithm
Example 36 : Mining Fuzzy Frequent Itemsets in a quantitative transaction database using the FFI-Miner algorithm
Example 37 : Mining Multiple Fuzzy Frequent Itemsets in a quantitative transaction database using the MFFI-Miner algorithm
Example 38 : Deriving Frequent Itemsets from Frequent Closed Itemsets using the LevelWise algorithm
Example 39 : Deriving Frequent Itemsets from Frequent Closed Itemsets using the DFI-Growth algorithm
Example 40 : Mining Self-Sufficient Itemsets using the Opus-Miner algorithm
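
Most of the examples in this category share one idea: count how many transactions contain an itemset (its support) and keep the itemsets whose support reaches a minimum threshold. The following is a minimal sketch of that idea in plain Java, using a simplified level-wise (Apriori-style) search without the full subset-pruning step. It does not use the SPMF API; the class name, method names, and the sample database are assumptions made for illustration only.

```java
import java.util.*;

// Illustrative sketch only: this is NOT the SPMF implementation or API.
public class FrequentItemsetSketch {

    // Hypothetical sample database: each transaction is a set of item ids.
    static List<Set<Integer>> sampleDatabase() {
        List<Set<Integer>> db = new ArrayList<>();
        db.add(new HashSet<>(Arrays.asList(1, 3, 4)));
        db.add(new HashSet<>(Arrays.asList(2, 3, 5)));
        db.add(new HashSet<>(Arrays.asList(1, 2, 3, 5)));
        db.add(new HashSet<>(Arrays.asList(2, 5)));
        db.add(new HashSet<>(Arrays.asList(1, 2, 3, 5)));
        return db;
    }

    // Support = number of transactions containing every item of the candidate.
    static int support(Set<Integer> candidate, List<Set<Integer>> transactions) {
        int count = 0;
        for (Set<Integer> t : transactions) {
            if (t.containsAll(candidate)) {
                count++;
            }
        }
        return count;
    }

    // Level-wise search: frequent itemsets of size k are extended with a larger
    // item to form candidates of size k + 1; infrequent candidates are dropped.
    static Map<Set<Integer>, Integer> mine(List<Set<Integer>> transactions, int minSupport) {
        Map<Set<Integer>, Integer> frequent = new LinkedHashMap<>();
        TreeSet<Integer> items = new TreeSet<>();
        for (Set<Integer> t : transactions) {
            items.addAll(t);
        }
        // Level 1: one candidate per distinct item.
        Set<TreeSet<Integer>> level = new LinkedHashSet<>();
        for (Integer item : items) {
            level.add(new TreeSet<>(Collections.singleton(item)));
        }
        while (!level.isEmpty()) {
            Set<TreeSet<Integer>> next = new LinkedHashSet<>();
            for (TreeSet<Integer> candidate : level) {
                int sup = support(candidate, transactions);
                if (sup < minSupport) {
                    continue; // no superset of an infrequent itemset can be frequent
                }
                frequent.put(candidate, sup);
                // Extend only with items greater than the last one to avoid duplicates.
                for (Integer item : items.tailSet(candidate.last(), false)) {
                    TreeSet<Integer> extended = new TreeSet<>(candidate);
                    extended.add(item);
                    next.add(extended);
                }
            }
            level = next;
        }
        return frequent;
    }

    public static void main(String[] args) {
        // minSupport is an absolute transaction count here (an assumption of this sketch).
        Map<Set<Integer>, Integer> result = mine(sampleDatabase(), 2);
        for (Map.Entry<Set<Integer>, Integer> e : result.entrySet()) {
            System.out.println(e.getKey() + " support: " + e.getValue());
        }
    }
}
```

The algorithms listed above differ mainly in how they avoid this brute-force counting, for example through tree structures or vertical representations, so this sketch is only suitable for very small databases.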

High-Utility Pattern Mining

Association Rule Mining

Clustering

Sequential Pattern Mining

Sequential Rule Mining

Sequence Prediction (source code version only)

Periodic pattern mining

Episode Mining

Graph Pattern Mining

Text Mining

Example 180 : Clustering Texts with a text clusterer
Example 181 : Classifying Text documents using a Naive Bayes approach (source code version only)

Time Series Mining

Example 182 : Visualize time series using the time series viewer
Example 183 : Calculate the prior moving average of time series
Example 184 : Calculate the cumulative moving average of time series
Example 185 : Calculate the central moving average of time series
Example 186 : Calculate the min max normalization of a time series
Example 187 : Calculate the standardization of a time series
Example 188 : Calculate the median smoothing of a time series
Example 189 : Calculate the exponential smoothing of a time series
Example 190 : Calculate the first order differencing of a time series
Example 191 : Calculate the second order differencing of a time series
Example 192 : Calculate the piecewise aggregate approximation of time series
Example 193 : Calculate the autocorrelation function of a time series
Example 194 : Calculate the regression line of a time series using the least squares method, and perform time series forecasting
Example 195 : Split time series by length
Example 196 : Split time series by number of segments
Example 197 : Convert time series to sequences using the SAX algorithm (useful to then apply sequential pattern mining/rule algorithms)

Besides the above examples for time series mining, clustering algorithms such as K-Means can also be applied to time series.
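
To give a rough idea of what two of the time series transformations listed above compute, here is a minimal sketch in plain Java of a prior moving average and a min-max normalization. It does not use the SPMF API, and the exact definitions used by SPMF (for instance, how the first points of a moving average are handled) may differ. The class and method names are assumptions made for illustration.

```java
import java.util.Arrays;

// Illustrative sketch only: this is NOT the SPMF implementation or API.
public class TimeSeriesSketch {

    // Prior moving average (one common definition): each output point is the mean
    // of the current value and up to (window - 1) values that precede it.
    static double[] priorMovingAverage(double[] series, int window) {
        double[] result = new double[series.length];
        double sum = 0.0;
        for (int i = 0; i < series.length; i++) {
            sum += series[i];
            if (i >= window) {
                sum -= series[i - window]; // drop the value that left the window
            }
            int count = Math.min(i + 1, window);
            result[i] = sum / count;
        }
        return result;
    }

    // Min-max normalization: rescale all values linearly to the interval [0, 1].
    // A constant series is mapped to all zeros (a choice made for this sketch).
    static double[] minMaxNormalize(double[] series) {
        double min = Arrays.stream(series).min().orElse(0.0);
        double max = Arrays.stream(series).max().orElse(0.0);
        double range = max - min;
        double[] result = new double[series.length];
        for (int i = 0; i < series.length; i++) {
            result[i] = (range == 0.0) ? 0.0 : (series[i] - min) / range;
        }
        return result;
    }

    public static void main(String[] args) {
        double[] series = {2.0, 4.0, 6.0, 8.0};
        System.out.println(Arrays.toString(priorMovingAverage(series, 2)));
        System.out.println(Arrays.toString(minMaxNormalize(series)));
    }
}
```

With a window of 2, the moving average of the series 2, 4, 6, 8 is 2, 3, 5, 7: the first point has no predecessor, so it is averaged on its own.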

Classification

Example 198 : Creating a decision tree with the ID3 algorithm to predict the value of a target attribute (source code version only)

Tools

Copyright © 2008-2024 Philippe Fournier-Viger. All rights reserved.