The Pattern Mining Course (BETA)

banner

Philippe Fournier-Viger
Distinguished professor, Ph.D.
https://www.philippe-fournier-viger.com

Introduction

This is a free online course about pattern mining. It is designed to introduce students or researchers to the different topics of pattern mining, and explain the key algorithms and key concepts.

Pattern mining is a subfield of data mining that aim at applying algorithms to discover interesting patterns in data. These patterns can be used to understand the data or to support decision-making or tasks such as prediction.

This course consists of multiple lectures, where some videos are provided for each lecture. In general, it is not necessary to watch all the content. Someone could skip some topics as needed..

Note: this is a beta version of the course. I will add videos to explain more topics, when I have time, and more exercises with answers. Thus, this page will evolve over time with more content.

If you have any comments or suggestions, you may send me an e-mail or post a message in the data mining forum.

Lectures

#

Topic

Exercises

1



Introduction
2

Frequent itemset mining and association rule mining

Additional ressource(s):
  • Fournier-Viger, P., Lin, J. C.-W., Vo, B, Chi, T.T., Zhang, J., Le, H. B. (2017). A Survey of Itemset Mining. WIREs Data Mining and Knowledge Discovery, Wiley, e1207 doi: 10.1002/widm.1207, 18 pages.
  • Luna, J. M., Fournier-Viger, P., Ventura, S. (2019). Frequent Itemset Mining: a 25 Years Review. WIREs Data Mining and Knowledge Discovery, Wiley, 9(6):e1329. DOI: 10.1002/widm.1329
3

Concise representations of pattern

  • Maximal, closed and generator itemsets (pdf / ppt / video - 50 min)
  • 4

    Rare Pattern Mining

    5

    High Utility Itemset Mining
    • Questions about high utility itemsets
    6 Sequential pattern mining

    Additional ressource(s):

    • Questions about sequential pattern mining

    7

    Episode Mining

    • ...
    ...
    8

    Subgraph Mining

    ...
    9

    Other topics

    • Periodic pattern mining
    • Interactive pattern mining
    • Classification using patterns
      ...
    ...

    Software and datasets

    To try the different pattern mining algorithms discussed in this course, you can download the SPMF data mining software. SPMF offers over 230 algorithms with their source code in Java. It can also be called from other programming languages through unoficial wrappers. Besides, you can find several public datasets to try the algorithms from SPMF on the datasets page of SPMF

    open-source data mining software

    More videos on pattern mining

    If you want to see more videos on pattern mining, you may also check:
    - The video page on the SPMF website: SPMF: A Java Open-Source Data Mining Library (philippe-fournier-viger.com)
    - My Youtube channel: https://www.youtube.com/channel/UCk26EiKTBxk1NAQniOV_oyQ/

    Bibliography

    This course is based on content from research articles mentioned in the PPTs and PDFs and also some information from those books:

    1. Han and Kamber (2011), Data Mining: Concepts and Techniques, 3rd edition, Morgan Kaufmann Publishers,
    2. Tan, Steinbach & Kumar (2006), Introduction to Data Mining, Pearson education, ISBN-10: 0321321367.
    3. Data Mining: The Textbook by Aggarwal (2015)
    4. Data Mining and Analysis Fundamental Concepts and Algorithms by Zaki & Meira (2014)