1 / 11

Summarization of Frequent Pattern Mining

Summarization of Frequent Pattern Mining . What is FPM?. Why being frequent is so important? Application of FPM Decision make/Business Software Debugging Bioinformatics Other data mining tasks Indexing Clustering/Classification/Association Rule . What have been done.

Mia_John
Download Presentation

Summarization of Frequent Pattern Mining

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Summarization of Frequent Pattern Mining

  2. What is FPM? • Why being frequent is so important? • Application of FPM • Decision make/Business • Software Debugging • Bioinformatics • Other data mining tasks • Indexing • Clustering/Classification/Association Rule

  3. What have been done • Frequent Itemset Mining • Frequent Sequential Pattern Mining • Frequent Subgraph Mining • Frequent Tree Mining • Mining A Single Large Graph • Frequent motifs

  4. FPM is a way to think E A A E B B A A A B B A A B B F E A A B C E A B F C C D D F C C D C D C D D A D F D C A B D C

  5. Algorithm Foundations • Apriori Property • Enumeration Algorithm • Level-wise search • Depth-first search • Data structure • For Patterns • For Data

  6. Lattice

  7. Apriori R. Agrawal and R. Srikant. Fast algorithms for mining association rules. VLDB, 487-499, 1994

  8. Resource and Tools • Important FPM websites • FIMI workshop website • http://fimi.cs.helsinki.fi/ • Mining Structure Data website • http://hms.liacs.nl/graphs.html • Commercial Databases • Oracle, IBM DB2, SQL Server • General Data Mining Information • KDDNuggets (general/job/software, etc) • Weka (www.cs.waikato.ac.nz/ml/weka/)

  9. Why FPM does not work? • Too many patterns? • What can we do? • Pattern Pruning • Additional constraints? • Pattern summarization • Representative Patterns? • Pattern Ranking

  10. What is missing • The common foundation for FPM, clustering, classification, etc… • FPM formalization • language/compiler/automatic discovery • FPM understanding • How and why they are being generated? • The relationship between dataset and pattern

  11. How FIM relate to the underlying structure of the dataset?

More Related