1 / 19

Mining Relationships Among Interval-based Events for Classification

Mining Relationships Among Interval-based Events for Classification. Dhaval Patel 、 Wynne Hsu Mong 、 Li Lee SIGMOD 08. Outline. Introduction Preliminaries Augment hierarchical representation Interval-based event mining Interval-based event classifier Experiment Conclusion.

Download Presentation

Mining Relationships Among Interval-based Events for Classification

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Mining Relationships Among Interval-based Events for Classification Dhaval Patel、Wynne Hsu Mong、Li Lee SIGMOD 08

  2. Outline. • Introduction • Preliminaries • Augment hierarchical representation • Interval-based event mining • Interval-based event classifier • Experiment • Conclusion

  3. Introduction. • Predicts categorical class labels • Classifies data (constructs a model) based on the training set and the values (class labels) in a classifying attribute and uses it in classifying new data • A Two-Step Process Model construction Model usage

  4. Introduction.(cont)

  5. Introduction.(cont)

  6. age? <=30 overcast >40 31..40 student? credit rating? yes excellent fair no yes no yes no yes Introduction.(cont)

  7. Preliminaries. • E = (type, start, end) • EL = {E1, E2,….., En} • The length of EL, given by |EL| is the number of events in the list. • Composite event E = (Ei R Ej) • The start time of E is given by min{ Ei.start, Ej.start } end time is max{Ei.end, Ej.end }

  8. Before Meet Overlap Start Finish Contain Equal Augment hierarchical representation.

  9. Augment hierarchical representation(cont.) • ((A overlap B) overlap C) • 1. 2. • (A Overlap[0,0,0,1,0] B) Overlap[0,0,0,1,0] C • C = contain count、F = finish by count M = meet count、O=overlap count S = start count

  10. Augment hierarchical representation(cont.)

  11. Augment hierarchical representation(cont.) • The linear ordering of is {{A+}{B+}{C+}{A−}{B−}{D+}{D−}{C−}}

  12. Interval-based event mining. • Candidate generation • Theorem. A (k+1)-pattern is a candidate pattern if it is generated from a frequent k-pattern and a 2-pattern where the 2-pattern occurs in at least k − 1 frequent k-patterns. • Dominant event Dominant event in the pattern P if it occurs in P and has the latest end time among all the events in P.

  13. Interval-based event mining(cont.)

  14. Interval-based event mining(cont.) • Support count

  15. IEClassifier. • Class labels Ci 1≦i ≦c, c is the number of class label • The information gain: • p(TP) is probability of pattern TP to occur in datasets. • Whose information gain values are below a predefined info_gain threshold are removed.

  16. IEClassifier.(cont) • Let PatternMatchI be the set of discriminating patterns that are contained in I

  17. Experiment.

  18. Experiment.(cont) • 對於一群資料而言,有時候我們會希望依據資料的一些特性來將這群資料分為兩群。而就資料分群而言,我們已知有一些效果不錯的方法。例如:Nearest Neighbor、類神經網路(Neural Networks)、Decision Tree等等方式,而如果在正確的使用的前提之下,這些方式的準確率相去不遠,然而,SVM 的優勢在於使用上較為容易。 • 我們希望能夠在該空間之中找出一Hyper-plan,並且,希望此Hyper-plan可以將這群資料切成兩群。

  19. Conclusion. • IEMiner algorithm • IEClassification • The performance improved • It achieved the best accuracy

More Related