240 likes | 260 Views
Big (graph) data analytics. Christos Faloutsos CMU. Outline. Problem definition / Motivation Anomaly detection Time series analysis Conclusions. Motivation. Data mining: ~ find patterns (rules, outliers) How do real graphs look like? Anomalies? Time series / Monitoring.
E N D
Big (graph) data analytics Christos Faloutsos CMU
Outline • Problem definition / Motivation • Anomaly detection • Time series analysis • Conclusions C. Faloutsos
Motivation • Data mining: ~ find patterns (rules, outliers) • How do real graphs look like? Anomalies? • Time series / Monitoring Measles @ PA, NY, … C. Faloutsos
Graphs - why should we care? C. Faloutsos
Graphs - why should we care? Food Web [Martinez ’91] ~1B users $10-$100B revenue Internet Map [lumeta.com] C. Faloutsos
Outline • Problem definition / Motivation • Anomaly/fraud detection • Financial fraud • Ebay fraud • Time Series Analysis • Conclusions C. Faloutsos
Network Effect Tools: SNARE • Some accounts are sort-of-suspicious – how to combine weak signals? Before C. Faloutsos
Network Effect Tools: SNARE • A: Belief Propagation. Before C. Faloutsos
Network Effect Tools: SNARE • A: Belief Propagation. Before After Mary McGlohon, Stephen Bay, Markus G. Anderle, David M. Steier, Christos Faloutsos: SNARE: a link analytic system for graph labeling and risk detection. KDD 2009: 1265-1274 C. Faloutsos
Network Effect Tools: SNARE • Produces improvement over simply using flags • Up to 6.5 lift • Improvement especially for low false positive rate Results for accounts data (ROC Curve) Ideal SNARE Baseline (flags only) True positive rate C. Faloutsos False positive rate
Network Effect Tools: SNARE • Accurate- Produces large improvement over simply using flags • Flexible- Can be applied to other domains • Scalable- One iteration BP runs in linear time (# edges) • Robust- Works on large range of parameters C. Faloutsos
Outline • Problem definition / Motivation • Anomaly/fraud detection • Financial fraud • Ebay fraud • Time series analysis • Conclusions C. Faloutsos
E-bay Fraud detection Detects ‘non-delivery’ fraud: seller takes $$ and disappears Shashank Pandit, Duen Horng Chau, Samuel Wang, and Christos Faloutsos. NetProbe: A Fast and Scalable System for Fraud Detection in Online Auction Networks WWW 07. C. Faloutsos
E-bay Fraud detection - NetProbe C. Faloutsos
‘Tycho’ – epidemics analysis Yasuko Matsubara 50 states x 46 diseases C. Faloutsos
‘Tycho’ – epidemics analysis Prof. Yasuko Matsubara C. Faloutsos
‘Tycho’ – epidemics analysis Prof. Yasuko Matsubara Flu? Measles? August? No periodicity? C. Faloutsos
‘Tycho’ – epidemics analysis Prof. Yasuko Matsubara Flu? Measles? August? No periodicity? C. Faloutsos
‘Tycho’ – epidemics analysis Prof. Yasuko Matsubara Flu? Measles? August? No periodicity? C. Faloutsos
‘Tycho’ – epidemics analysis Prof. Yasuko Matsubara Flu? Measles? August? No periodicity? C. Faloutsos
‘Tycho’ – epidemics analysis Prof. Yasuko Matsubara Flu? Measles? August? No periodicity? C. Faloutsos
‘Tycho’ – epidemics analysis Prof. Yasuko Matsubara https://www.tycho.pitt.edu/resources.php from U. Pitt (epidemiology dept.) Yasuko Matsubara, Yasushi Sakurai, Willem van Panhuis, and Christos Faloutsos, FUNNEL: Automatic Mining of Spatially Coevolving Epidemics, KDD 2014, New York City, NY, USA, Aug. 24-27, 2014. C. Faloutsos
Open research questions • Patterns/anomalies for time-evolving graphs (Call graph, 3M people x 6mo) • Spot fraudsters in soc-net (eg., Twitter ‘$10 -> 1000 followers’) C. Faloutsos
Contact info • www.cs.cmu.edu/~christos • GHC 8019 • Ph#: x8.1457 C. Faloutsos