160 likes | 306 Views
Predictive Blacklisting as an Implicit Recommendation System. Authors: Fabio Soldo, Anh Le, Athina Markopoulou IEEE INFOCOM 2010 Reporter: Jing Chiu Advisor: Yuh-Jye Lee Email: D9815013@mail.ntust.edu.tw. Outlines. Introduction Blacklists Recommendation System Related Works LWOL GWOL
E N D
Predictive Blacklisting as an Implicit Recommendation System Authors: Fabio Soldo, Anh Le, Athina Markopoulou IEEE INFOCOM 2010 Reporter: Jing Chiu Advisor: Yuh-Jye Lee Email: D9815013@mail.ntust.edu.tw Data Mining & Machine Learning Lab
Outlines • Introduction • Blacklists • Recommendation System • Related Works • LWOL • GWOL • HPB • Room for improvement • DSHIELD Dataset Observation • Model Overview • Time Series • EWMA • Neighborhood Model • kNN • CA • Evaluation • Conclusions Data Mining & Machine Learning Lab
Introduction • Blacklists • Recommendation System Data Mining & Machine Learning Lab
Related Works • Local Worst Offender List(LWOL) • Global Worst Offender List(GWOL) • Highly Predictive Blacklisting(HPB) • J. Zhang, P. Porras, and J. Ullrich, “Highly predictive blacklisting,” in Proc. of USENIX Security ’08 (Best Paper award), San Jose, CA, USA, Jul. 2008, pp. 107–122. Data Mining & Machine Learning Lab
Room for improvement Data Mining & Machine Learning Lab
DSHIELD Dataset Observation Data Mining & Machine Learning Lab
DSHIELD Dataset Observation(cont.) Data Mining & Machine Learning Lab
DSHIELD Dataset Observation(cont.) Data Mining & Machine Learning Lab
Model Overview • Time Series for Attack Prediction • Exponential Weighted Moving Average(EWMA) • Neighborhood Model • Victim Neighborhood (kNN) • k-nearest neighbor • Pearson correlation as similarity metric • Joint Attacker-Victim Neighborhood (CA) • cross-associations • Fully automatic clustering algorithm that finds row and column groups of sparce binary matrices Data Mining & Machine Learning Lab
Evaluations • Local approaches • Global (neighborhood) approaches • Proposed combined method • Robustness Data Mining & Machine Learning Lab
Evaluations (cont.) Data Mining & Machine Learning Lab
Evaluations (cont.) Data Mining & Machine Learning Lab
Evaluations (cont.) Data Mining & Machine Learning Lab
Evaluations (cont.) Data Mining & Machine Learning Lab
Conclusions • Frame the problem as an implicit recommendation system • Analyze a real dataset of 1-month logs from Dshield.rg • Shows that even larger improvement can be obtained • Give a methodological development with improvement over state-of-the-art. Data Mining & Machine Learning Lab
Thanks for your attention • Questions? Data Mining & Machine Learning Lab