1 / 68

Dynamical Processes on Large Networks

Explore the study of dynamical processes on large networks and its applications in various domains such as epidemiology, viral marketing, cyber security, human mobility, games, and ecology.

hsmith
Download Presentation

Dynamical Processes on Large Networks

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Dynamical Processes on Large Networks B. AdityaPrakash http://www.cs.cmu.edu/~badityap Carnegie Mellon University MMS, SIAM AN, Minneapolis, July 10, 2012

  2. Thanks ! • Jeremy Kepner • David Bader • John Gilbert

  3. Networks are everywhere! Facebook Network [2010] Gene Regulatory Network [Decourty 2008] Human Disease Network [Barabasi 2007] The Internet [2005]

  4. Dynamical Processes over networks are also everywhere!

  5. Why do we care? • Online Information Diffusion • Viral Marketing • Epidemiology and Public Health • Cyber Security • Human mobility • Games and Virtual Worlds • Ecology ........

  6. Why do we care? (1: Epidemiology) • Dynamical Processes over networks [AJPH 2007] CDC data: Visualization of the first 35 tuberculosis (TB) patients and their 1039 contacts Diseases over contact networks

  7. Why do we care? (2: Online Diffusion) • Dynamical Processes over networks Buy Versace™! Followers Celebrity Social Media Marketing

  8. Why do we care? (3: To change the world?) • Dynamical Processes over networks Social networks and Collaborative Action

  9. High Impact – Multiple Settings epidemic out-breaks Q. How to squash rumors faster? Q. How do opinions spread? Q. How to market better? products/viruses transmit s/w patches

  10. Research Theme ANALYSIS Understanding POLICY/ ACTION Managing DATA Large real-world networks & processes

  11. Research Theme – Public Health ANALYSIS Will an epidemic happen? POLICY/ ACTION How to control out-breaks? DATA Modeling # patient transfers

  12. Research Theme – Social Media ANALYSIS # cascades in future? POLICY/ ACTION How to market better? DATA Modeling Tweets spreading

  13. In this talk Q1: How do bursts look like? Q2: How do #hashtags spread? DATA Large real-world networks & processes

  14. In this talk Given propagation models: Q2: Will an epidemic happen? ANALYSIS Understanding

  15. In this talk Q3: How to immunize and control out-breaks better? POLICY/ ACTION Managing

  16. Outline • Motivation • Learning Models (Empirical Studies) • Rise and Fall patterns • Twitter hashtags • Epidemics: what happens? (Theory) • Action: Who to immunize? (Algorithms) • Conclusion

  17. Rise and fall patterns in social media • Meme (# of mentions in blogs) • short phrases Sourced from U.S. politics in 2008 “you can put lipstick on a pig” “yes we can” Thanks to Yasuko Matsubara for some slides

  18. Rise and fall patterns in social media • Can we find a unifying model, which includes these patterns? • four classes on YouTube [Crane et al. ’08] • sixclasses on Meme [Yang et al. ’11]

  19. Rise and fall patterns in social media • Answer: YES! • We can represent all patterns by single model In SIGKDD 2012

  20. Main idea - SpikeM • 1. Un-informed bloggers (uninformed about rumor) • 2. External shock at time nb(e.g, breaking news) • 3. Infection(word-of-mouth) β Time n=0 Time n=nb Time n=nb+1 • Infectiveness of a blog-post at age n: • Strength of infection (quality of news) • Decay function (how infective a blog posting is) Power Law

  21. SpikeM - with periodicity • Full equation of SpikeM Periodicity 12pm Peak activity Bloggers change their activity over time (e.g., daily, weekly, yearly) 3am Low activity activity Time n

  22. Details • Analysis – exponential rise and power-raw fall Rise-part SI -> exponential SpikeM -> exponential Liner-log Log-log

  23. Details • Analysis – exponential rise and power-raw fall Fall-part SI -> exponential SpikeM -> power law Liner-log Log-log

  24. Tail-part forecasts • SpikeMcan capture tail part

  25. “What-if” forecasting e.g., given (1) first spike, (2) release date of two sequel movies (3) access volume before the release date (1) First spike (2) Release date (3) Two weeks before release ? ?

  26. “What-if” forecasting • SpikeM can forecast not only tail-part, but also rise-part! • SpikeMcan forecast upcoming spikes (1) First spike (2) Release date (3) Two weeks before release

  27. Outline • Motivation • Learning Models (Empirical Studies) • Rise and Fall patterns • Twitter hashtags • Epidemics: what happens? (Theory) • Action: Who to immunize? (Algorithms) • Conclusion

  28. Tweets Diffusion: Problem Definition • Given: • Action log of people tweeting a #hashtag • A network of users • Find: • How external influence varies with #hashtags? ?? ?? ? ? ? ?

  29. Tweet Diffusion: Data • Yahoo! Twitter firehose • More than 750 million tweets (> 10 Tera-bytes) • Test-bed of >6000 machines • Hadoop+PIG system ver 0.20.204.0 • Took top 500 hashtags (by volume) in Feb 2011 • Network of users: • connecting user X to user Y if X directed at least 3 @-messages to Y (or RT-ed a tweet) Under Review

  30. Tweet Diffusion • Propagation = Influence + External • Developed a model • takes the previous observations into account • with parameters representing external influence • Learn from previous data • EM-style alternating minimizing algorithm • Group tags according to learnt params

  31. Results: External Influence vs Time “External Effects” Long-running tags #nowwatching, #nowplaying, #epictweets “Word-of-mouth” #purpleglasses, #brits, #famouslies Bursty, external events #oscar, #25jan #openfollow, #ihatequotes, #tweetmyjobs “Word-of-mouth” Not trending time Can also use for Forecasting, Anomaly Detection!

  32. Outline • Motivation • Learning Models (Empirical Studies) • Epidemics: what happens? (Theory) • Action: Who to immunize? (Algorithms) • Conclusion

  33. A fundamental question Strong Virus Epidemic?

  34. example (static graph) Weak Virus Epidemic?

  35. Problem Statement # Infected above (epidemic) below (extinction) time Separate the regimes? Find, a condition under which • virus will die out exponentially quickly • regardless of initial infection condition

  36. Threshold: Why important? • Accelerating simulations • Forecasting (‘What-if’ scenarios) • Design of contagion and/or topology • A great handle to manipulate the spreading • Immunization • Maximize collaboration …..

  37. Outline • Motivation • Epidemics: what happens? (Theory) • Background • Result and Intuition (Static Graphs) • Proof Ideas (Static Graphs) • Bonus : Competing Viruses • Action: Who to immunize? (Algorithms) • Learning Models: Twitter (Empirical Studies)

  38. Background “SIR” model: life immunity (mumps) • Each node in the graph is in one of three states • Susceptible (i.e. healthy) • Infected • Removed (i.e. can’t get infected again) Prob. β Prob. δ t = 1 t = 2 t = 3

  39. Background Terminology: continued • Other virus propagation models (“VPM”) • SIS : susceptible-infected-susceptible, flu-like • SIRS : temporary immunity, like pertussis • SEIR : mumps-like, with virus incubation (E = Exposed) ….…………. • Underlying contact-network – ‘who-can-infect-whom’

  40. Background Related Work • All are about either: • Structured topologies (cliques, block-diagonals, hierarchies, random) • Specific virus propagation models • Static graphs • R. M. Anderson and R. M. May. Infectious Diseases of Humans. Oxford University Press, 1991. • A. Barrat, M. Barthélemy, and A. Vespignani. Dynamical Processes on Complex Networks. Cambridge University Press, 2010. • F. M. Bass. A new product growth for model consumer durables. Management Science, 15(5):215–227, 1969. • D. Chakrabarti, Y. Wang, C. Wang, J. Leskovec, and C. Faloutsos. Epidemic thresholds in real networks. ACM TISSEC, 10(4), 2008. • D. Easley and J. Kleinberg. Networks, Crowds, and Markets: Reasoning About a Highly Connected World. Cambridge University Press, 2010. • A. Ganesh, L. Massoulie, and D. Towsley. The effect of network topology in spread of epidemics. IEEE INFOCOM, 2005. • Y. Hayashi, M. Minoura, and J. Matsukubo. Recoverable prevalence in growing scale-free networks and the effective immunization. arXiv:cond-at/0305549 v2, Aug. 6 2003. • H. W. Hethcote. The mathematics of infectious diseases. SIAM Review, 42, 2000. • H. W. Hethcote and J. A. Yorke. Gonorrhea transmission dynamics and control. Springer Lecture Notes in Biomathematics, 46, 1984. • J. O. Kephart and S. R. White. Directed-graph epidemiological models of computer viruses. IEEE Computer Society Symposium on Research in Security and Privacy, 1991. • J. O. Kephart and S. R. White. Measuring and modeling computer virus prevalence. IEEE Computer Society Symposium on Research in Security and Privacy, 1993. • R. Pastor-Santorras and A. Vespignani. Epidemic spreading in scale-free networks. Physical Review Letters 86, 14, 2001. • ……… • ……… • ………

  41. Outline • Motivation • Epidemics: what happens? (Theory) • Background • Result and Intuition (Static Graphs) • Proof Ideas (Static Graphs) • Bonus : Competing Viruses • Action: Who to immunize? (Algorithms) • Learning Models: Twitter (Empirical Studies)

  42. How should the answer look like? ….. • Answer should depend on: • Graph • Virus Propagation Model (VPM) • But how?? • Graph – average degree? max. degree? diameter? • VPM – which parameters? • How to combine – linear? quadratic? exponential?

  43. Static Graphs: Our Main Result • Informally, • For, • any arbitrary topology (adjacency • matrix A) • any virus propagation model (VPM) in • standard literature • the epidemic threshold depends only • on the λ,firsteigenvalueof A,and • some constant , determined by the virus propagation model λ • No epidemic if λ * < 1 In Prakash+ ICDM 2011 (Selected among best papers).

  44. Our thresholds for some models s = effective strength s < 1 : below threshold

  45. Our result: Intuition for λ “Official” definition: “Un-official” Intuition  λ ~ # paths in the graph • Let A be the adjacency matrix. Then λ is the root with the largest magnitude of the characteristic polynomial of A [det(A – xI)]. • Doesn’t give much intuition! u u ≈ . (i, j) = # of paths i j of length k

  46. Largest Eigenvalue (λ) better connectivity higher λ λ ≈ 2 λ = N λ = N-1 λ ≈ 2 λ= 31.67 λ= 999 N = 1000 N nodes

  47. Examples: Simulations – SIRS (pertusis) Fraction of Infections Footprint (a) Infection profile (b) “Take-off” plot PORTLAND graph 31 million links, 6 million nodes Time ticks Effective Strength

  48. Outline • Motivation • Epidemics: what happens? (Theory) • Background • Result and Intuition (Static Graphs) • Proof Ideas (Static Graphs) • Bonus : Competing Viruses • Action: Who to immunize? (Algorithms) • Learning Models: Twitter (Empirical Studies)

  49. General VPM structure Model-based See paper for full proof λ * < 1 Graph-based Topology and stability

  50. Outline • Motivation • Epidemics: what happens? (Theory) • Background • Result and Intuition (Static Graphs) • Proof Ideas (Static Graphs) • Bonus : Competing Viruses • Action: Who to immunize? (Algorithms) • Learning Models: Twitter (Empirical Studies)

More Related