1.29k likes | 1.41k Views
Finding patterns in large, real networks. Christos Faloutsos CMU. Thanks to. Deepayan Chakrabarti (CMU/Yahoo) Michalis Faloutsos (UCR) Spiros Papadimitriou (CMU/IBM) George Siganos (UCR). Introduction. Protein Interactions [genomebiology.com]. Internet Map [lumeta.com].
E N D
Finding patterns in large, real networks Christos Faloutsos CMU
Thanks to • Deepayan Chakrabarti (CMU/Yahoo) • Michalis Faloutsos (UCR) • Spiros Papadimitriou (CMU/IBM) • George Siganos (UCR) C. Faloutsos
Introduction Protein Interactions [genomebiology.com] Internet Map [lumeta.com] Food Web [Martinez ’91] Graphs are everywhere! Friendship Network [Moody ’01] C. Faloutsos
Outline • Laws • static • time-evolution laws • why so many power laws? • tools/results • future steps C. Faloutsos
Motivating questions • Q1: What do real graphs look like? • Q2: How to generate realistic graphs? C. Faloutsos
Virus propagation • Q3: How do viruses/rumors propagate? • Q4: Who is the best person/computer to immunize against a virus? C. Faloutsos
Graph clustering & mining • Q5: which edges/nodes are ‘abnormal’? • Q6: split a graph in k ‘natural’ communities - but how to determine k? C. Faloutsos
Outline • Laws • static • time-evolution laws • why so many power laws? • tools/results • future steps C. Faloutsos
Topology Q1: How does the Internet look like? Any rules? (Looks random – right?) C. Faloutsos
Laws – degree distributions • Q: avg degree is ~2 - what is the most probable degree? count ?? degree 2 C. Faloutsos
WRONG ! count ?? count degree 2 2 Laws – degree distributions • Q: avg degree is ~2 - what is the most probable degree? degree C. Faloutsos
I.Power-law: outdegree O The plot is linear in log-log scale [FFF’99] freq = degree (-2.15) Frequency Exponent = slope O = -2.15 -2.15 Nov’97 Outdegree C. Faloutsos
-0.82 att.com log(degree) ibm.com log(rank) II.Power-law: rank R • The plot is a line in log-log scale April’98 R Rank: nodes in decreasing degree order C. Faloutsos
III.Power-law: eigen E Eigenvalue • Eigenvalues in decreasing order (first 20) • [Mihail+, 02]: R = 2 * E Exponent = slope E = -0.48 Dec’98 Rank of decreasing eigenvalue C. Faloutsos
IV. The Node Neighborhood • N(h) = # of pairs of nodes within h hops C. Faloutsos
IV. The Node Neighborhood • Q: average degree = 3 - how many neighbors should I expect within 1,2,… h hops? • Potential answer: 1 hop -> 3 neighbors 2 hops -> 3 * 3 … h hops -> 3h C. Faloutsos
IV. The Node Neighborhood • Q: average degree = 3 - how many neighbors should I expect within 1,2,… h hops? • Potential answer: 1 hop -> 3 neighbors 2 hops -> 3 * 3 … h hops -> 3h WRONG! WE HAVE DUPLICATES! C. Faloutsos
IV. The Node Neighborhood • Q: average degree = 3 - how many neighbors should I expect within 1,2,… h hops? • Potential answer: 1 hop -> 3 neighbors 2 hops -> 3 * 3 … h hops -> 3h WRONG x 2! ‘avg’ degree: meaningless! C. Faloutsos
IV. Power-law: hopplot H H = 2.83 # of Pairs H = 4.86 # of Pairs Pairs of nodes as a function of hops N(h)= hH Hops Router level ’95 Dec 98 Hops C. Faloutsos
... Observation • Q: Intuition behind ‘hop exponent’? • A: ‘intrinsic=fractal dimensionality’ of the network N(h) ~ h1 N(h) ~ h2 C. Faloutsos
Hop plots • More on fractal/intrinsic dimensionalities: very soon C. Faloutsos
Any other ‘laws’? Yes! C. Faloutsos
Any other ‘laws’? Yes! • Small diameter • six degrees of separation / ‘Kevin Bacon’ • small worlds [Watts and Strogatz] C. Faloutsos
Any other ‘laws’? • Bow-tie, for the web [Kumar+ ‘99] • IN, SCC, OUT, ‘tendrils’ • disconnected components C. Faloutsos
Any other ‘laws’? • power-laws in communities (bi-partite cores) [Kumar+, ‘99] Log(count) n:1 n:3 n:2 2:3 core (m:n core) Log(m) C. Faloutsos
More Power laws • Also hold for other web graphs [Barabasi+, ‘99], [Kumar+, ‘99] • citation graphs (see later) • and many more C. Faloutsos
Outline • Laws • static • time-evolution laws • why so many power laws? • tools/results • future steps C. Faloutsos
How do graphs evolve? C. Faloutsos
Time Evolution: rank R Domain level #days since Nov. ‘97 The rank exponent has not changed! [Siganos+, ‘03] C. Faloutsos
How do graphs evolve? • degree-exponent seems constant - anything else? C. Faloutsos
Evolution of diameter? • Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(N))) • i.e.., slowly increasing with network size • Q: What is happening, in reality? C. Faloutsos
Evolution of diameter? • Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(N))) • i.e.., slowly increasing with network size • Q: What is happening, in reality? • A: It shrinks(!!), towards a constant value C. Faloutsos
Shrinking diameter [Leskovec+05a] • Citations among physics papers • 11yrs; @ 2003: • 29,555 papers • 352,807 citations • For each month M, create a graph of all citations up to month M C. Faloutsos
Shrinking diameter • Authors & publications • 1992 • 318 nodes • 272 edges • 2002 • 60,000 nodes • 20,000 authors • 38,000 papers • 133,000 edges C. Faloutsos
Shrinking diameter • Patents & citations • 1975 • 334,000 nodes • 676,000 edges • 1999 • 2.9 million nodes • 16.5 million edges • Each year is a datapoint C. Faloutsos
Shrinking diameter • Autonomous systems • 1997 • 3,000 nodes • 10,000 edges • 2000 • 6,000 nodes • 26,000 edges • One graph per day C. Faloutsos
Temporal evolution of graphs • N(t) nodes; E(t) edges at time t • suppose that N(t+1) = 2 * N(t) • Q: what is your guess for E(t+1) =? 2 * E(t) C. Faloutsos
Temporal evolution of graphs • N(t) nodes; E(t) edges at time t • suppose that N(t+1) = 2 * N(t) • Q: what is your guess for E(t+1) =? 2 * E(t) • A: over-doubled! x C. Faloutsos
Temporal evolution of graphs • A: over-doubled - but obeying: E(t) ~ N(t)a for all t where 1<a<2 C. Faloutsos
Temporal evolution of graphs • A: over-doubled - but obeying: E(t) ~ N(t)a for all t • Identically: log(E(t)) / log(N(t)) = constant for all t C. Faloutsos
E(t) N(t) Densification Power Law ArXiv: Physics papers and their citations 1.69 C. Faloutsos
E(t) N(t) Densification Power Law ArXiv: Physics papers and their citations 1 1.69 ‘tree’ C. Faloutsos
E(t) N(t) Densification Power Law ArXiv: Physics papers and their citations ‘clique’ 2 1.69 C. Faloutsos
E(t) N(t) Densification Power Law U.S. Patents, citing each other 1.66 C. Faloutsos
E(t) N(t) Densification Power Law Autonomous Systems 1.18 C. Faloutsos
E(t) N(t) Densification Power Law ArXiv: authors & papers 1.15 C. Faloutsos
Summary of ‘laws’ • Power laws for degree distributions • …………... for eigenvalues, bi-partite cores • Small & shrinking diameter (‘6 degrees’) • ‘Bow-tie’ for web; ‘jelly-fish’ for internet • ``Densification Power Law’’, over time C. Faloutsos
Outline • Laws • static • time-evolution laws • why so many power laws? • other power laws • fractals & self-similarity • case study: Kronecker graphs • tools/results • future steps C. Faloutsos
Power laws • Many more power laws, in diverse settings: C. Faloutsos
A famous power law: Zipf’s law log(freq) “a” • Bible - rank vs frequency (log-log) “the” log(rank) C. Faloutsos