360 likes | 527 Views
“Expansion” in Power Law and Scale Free Graphs. Milena Mihail Georgia Tech with Christos Gkantsidis, Christos Papadimitriou and Amin Saberi. Graphs with Skewed Degree Sequences. Communication Networks
E N D
“Expansion” in Power Law and Scale Free Graphs Milena Mihail Georgia Tech with Christos Gkantsidis, Christos Papadimitriou and Amin Saberi
Graphs with Skewed Degree Sequences Communication Networks This Talk: Algorithmic Issues, “Expansion”,spectral gap determine performace of key algorithms. Metabolic Networks
Sprint AT&T How does Congestion Scale? Demand: n2, uniform. What is load of max congested link, in optimal routing ? ISPs: 900-14K Routers: 500-200K WWW: 500K-3B P2P: hundred Ks
Real Internet Topologies E[degree]~3 Degrees not Concentrated around mean Not Erdos-Renyi CAIDA http://www.caida.org
Degree-Frequency Power Law Models by Kumar et al 00, x Bollobas et al 01, x Fabrikant et al 02 Erdos-Renyi sharp concentration E[d] = const., but No sharp concentration E[d] = const., but No sharp concentration frequency 1 3 4 5 2 10 100 degree
[Interdomain Routing: Faloutsos et al 99] [WWW: Kumar et al 99, Barabasi-Albert 99] Degree-Frequency Rank-Degree Eigenvalues (Adjacency Matrix) Power Laws
Models for Power Law Graphs • EVOLUTIONARY • Macroscopic : Growth & Preferential Attachment • Simon 55, Barabasi-Albert 99, Kumar et al 00, • Bollobas-Riordan 01. • Microscopic : Growth & Multiobjective Optimization, • QoS vs Cost • Fabrikant-Koutsoupias-Papadimitriou 02. • STRUCTURAL (aka CONFIGURATIONAL) • “Random” graph with “power law” degree sequence.
Structural Random Graph Model Given Choose random perfect matching over minivertices Molley&Reed 95-98, Aiello,Chung,Lu 00, Tagmunarunkit et al 02
Congestion in the “Core” Theorem [Gkantsidis,MM, Saberi 02]: For a random graph arising from degree sequence O(n½ ) ≥ d1≥d2≥…≥dn ≥3 there is a flow that routes demand di* dj between all vertices i and j with max link congestion O(n log2n) almost surely.
Proof : Step 1 : Approximation algorithms for multicommodity flow reduce congestion to conductance (special case of sparsest cut). Step 2 : Bound conductance - MAIN LEMMA.
Proof, Step 1 : Reduce to Conductance By Maximum multi-commodity flow, [Leighton & Rao 95]
Proof, Step 2 : Main Lemma [Gkantsidis,MM, Saberi 02]:
Proof of MAIN LEMMA: Stirling
Proof of MAIN LEMMA: BIG SMALL ignore Stirling
In an Evolutionary Model ?Growth with Pref. Attachment One vertex at a time New vertex attaches to d existing vertices
Reduction to Random Matching[Bollobas & Riordan 01] t=5 t=4 t=1 t=2 t=3
Reduction to Random Matching[Bollobas & Riordan 01] t=5 t=4 t=1 t=2 t=3
In an Evolutionary Model ?Growth with Pref. Attachment Theorem [MM, Saberi 02]: For a graph grown with preferential attachment with d ≥ 3 there is a flow that routes demand di* dj between all vertices i and j with max link congestion O(n log n) almost surely. Main Lemma: almost surely. Open Question: Analyze a graph grown one vertex or edge at a time, where with probability a new vertex comes and attaches preferentially and with probability a new edge grows preferentially between existing vertices.
Spectral Implication • Theorem: Eigenvalue separation • for stochastic normalization of adjacency matrix [Alon 85, • Jerrum&Sinclair 88]
Theorem: Eigenvalue separation • for stochastic normalization of adjacency matrix [Alon 85, • Jerrum&Sinclair 88] On the eigenvalue Power Law [M.M. & Papadimitriou 02] Rank-Degree Using matrix perturbation [Courant-Fisher theorem] in a sparse random graph model . Eigenvalues (Adjacency Matrix) Spectral Implications
Wwith probability at least Ffor large enough Theorem : [M.M. & Papadimitriou 02]
Proof : Step 1. Decomposition LR = Vertex Disjoint Stars - LR-extra LL RR
Proof: Step 2: Vertex Disjoint Stars Degrees of each Vertex Disjoint Stars Sharply Concentrated around its Mean d_i Hence Principal Eigenvalue Sharply Concentrated around
LL has edges Proof: Step 3: LL, RR, LR-extra LR-extra has max degree RR has max degree
LL has edges Proof: Step 3: LL, RR, LR-extra LR-extra has max degree RR has max degree
Vertex Disjoint Stars have principal eigenvalues All other parts have max eigenvalue Proof: Step 4: Matrix Perturbation Theory QED
Implication for Info Retrieval Term-Norm Distribution Problem : Spectral filtering, without preprocessing, reveals only the large degrees.
Implication for Info Retrieval Term-Norm Distribution Problem : Spectral filtering, without preprocessing, reveals only the large degrees. Local information. No “latent semantics”.
Implication for Information Retrieval Term-Norm Distribution Problem : Application specific preprocessing (normalization of degrees) reveals clusters: WWW: related to searching, Kleinberg 97 IR, collaborative filtering, … Internet: related to congestion, Gkantsidis et al 02 Open : Formalize “preprocessing”.
Further Directions: Generalize theory of Regular Expanders • Routing • Integral paths? • Short paths? • Reliability? • Cover time? Related to Crawling • Hitting time? Related to Searching • Planted model? Information Retrieval Peleg&Upfal’88 … Broder,Frieze&Upfal’01 Kleinberg&Rubinfeld’97 (Experimental work: Gkantsidis, MM, Zegura 02.)
Metabolic Networks Statistics of fixed size subgraphs? Related to “motifs” in metabolic networks. Model (explain) heavy tailed statistics in noncoding part of DNA? Related to stages of species evolution.
Evaluation of Synthetic Topology Generators Core of the Network Entire Topology