690 likes | 920 Views
Class 7: Evolving Network Models. Prof. Albert-László Barabási Dr. Baruch Barzel, Dr. Mauro Martino. Network Science: Evolving Network Models February 2012. P( k ) ~ k - . Empirical findings for real networks. Clustered : clustering coefficient does not depend on network size.
E N D
Class 7: Evolving Network Models Prof. Albert-László Barabási Dr. Baruch Barzel, Dr. Mauro Martino Network Science: Evolving Network Models February 2012
P(k) ~ k- Empirical findings for real networks Clustered: clustering coefficient does not depend on network size. Scale-free: The degrees follow a power-laws distribution. Small World: distances scale logarithmically with the network size Network Science: Evolving Network Models February 2012
BENCHMARK 1: Regular Lattices Two-dimensional lattice: Average path-length: Degree distribution: P(k)=δ(k-6) Clustering coefficient: D-dimensional lattice: The average path-length varies as Constant degree P(k)=δ(k-kd) Constant clustering coefficient C=Cd Network Science: Evolving Network Models February 2012
BENCHMARK 2: Random Network Model Erdös-Rényi Model- Publ. Math. Debrecen 6, 290 (1959) • fixed node number N • connecting pairs of nodes with • probability p Degree distribution: Path length: Clustering coefficient: Network Science: Evolving Network Models February 2012
BENCHMARK 3: Small World Model Watts-Strogatz algorithm – Nature 2008 • fixed node number N • connecting pairs of nodes with • probability p Degree distribution: Exponential Path length: Clustering coefficient: Network Science: Evolving Network Models February 2012
P(k) ~ k- EMPIRICAL DATA FOR REAL NETWORKS Regular network Pathlenght Degree Distr. Clustering P(k)=δ(k-kd) Erdos- Renyi Watts- Strogatz Exponential Network Science: Evolving Network Models February 2012
SCALE-FREE MODEL(BA model) Network Science: Evolving Network Models February 2012
BA MODEL: Growth BA model: Growth ER, WS models: the number of nodes, N, is fixed (static models) Real networks continuously expand by the addition of new nodes Barabási & Albert, Science286, 509 (1999) Network Science: Evolving Network Models February 2012
BA MODEL: Growth (Actors/Internet) BA model: Growth Actor network Internet Growth of the Internet routing table Number of movies in IMDB http://www.trainsignaltraining.com/ccna-ipv6 Herr II, Bruce W., Ke, Weimao, Hardy, Elisha, and Börner, Katy. (2007) Movies and Actors: Mapping the Internet Movie Database. In Conference Proceedings of 11th Annual Information Visualization International Conference (IV 2007), Zurich, Switzerland, July 4-6, pp. 465-469. Network Science: Evolving Network Models February 2012
BA MODEL: Growth (www/Pubs) BA model: Growth WWW Scientific Publications http://website101.com/define-ecommerce-web-terms-definitions/ http://www.kk.org/thetechnium/archives/2008/10/the_expansion_o.php Barabási & Albert, Science286, 509 (1999) Network Science: Evolving Network Models February 2012
BA MODEL: Growth BA model: Growth (1) Networks continuously expand by the addition of new nodes Add a new node with m links Barabási & Albert, Science286, 509 (1999) Network Science: Evolving Network Models February 2012
BA MODEL: Preferential Attachment Where will the new node link to? ER, WS models: choose randomly. PREFERENTIAL ATTACHMENT: the probability that a node connects to a node with k links is proportional to k. New nodes prefer to link to highly connected nodes (www, citations, IMDB). Barabási & Albert, Science286, 509 (1999) Network Science: Evolving Network Models February 2012
(1) Networks continuously expand by the addition of new nodes WWW : addition of new documents (2) New nodes prefer to link to highly connected nodes. WWW : linking to well known sites P(k) ~k-3 Origin of SF networks: Growth and preferential attachment GROWTH: add a new node with m links PREFERENTIAL ATTACHMENT: the probability that a node connects to a node with k links is proportional to k. Barabási & Albert, Science286, 509 (1999) Network Science: Evolving Network Models February 2012
All nodes follow the same growth law During a unit time (time step): Δk=m A=m Use: β: dynamical exponent A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999) Network Science: Evolving Network Models February 2012
Fitness Model: Can Latecomers Make It? SF model: k(t)~t ½(first mover advantage) Degree (k) time Network Science: Evolving Network Models February 2012
Degree distribution A node i can come with equal probability any time between ti=m0and t, hence: γ = 3 A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999) Network Science: Evolving Network Models February 2012
Degree distribution γ = 3 (i) The degree exponent is independent of m. (ii) As the power-law describes systems of rather different ages and sizes, it is expected that a correct model should provide a time-independent degree distribution. Indeed, asymptotically the degree distribution of the BA model is independent of time (and of the system size N) the network reaches a stationary scale-free state. (iii) The coefficient of the power-law distribution is proportional to m2. A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999) Network Science: Evolving Network Models February 2012
NUMERICAL SIMULATION OF THE BA MODEL m=1,3,5,7 N=100,000;150,000;200,000 m-dependence Stationarity: P(k) independent of N Insert: degree dynamics Network Science: Evolving Network Models February 2012
The mean field theory offers the correct scaling, BUT it provides the wrong coefficient of the degree distribution. So assymptotically it is correct (k ∞), but not correct in details (particularly for small k). To fix it, we need to calculate P(k) exactly, which we will do next using a rate equation based approach. Network Science: Evolving Network Models February 2012
MFT - Degree Distribution: Rate Equation Number of nodes with degree k at time t. Since at each timestep we add one node, we have N=t (total number of nodes =number of timesteps) 2m: each node adds m links, but each link contributed to the degree of 2 nodes Total number of k-nodes Number of links added to degree k nodes after the arrival of a new node: Nr. of degree k-1 nodes that acquire a new link, becoming degree k New node adds m new links to other nodes Preferential attachment Nr. of degree k nodes that acquire a new link, becoming degree k+1 Loss of k-nodes via k k+1 Gain of k-nodes via k-1 k # k-nodes at time t # k-nodes at time t+1 A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999)
MFT - Degree Distribution: Rate Equation Loss of k-nodes via k k+1 Gain of k-nodes via k-1 k # k-nodes at time t # k-nodes at time t+1 We do not have k=0,1,...,m-1 nodes in the network (each node arrives with degree m) We need a separate equation for degree m modes # m-nodes at time t # m-nodes at time t+1 Add one m-degeree node Loss of an m-node via m m+1 Network Science: Evolving Network Models February 2012 A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999)
MFT - Degree Distribution: Rate Equation k>m We assume that there is a stationary state in the N=t∞ limit, when P(k,∞)=P(k) k>m Network Science: Evolving Network Models February 2012 A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999)
MFT - Degree Distribution: Rate Equation m+3 k ... for large k Krapivsky, Redner, Leyvraz, PRL 2000 Dorogovtsev, Mendes, Samukhin, PRL 2000 Bollobas et al, Random Struc. Alg. 2001 Network Science: Evolving Network Models February 2012 A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999)
MFT - Degree Distribution: A Pretty Caveat Start from eq. Its solution is: Dorogovtsev and Mendes, 2003 Network Science: Evolving Network Models February 2012 A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999)
Do we need both growth and preferential attachment? Network Science: Evolving Network Models February 2012
growth preferential attachment MODEL A Π(ki) : uniform Network Science: Evolving Network Models February 2012
MODEL B growth preferential attachment P(k) : power law (initially) Gaussian Fully Connected Network Science: Evolving Network Models February 2012
Do we need both growth and preferential attachment?YEP. Network Science: Evolving Network Models February 2012
P(k) ~ k- EMPIRICAL DATA FOR REAL NETWORKS Regular network Pathlenght Degree Distr. Clustering P(k)=δ(k-kd) Erdos- Renyi Watts- Strogatz Exponential Barabasi-Albert P(k) ~ k- Network Science: Evolving Network Models February 2012
DISTANCES IN SCALE-FREE NETWORKS Distances in scale-free networks Size of the biggest hub is of order O(N). Most nodes can be connected within two layers of it, thus the average path length will be independent of the system size. The average path length increases slower than logarithmically. In a random network all nodes have comparable degree, thus most paths will have comparable length. In a scale-free network the vast majority of the path go through the few high degree hubs, reducing the distances between nodes. Some key models produce γ=3, so the result is of particular importance for them. This was first derived by Bollobas and collaborators for the network diameter in the context of a dynamical model, but it holds for the average path length as well. The second moment of the distribution is finite, thus in many ways the network behaves as a random network. Hence the average path length follows the result that we derived for the random network model earlier. Small World Cohen, Havlin Phys. Rev. Lett. 90, 58701(2003); Cohen, Havlin and ben-Avraham, in Handbook of Graphs and Networks, Eds. Bornholdt and Shuster (Willy-VCH, NY, 2002) Chap. 4; Confirmed also by: Dorogovtsev et al (2002), Chung and Lu (2002); (Bollobas, Riordan, 2002; Bollobas, 1985; Newman, 2001
PATH LENGTHS IN THE BA MODEL Bollobas, Riordan, 2002 Network Science: Evolving Network Models February 2012
P(k) ~ k- EMPIRICAL DATA FOR REAL NETWORKS Pathlenght Degree Distr. Clustering Regular network P(k)=δ(k-kd) Erdos- Renyi Watts- Strogatz Exponential Barabasi-Albert P(k) ~ k- Network Science: Evolving Network Models February 2012
CLUSTERING COEFFICIENT OF THE BA MODEL Reminder: for a random graph we have: The numerical results indicate a slightly slower decay. What is the functional form of C(N)? Konstantin Klemm, Victor M. Eguiluz, Growing scale-free networks with small-world behavior, Phys. Rev. E 65, 057102 (2002), cond-mat/0107607 Network Science: Evolving Network Models February 2012
CLUSTERING COEFFICIENT OF THE BA MODEL 1 2 Denote the probability to have a link between node i and j with P(i,j) The probability that three nodes i,j,l form a triangle is P(i,j)P(i,l)P(j,l) The expected number of triangles in which a node l with degree kl participates is thus: We need to calculate P(i,j). Network Science: Evolving Network Models February 2012
CLUSTERING COEFFICIENT OF THE BA MODEL Calculate P(i,j). Node j arrives at time tj=j and the probability that it will link to node i with degree ki already in the network is determined by preferential attachment: Where we used that the arrival time of node j is tj=j and the arrival time of node is ti=i Let us approximate: Which is the degree of node l at current time, at time t=N There is a factor of two difference... Where does it come from? Network Science: Evolving Network Models February 2012
Clustering Coefficient of the BA model CLUSTERING COEFFICIENT OF THE BA MODEL Konstantin Klemm, Victor M. Eguiluz, Phys. Rev. E 65, 057102 (2002) Network Science: Evolving Network Models February 2012
P(k) ~ k- EMPIRICAL DATA FOR REAL NETWORKS Pathlenght Degree Distr. Clustering Regular network P(k)=δ(k-kd) Erdos- Renyi Watts- Strogatz Exponential Barabasi-Albert P(k) ~ k- Network Science: Evolving Network Models February 2012
The origins of preferential attachment. Network Science: Evolving Network Models February 2012
Preferential Attachment: a brief history of collective amnesia Gyorgy Polya (1887-1985) 1923: Polya process in the mathematics literature George Udmy Yule (1871-1951) in 1925: the number of species per genus of flowering plants; Yule process in statistics Robert Gibrat (1904-1980), 1931: rule of proportional growth (the size of the growth and rate of a firm are independent). Gibrat process in economics George Kinsley Zipf (1902-1950), 1949: the distribution of the wealth on the society. Herbert Alexander Simon (1916-2001), 1955, the distribution of city sizes and other phenomena Derek de Solla Price (1922-1983), 1976, used it to explain the citation statistics of scientific publications, "cumulative advantage” Robert Merton (1910-2003), 1968: Matthew effect, Gospel of Matthew: "For everyone who has will be given more, and he will have an abundance. Whoever does not have, even what he has will be taken from him." "Not the first, but the last" Network Science: Evolving Network Models February 2012
CAN WE MEASURE PREFERENTIAL ATTACHMENT? Plot the change in the degree k during a fixed time t for nodes with degree k, and you get (k) To reduce noise, plot the integral of Π(k) over k: No pref. attach: κ~k Linear pref. attach: κ~k2 (Jeong, Neda, A.-L. B, Europhys Letter 2003; cond-mat/0104131) Network Science: Evolving Network Models February 2012
CAN WE MEASURE PREFERENTIAL ATTACHMENT? citation network Plots shows the integral of Π(k) over k: Internet No pref. attach: κ~k actor collab. neurosci collab Linear pref. attach: κ~k2 Network Science: Evolving Network Models February 2012
MECHANISMS RESPONSIBLE FOR PREFERENTIAL ATTACHMENT • Copying mechanism • directed network • select a node and an edge of this node • attach to the endpoint of this edge • Walking on a network • directed network • the new node connects to a node, then to every • first, second, … neighbor of this node • Attaching to edges • select an edge • attach to both endpoints of this edge • Node duplication • duplicate a node with all its edges • randomly prune edges of new node Network Science: Evolving Network Models February 2012
Copying Mechanism Network Science: Evolving Network Models February 2012
ORIGIN OF THE SCALE-FREE TOPOLOGY IN THE CELL:Gene Duplication Proteins with more interactions are more likely to obtain new links: Π(k)~k (preferential attachment) Wagner 2001; Vazquez et al. 2003; Sole et al. 2001; Rzhetsky & Gomez 2001; Qian et al. 2001; Bhan et al. 2002. Network Science: Evolving Network Models February 2012
PREFERENTIAL ATTACHMENT IN PROTEIN INTERACTION NETWORKS k vs. k : increase in the No. of links in a unit time No PA:k is independent of k PA:k ~k Eisenberg E, Levanon EY, Phys. Rev. Lett. 2003 Jeong, Neda, A.-L.B, Europhys. Lett. 2003 Network Science: Evolving Network Models February 2012
SUMMARY: PROPERTIES OF THE BA MODEL • Nr. of nodes: • Nr. of links: • Average degree: • Degree dynamics • Degree distribution: • Average Path Length: • Clustering Coefficient: β: dynamical exponent γ: degree exponent The network grows, but the degree distribution is stationary. Network Science: Evolving Network Models February 2012
DEGREE EXPONENTS γcollab γmetab γintern γsynonyms γwin γwout γactor γsex γcita γ=1 γ=2 γ=3 <k2> diverges <k2> finite BA model Can we change the degree exponent? Network Science: Evolving Network Models February 2012
Evolving network models Network Science: Evolving Network Models February 2012
EVOLVING NETWORK MODELS • The BA model is only a minimal model. • Makes the simplest assumptions: • linear growth • linear preferential attachment • Does not capture • variations in the shape of the degree distribution • variations in the degree exponent • the size-independent clustering coefficient • Hypothesis: • The BA model can be adapted to describe most features of real networks. • We need to incorporate mechanisms that are known to take place in real networks: addition of links without new nodes, link rewiring, link removal; node removal, constraints or optimization Network Science: Evolving Network Models February 2012
BA ALGORITHM WITH DIRECTED EDGES (the simplest way to change the degree exponent) Undirected BA network: Directed BA network: β=1: dynamical exponent γin=2: degree exponent; P(kout)=δ(kout-m) Undirected BA: β=1/2; γ=3 Network Science: Evolving Network Models February 2012 γ = 3 A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999)