Rapidly Mixing Markov Chains on Combinatorial Objects

Rapidly Mixing Markov Chains on Combinatorial Objects Date: 2005/4/25 Advisor: Sy-Yen Kuo Speaker: Szu-Chi Wang

Outline • Notation and Preliminaries • Rapid Mixing Markov Chains • Commonly Studied Models • Conclusions and Future Works • References

Notation • A Markov chain is specified by the transition matrix P • Let 0 be the initial distribution and t be the distribution after t steps The dynamics follows • If P is irreducible and aperiodic (viz ergodic) then tconverges to a unique stationary distribution  such that (independent of 0)

Preliminaries • Conceptually M defines a random walk over  (viz moving from one configuration to another) • Design a Markov chain that would converge quickly to the desired distribution provides a useful tool for hard sampling problems • Two questions immediately arise 1. How do we modify this chain in order to sample from a complicated distribution? 2. How long do we have to simulate the walk before we can trust our samples? (viz they are chose from a distribution very close to )

laziness factor required knowledge The Metropolis Algorithm The most celebrated technique to assign the transition probabilities of a Markov chain so that it will converge to any chosen distribution  Let  be the desired probability distribution and di be the degree of i For each neighbor j of node i let

The Convergence Time  Thus the next question to ask how quickly t converges to   Relevant metrics 1. The total variation difference between t and  is 2. For  > 0 the mixing time is defined as  A Markov chain is called rapidly mixing if is bounded above by poly(n) and

Foundations of Algebraic Graph Theory  Let G(V, E) be and n-vertex, undirected graph with max degree   Given the canonical labeling of eigenvalues i and orthonormal eigenvectors ei for the adjacency matrix A(G) 1. If G in connected, then 2 < 1 2. For 1  i  n, |i|   3.  is an eigenvalue iff G is regular 4. If G is d-regular, then the eigenvalue 1 =  has the eigenvector 5. G is bipartite iff for every eigenvalue  there is an eigenvalue  6. Suppose that G is connected, then G is bipartite iff  is an eigenvalue 7. If G is d-regular and bipartite, n =  and

The Mixing Time It is well-established that the eigenvalue gap of the transition matrix provides a good bound on the mixing rate  Let 0, 1, ||-1be the eigenvalues of P, 1 = 0> |1|  |i| for all i2 Let then for all we have  Practically, determining the eigenvalues tends to be far too difficult

Techniques for Bounding Mixing Times  Conductance For any set S  let , where is regarded as the capacity of (x, y) and The conductance is defined as  For a finite, reversible, ergodic Markov chain M with loop prob.  ½ for all states, the mixing time of M satisfies

Techniques for Bounding Mixing Times (cont.) Coupling A coupling is a Markov chain M on    defining a stochastic process with the properties: I. Each of the processes Xt and Yt is a faithful copy of M (given initial states X0 = x and Y0 = y) II. If Xt = Yt then Xt+1 = Yt+1

Techniques for Bounding Mixing Times (cont.)  Path Coupling Let  be an integer-valued metric defined on which takes values in Let S be a subset of such that for all there exist a path between Xt and Yt Suppose a Coupling of the Markov Chain M is defined on all pairs such that   < 1 s.t. for all , then the mixing time of M satisfies

Commonly Studied Model For G = (V, E), let and N(v) denote the neighbors of v A proper k-coloring is an assignment such that all adjacent vertices receive different colors The positive-recurrent states of M are the proper coloring of G and the chain is ergodic on these states

Illustration of Path Coupling rapid mixing if only updates with zN(u) and c {cx, cy} may succeed or fail in exactly one chain u u

A Cutting-Edge Study Non-uniform Random Membership Management in Peer-to-Peer Networks Ming Zhong Kai Shen Joel Seiferas INFOCOM 2005

node branch resistance Electrical Networks Solve it via Kirchhoff’s Law and Ohm’s Law 0.5 volt 0.5 volt a 1 1 b c 1.0 amp 2 1.0 volt

Electrical Networks (cont.) Given G, let N(G) be defined as (1) it has a node for each vertex in V (2) for every edge in E it has a 1.0 ohm resistance in N(G)  Use the language of electrical network theory for N(G) The effective resistanceRuv between two u, v is |volt (u) – volt (v)| when one amp is injected into u and removed from v  The commute timeCuv between two nodes u and v is the expected time for a random walk starting at u to return u after at least one visit to v

Electrical Networks (cont.)  Corollaries 1. For any two vertices u and v in G the commute time satisfies 2. Let T be any spanning tree of G and C(G) denote the cover time 3. The resistance of G characterizes its cover time:

Conclusions and Future Works Markov chain Monte Carlo serves as a computational means for approximate sampling from large and complicated sets  Future directions might include - Membership management in large-scale distributed networks - Information dissemination in sensor/mobile ad hoc networks - Reliable surveillance systems - Interdisciplinary studies (e.g. in Statistical Physics, the prob. of a configuration is related to its energy)

References [1] R. Motwani and P. Raghavan, Randomized Algorithms, Cambridge Press [2] R. Bubley and M. Dyer, Path Coupling: A technique for Proving Rapid Mixing in Markov Chains, Proc. of 38th IEEE FOCS, 1997 [3] D. Randall, Mixing, Proc. of 44th IEEE FOCS, 2003 [4] M. Zhong, K. Shen, J. Seiferas, Non-uniform Random Membership Management in Peer-to-Peer Networks, Proc. of IEEE INFOCOM, 2005

Thanks for Your Attention

Rapidly Mixing Markov Chains on Combinatorial Objects

Rapidly Mixing Markov Chains on Combinatorial Objects

Presentation Transcript

11 - Markov Chains

Markov Chains

Markov Chains

Markov Chains

Markov Chains

Markov Chains

Markov Chains

Markov Chains

Eager Markov Chains

Markov chains

Markov chains

Distributed Markov Chains

Markov Chains

Markov Chains Regular Markov Chains Absorbing Markov Chains

Markov Chains and Hidden Markov Models

Markov Chains

Markov Chains

Property Testing on Combinatorial Objects

Tutorial: Markov Chains

Markov Chains

Markov Chains

Markov chains