430 likes | 557 Views
Minimum Spanning Trees. Problem Description Why Compute MST? MST in Unweighted Graphs MST in Weighted Graphs Kruskal’s Algorithm Prim’s Algorithm. Spanning Tree: Definition. A Spanning tree = a subset of edges from a connected graph that:
E N D
Minimum Spanning Trees • Problem Description • Why Compute MST? • MST in Unweighted Graphs • MST in Weighted Graphs • Kruskal’s Algorithm • Prim’s Algorithm
Spanning Tree: Definition • A Spanning tree = a subset of edges from a connected graph that: • touches all vertices in the graph (spans the graph) • forms a tree (is connected and contains no cycles) A A A A 9 9 5 9 1 1 C B C C C B B B 2 2 4 4 7 4 7 7 7 D D D D A weighted graph Three spanning trees • Minimum Spanning Tree: Spanning tree with the least total edge cost
MST Problem • Given a weighted, undirected graph G=(V, E), compute the minimum cost spanning tree • MST may not be unique (unless all edge weights are distinct) 7 8 c d b 9 4 2 4 Cost = 37 14 e i 11 a 6 7 10 8 f g h 2 1 7 8 c d b 9 4 2 Cost = 37 4 14 e i 11 a 6 7 10 8 f g h 2 1
Why Compute MST? • Minimize length of gas pipelines between cities • Find cheapest way to wire a house (with minimum cable) • Find a way to connect various routers on a network that minimizes total delay • Eliminate loops in a switched LAN so that broadcast packets will not circle around indefinitely
Some Basic Facts about Free Trees • Notice: An MST is a free tree: • A free tree with “n” vertices has exactly “n-1” edges • There exists a unique path between any two vertices of a free tree • Adding any edge to a free tree creates a unique cycle. Breaking any edge on this cycle restores a free tree
Computing MST–Unweighted Graphs • What if the graph is unweighted or all edge weights are equal? • Simply run BFS or DFS and the resulting tree is a MST A A BFS(A) B C E C B DFS(A) D E D A B C E D
Computing MST – Weighted Graphs • We will present two greedy algorithms for computing MSTs in weighted graphs • Kruskal’s Algorithm • Prim’s Algorithm • A greedy algorithm • always makes choices that currently seem the best • Short-sighted – no consideration of long-term or global issues • Locally optimal does not always mean globally optimal. but works in some cases, e.g., MST, shortest-paths, Huffman coding, …
Greedy MST Algorithms • Let G = (V, E) be an undirected, connected graph whose edges have numeric edge weights, which may be positive, negative or zero • The intuition behind the greedy algorithms is simple: • Maintain a subset of edges A, initially empty • Add edges to A one-by-one until A equals the MST • A <- EmptySet • One-by-one add a “safe” edge to A, until A equals the MST
When is an Edge Safe? • Let S be a subset of the vertices S <= V. • A cut (S, V-S) is just a bipartition of the vertices into two disjoint subsets • An edge (u, v) crosses the cut if one endpoint is in S and the other is in V-S. c d b S V-S e i a f g h
MST Lemma • Let G = (V, E) be a connected, undirected graph with real-valued weights on the edges. • Let (S, V-S) be any cut • Let (u, v) be an edge that crosses this cut & has the minimum weight, i.e., (u, v) is the light edge • Then edge (u, v) is a safe edge c d b S V-S e i a f g h
MST Lemma: Proof 9 4 4 4 u u v u v v 7 6 x x x 8 8 y y y T + (u,v) A T’ = T – (x,y) + (u,v) • Let us assume that all edge weights are distinct • Let T denote the MST • If T contains (u, v) we are done • If not, there must be another edge (x, y) that crosses the cut and is part of the MST • Let us now add (u, v) to T, thus creating a cycle • Now remove (x, y). We get another spanning tree, call it T’
MST Lemma: Proof • We have w(T’) = w(T) – w(x, y) + w(u, v) • Since (u, v) is the lightest edge crossing the cut, we have w(u, v) < w(x, y). Thus w(T’) < w(T) contradicting the assumption that T was an MST 9 4 4 4 u u v u v v 7 6 x x x 8 8 y y y T + (u,v) A T’ = T – (x,y) + (u,v)
Kruskal’s Algorithm • Kruskal’s Algorithm works by attempting to add edges to A in increasing order of weight (lightest edges first). • If the next edge does not induce a cycle among the current set of edges, then it is added to A • If it does, then this edge is passed over, and we consider the next edge in order Generic MST Algorithm A <- EmptySet One-by-one add a “safe” edge to A, until A equals the MST
Kruskal’s Algorithm Kruskal(G = (V, E)){ A = {}; // Initially A is empty Sort E in increasing order by weight w; for each ((u, v) from the sorted list){ if (adding (u, v) does not induce a cycle in A){ Add (u, v) to A; } //end-if } //end-for return A; // A is the MST } //end-Kruskal
Kruskal’s Algorithm: Example 7 8 c d b 9 4 2 4 14 e i 11 a 6 7 10 8 f g h 2 1 Sorted Edge List (i, h) (i, h) (i, g) (i, g) (a, b) (c, f) (a, b) (c, f) (i, c) (g, f) (h, g) (i, c) (g, f) (h, g) (d, f) (d, f) (e, f) (b, h) (e, f) (b, h) (d, e) (d, e) (a, h) (b, c) (a, h) (b, c) (c, d) (c, d)
Kruskal’s Algorithm: Example • As this algorithm runs, the edges of A will induce a forest on the vertices. • As the algorithm continues, the trees of this forest are merged together, until we have a single tree containing all the vertices 7 8 c d b 9 4 2 4 14 e i 11 a 6 7 10 8 f g h 2 1
Kruskal’s Algorithm: Correctness • Observe that the strategy leads to a correct algorithm. Why? • Consider the edge (u, v) that Kruskal’s algorithm seeks to add next and suppose that this edge does not induce a cycle in A • Let A’ denote the tree of the forest A that contains vertex u • Consider the cut (A’, V-A’) • Every edge crossing the cut is not in A’, and (u, v) is the light edge across the cut (because any lighter edge would have been considered earlier by the algorithm) • Thus by the MST Lemma, (u, v) is safe
Kruskal’s Algo: Implementation Kruskal(G = (V, E)){ A = {}; // Initially A is empty Sort E in increasing order by weight w; for each ((u, v) from the sorted list){ if (adding (u, v) does not induce a cycle in A){ Add (u, v) to A; } //end-if } //end-for return A; // A is the MST } //end-Kruskal How to detect the cycle? • The only tricky part in the algorithm is how to detect whether the addition of an edge will create a cycle in A
Kruskal’s Algo: Implementation • This can be done by the disjoint set (Union-Find) data struc., which supports 3 operations • CreateSet(u): Create a set containing a single item u • Find(u): Find the set that contains a given item u • Union(u, v): Merge the set containing u and the set containing v into a common set • It is sufficient to know that each of these operations can be performed in O(logn) time • In fact, there are faster implementations
Kruskal’s Algo: Implementation • In Kruskal’s Algorithm, the vertices of the graph will be the elements to be stored in the sets • The sets will be vertices in each tree of A • The set A can be stored as a simple list of edges
Kruskal’s Algo: Final Version Kruskal(G = (V, E)){ A = {}; // Initially A is empty for each (u in V) CreateSet(u); // Create a set for each vertex Sort E in increasing order by weight w; // O(eloge): // e <= n^2 loge < 2logn for each ((u, v) from the sorted list){ // O(elogn) if (Find(u) != Find(v)) { // if u and v are in Add (u, v) to A; // different trees Union(u, v); } //end-if } //end-for return A; } //end-Kruskal
Kruskal’s Algorithm: Example (h, g) (h, g) 7 8 c d b (i, c) (i, c) 9 4 2 4 (g, f) (g, f) 14 e i 11 a 6 7 (a, b) (a, b) 10 8 f g (c, f) (c, f) h 2 1 (i, g) (i, g) (i, h) (i, h) c d i b g h a e f (c, d) (c, d) (a, h) (a, h) g i (b, c) (b, c) i a i g i h c (d, e) (d, e) i e a d g c a d g c b d (e, f) (e, f) h f g c g c b h f (b, h) (b, h) b h f h f h f (d, f) (d, f)
Prim’s Algorithm • Prim’s Algorithm is another greedy algorithm for MST • Differs from Kruskal’s Algorithm only in how it selects the next “safe edge” to add at each step
Why study Prim’s Algorithm? • 2 reasons to study Prim’s Algorithm • To show that there is more than one way to solve a problem • An important lesson to learn in algorithm design • Prim’s algorithm looks very much like another Greedy Algorithm, called Dijkstra’s Algorithm used to compute shortest paths • Thus not only Prim’s algorithm is a different way to solve a different problem, it is also the same way to solve a different problem
Prim’s Algorithm: Pseudocode Prim(G = (V, E)) { Start with a root vertex “r” (any vertex in the graph) A = {r}; for (i=1; i<=n-1; i++) { 1. Consider the cut (A, V-A) 2. Let (u, v) be the light edge that crosses the cut such that u Є A & v Є V-A 3. Add v to A, i.e., A = A U {v}; } //end-for return A; // A is the MST } //end-Prim
Prim’s Algo: Growing the Tree 12 12 10 10 6 6 r r 7 7 11 3 u u u 4 9 5 5 Current Tree Vertices (A) After “u” is added
Prim’s Algo: Growing the Tree • Observe that we consider the set of vertices A current part of the tree, and its complement (V-A) • We have a cut of the graph (A, V-A) • Which edge should we add next? • MST Lemma tells us that it is safe to add the light edge
Prim’s Algorithm: Example 7 8 9 4 2 4 14 11 6 7 10 8 2 1
Prim’s Algo: Implementation Prim(G = (V, E)) { A = {r}; for (i=1; i<=n-1; i++) { 1. Consider the cut (A, V-A) 2. Let (u, v) be the light edge that crosses the cut such that u Є A & v Є V-A 3. Add v to A, i.e., A = A U {v}; } //end-for return A; // A is the MST } //end-Prim • The key question in the efficient implementation of Prim’s algorithm is • how to update the cut efficiently • how to determine the light edge quickly
Prim’s Algo: Implementation - I Prim(G = (V, E)) { for all u in V do color[u] = white; color[r] = black; for (i=1; i<=n-1; i++) { min = INFINITY; // cost of the light edge (x, y) = (?, ?); // light edge for all (u, v) in E do { if (color[u] == black && color[v] == white && w(u,v) < min){ (x, y) = (u, v); // (u, v) is current light edge min = w(u, v); } //end-if } //end-for color[y] = black; // Add y to A } //end-for return A; // A is the MST } //end-Prim n times O(e) Running Time? O(nxe)
Prim’s Algo: Implementation - II • For faster implementation, we will make use of your favorite DS, the priority queue or a heap. • Recall that a heap stores a set of items, where each item is associated with a key value, and supports 3 operations (all can be implemented in O(logn)) • Insert(Q, u, key): Insert u with key value key in Q • u = Extract_Min(Q): Extract the item with the minimum key value in Q • Decrease_Key(Q, u, new_key): Decrease the value of u’s key value to new_key
Prim’s Algo: Implementation • What do we store in the priority queue? • The idea is the following: • For each vertex in u e V-A (i.e. not part of the current spanning tree), we associate u with a key value key[u], which is the weight of the lightest edge going from u to any vertex in A • We also store in pred[u] the end of this edge in A • If there is no edge from u to a vertex in V-A, then we set its key value to +infinity • We also need to know which vertices are in A • We do this by coloring the vertices in A black • Here is the algorithm…
Prim’s Algo: Implementation Prim(G, w, r){ For each (u in V) { // Initialization key[u] = +infinity; color[u] = white; } //end-for key[r] = 0; // Start at root Pred[r] = nil; Q = build initial queue with all vertices; while (Non_Empty(Q)){ // Until all vertices in MST u = Extract_Min(Q); // Vertex with lightest edge for each (v in Adj[u]) { if (color[v] == white && (w(u, v) < key[v])){ key[v] = w(u, v); // New lighter edge out of v Decrease_Key(Q, v, key[v]); pred[v] = u; } //end-if } //end-for color[u] = black; } //end-while [prev pointers define the MST as an inverted tree rooted at r] } //end-Prim n times O(logn) e times O(logn) Running Time? O(nlogn + elogn)
Prim’s Algorithm: Example 7 8 ∞ 4 ∞ ∞ 4 2 9 4 ∞ 0 0 11 14 ∞ 7 6 8 10 ∞ 8 ∞ ∞ 2 1 7 8 ∞ 4 ∞ 4 2 9 4 ∞ 0 11 14 ∞ 7 6 8 10 ∞ 8 ∞ 2 1
Prim’s Algorithm: Example 7 8 8 ∞ 4 4 ∞ 4 2 9 4 ∞ 0 11 14 ∞ 7 6 8 10 ∞ 8 ∞ 2 1 7 8 ∞ 4 8 4 2 9 4 ∞ 0 11 14 ∞ 7 6 8 10 ∞ 8 ∞ 2 1
Prim’s Algorithm: Example 7 8 8 ∞ 7 4 8 4 2 9 2 4 ∞ 0 11 14 ∞ 7 6 8 10 4 ∞ 8 ∞ 2 1 7 8 7 4 8 4 2 9 4 2 0 11 14 ∞ 7 6 8 10 4 8 ∞ 2 1
Prim’s Algorithm: Example 7 8 7 4 8 4 2 9 2 4 2 0 11 14 ∞ 7 6 8 10 4 6 7 8 ∞ 2 1 7 8 7 4 8 4 2 9 4 2 0 11 14 10 7 6 8 10 4 7 6 2 1
Prim’s Algorithm: Example 7 8 7 4 8 4 2 9 4 2 0 11 14 ∞ 10 7 6 8 10 4 4 7 2 6 2 1 7 8 7 4 8 4 2 9 4 2 0 11 14 10 7 6 8 10 4 7 2 2 1
Prim’s Algorithm: Example 7 8 7 4 8 4 2 9 4 2 0 11 14 10 7 6 8 10 4 1 7 2 2 2 1 7 8 7 4 8 4 2 9 4 2 0 11 14 10 7 6 8 10 4 1 2 2 1
Prim’s Algorithm: Example 7 8 7 4 8 4 2 9 4 2 0 11 14 10 7 6 8 10 4 1 2 1 2 1 7 8 7 4 8 4 2 9 4 2 0 11 14 10 7 6 8 10 4 1 2 2 1
Prim’s Algorithm: Example 7 8 7 7 4 8 4 2 9 4 2 0 11 14 10 9 7 6 8 10 4 1 2 2 1 7 8 7 4 8 4 2 9 4 2 0 11 14 9 7 6 8 10 4 1 2 2 1
Prim’s Algorithm: Example 7 8 7 4 8 4 2 9 4 2 0 11 14 9 9 7 6 8 10 4 1 2 2 1 7 8 7 4 8 4 2 9 4 2 0 11 14 9 7 6 8 10 4 1 2 2 1
Prim’s Algorithm: Example 7 8 7 4 8 4 2 9 4 2 0 11 14 9 7 6 8 10 4 1 2 2 1 prev pointers define the MST as an inverted tree rooted at r 7 7 8 4 8 4 2 9 2 4 0 9 4 1 2 2 1