500 likes | 712 Views
Lecture 3-Search Algorithms. Breadth-First Search Algorithm And Depth-First Search Algorithm. BFS and DFS. Both are methods to traverse graphs along edges For finding vertices People use them to discover some properties of graphs, e.g., cyclic, strongly connected, etc.
E N D
Lecture 3-Search Algorithms Breadth-First Search Algorithm And Depth-First Search Algorithm
BFS and DFS • Both are methods to traverse graphs along edges • For finding vertices • People use them to discover some properties of graphs, e.g., cyclic, strongly connected, etc. • They almost have the same ability, but when and where to choose which one. That depends.
Breadth-first search(outline) • The single source shortest-paths problem for un-weighted graph • The Breadth-first search algorithm • The running time of BFS algorithm • The correctness proof • Note: We only introduce BFS for undirected graphs. But it also works for directed graphs.
d e b c s a Shortest paths Example: 3 simple pathsfromsource s to b: <s, b>, <s, a, b>, <s, e, d, b> of length1, 2, 3, respectively. So the shortest path from s to b is <s, b>. The shortest paths from s to other vertices a, e, d, c are: <s, a>, <s, e>, <s, e, d>, <s, b, c>. There are two shortest paths from s to d.
d e b c s a The shortest-paths problem • Distance d[v]: The length of the shortest path from s to v. For example d[c]=2. Define d[s]=0. • The problem: • Input: A graph G = (V, E) and a source vertex sV • Output: A shortest path (if exists) from s to each vertex v V andthe distanced[v]. • Note:If a vertex v is not reachable from s, then d[v]=.
What does the BFS do? • It starts from a vertex s, and then searches from all the nearest vertices uniformly. • It computes a breadth-first tree. It first adds s to the tree as the root. Once a new vertex v is found by examining vertex u’s adjacent list, v and (u, v) are added into the tree. u is an ancestor of v and v is a descendant of u if u is on the path from root s to v. • Given a graph G = (V, E), and a specified vertex s, the BFS returns: • The distanced[v] from s to v for each vertex v. • A shortest path from s to vertex v for each vertex v. • BFS actually returns a shortest path tree in which the unique simple path from s to node v is a shortest path from s to v in the original graph. • In addition to the two arrays d[v] and [v], BFS also uses another array color[v], which has three possible values: • WHITE: Refers to “undiscovered” vertices; • GRAY: Refers to “discovered” but not “processed” vertices • BLACK: Refers to “processed” vertices.
… The Breadth-First Search • The idea of the BFS: • Expands the frontier between discovered and undiscovered vertices uniformly across the breadth of the frontier. • Visit the vertices as follows: • Visit all vertices at distance 1 • Visit all vertices at distance 2 • Visit all vertices at distance 3 • … … • Initially, s is made GRAY, others are colored WHITE. • When a gray vertex is processed, its color is changed to black, and the color of all white neighbors is changed to gray. • Gray vertices are kept in a queue Q. • Note: Gray vertices form the frontier between Found and Unfound vertices.
The Breadth-First Search(more details) • G is given by its adjacency-lists. • Initialization: • First Part: lines 1 – 4 • Second Part: lines 5 - 9 • Main Part: Lines 10 – 18 • Enqueue(Q, v): Add a vertex v to the end of the queue Q • Dequeue(Q): Extract the first vertex in the queue Q
b c a s f d e Example of Breadth-First Search • Problem: • Given the undirected graph below and the source vertex s, find the distance d[v] from s to each vertex vV, and the predecessor [v] along a shortest path by the algorithm described earlier.
vertex u s a b c d e f color[u] d[u] [u] G W W W W W W 0 NIL NIL NIL NIL NIL NIL NIL Example(Continued) • Initialization Q = <s> (put s into Q (discovered), mark s gray (unprocessed)) b c 0 a s f d e
vertex u s a b c d e f color[u] d[u] [u] B W G W W G W 0 1 1 NIL NIL s NIL NIL s NIL Example(continued) b 1 c • While loop, first iteration • Dequeue s from Q. Find Adj[s]=<b, e> • Mark b,e as “G” • Update d[b], d[e], [b], [e] • Put b, e into Q • Mark s as “B” • Q=<b, e> a 0 s f d e 1
vertex u s a b c d e f color[u] d[u] [u] B G B G W G G 0 2 1 2 1 2 NIL b s b NIL s b Example(continued) 2 b 1 c • While loop, second iteration • Dequeque b from Q, find Adj[b]=<s, a, c, f> • Mark a, c, f as “G”, • Update d[a], d[c], d[f], [a], [c],[ f] • Put a, c, f into Q • Mark b as “B” • Q=<e, a, c, f> 2 a 0 2 s f d e 1
2 b 1 c vertex u S a b c d e f color[u] d[u] [u] 2 B G B G G B G 0 2 1 2 2 1 2 NIL b s b e s b a 0 2 s f 2 d e 1 Example(continued) • While loop, third iteration • Dequeque e from Q, find Adj[e]=<s, a, d, f> • Mark d as “G”, mark e as “B” • Update d[d], [d], • Put d into Q • Q=<a,c,f,d>
vertex u s a b c d e f color[u] d[u] [u] B B B G G B G 0 2 1 2 2 1 2 NIL b s b e s b Example(continued) 2 • While loop, fourth iteration • Dequeque a from Q, find Adj[a]=<b, e> • mark a as “B” • Q=<c, f, d> b 1 c 2 a 0 2 s f 2 d e 1
2 b 1 c vertex u s a b c d e f color[u] d[u] [u] 2 B B B B G B G 0 2 1 2 2 1 2 NIL b s b e s b a 0 2 s f 2 d e 1 Example(continued) • While loop, fifth iteration • Dequeque c from Q, find Adj[c]=<b, d> • mark c as “B” • Q=<f, d>
2 2 b 1 c vertex u S a b c d e f color[u] d[u] [u] 2 B B B B G B B 0 2 1 2 2 1 2 NIL b s b e s b a 0 2 s f 2 2 d e 1 Example(continued) • While loop, sixth iteration • Dequeque f from Q, find Adj[f]=<b,e> • mark f as “B” • Q=<d>
2 2 b 1 c vertex u S a b c d e f color[u] d[u] [u] 2 B B B B B B B 0 2 1 2 2 1 2 NIL b s b e s b a 0 2 s f 2 2 d e 1 Example(continued) • While loop, seventh iteration • Dequeque d from Q, find Adj[d]=<c,e> • mark d as “B” • Q is empty
2 2 b 1 c vertex u s a b c d e f color[u] d[u] [u] 2 B B B B B B B 0 2 1 2 2 1 2 NIL b s b e s b a 0 2 s f 2 2 d e 1 Example(continued) • While loop, eighth iteration • Since Q is empty, stop
2 2 b 1 c vertex u s a b c d e f color[u] d[u] [u] 2 B B B B B B B 0 2 1 2 2 1 2 NIL b s b e s b a 0 2 s f 2 2 d e 1 Example(continued) Question: What happens if we change the order of vertices in each Adj[] How do you construct a shortest path from s to any vertex by using the following table?
Analysis of running time of BFS Constant times C1 n C2 n-1 C3 n-1 C4 n-1 C5 1 C6 1 C7 1 C8 1 C9 1 C10 t(n) C11 t(n)-1 C12 C13 C14 C15 C16 C17 C18 t(n)-1 Where t(n) stands for the # of condition tests of while loop, and w(u) stands for the # of vertices first discovered by u
Analysis of the Breadth-First Search Algorithm • For simplification, we use one constant c instead of the ci’s (or just 1). • The following analysis is valid for allgraphs. (How about connected graphs?) • The initialization requires c(4V + 2) time units. • Each vertex u is en-queued and thus de-queued at mostonce, thus t(n) V, and • What is the total amount of time for ? • Since each vertex is exactly first discovered at most once, so, • Hence the total amount of time needed for processing the whole graph is
Graphs that are not connected • For such graphs, only the vertices v that are in the same component as s will get a value d[v]. • In particular, we can use the array d[ ] at the end of the computation to decide whether the graph is connected. How? • Alternatively, we can use the array color[ ] or the array [ ]. How?
Continued • We can modify BFS so that it returns a forest. • More specifically, if the original graph is composed of connected components C1, C2, …,Ck, then BFS will return a tree corresponding to each Ci.
For each vertex uV color[u]=WHITE; d[u]= ; [u]=NIL; For each vertex u V If d[u] = then BFSR(G, u); Note, BFSR is a revised version of BFS, what is the difference? BFS for disconnected graph
BFS algorithm computes the shortest paths • To prove this conclusion, we need to prove the following two parts: • Prove that the BFS algorithm outputs the correct distance d[v] • Prove that the paths obtained by using the array [ ] are shortest. • We need to prove the predecessor sub-graphG in the following is a breadth-first tree (if it contains all the vertices that are reachable from s, and there is a unique simple path from s to each vertex v in the G that is also a shortest path from s to v). • G=(V, E), where V = {v | [v] Nil } {s} E = {([v], v), v V-{S}}
Correctness proof • We will prove the correctness of the Dijkstra’s algorithm, which implies the correctness of the BFS algorithm. • Here is a simple architecture of the proof: • The vertices in the queue have at most two consecutive d[] values, and the vertices with smaller d[] values are in the head (by induction on # of vertices processed). • The earlier a vertex is processed, the smaller its d[] value is, vise versa (by induction on the order of vertices processed). • When a vertex is first discovered, its d[] value is equal to its distance from s (by induction on the order of vertices discovered).
Depth-first Search Algorithm(outline) • The idea of DFS • The DFS algorithm • The time complexity of DFS algorithm • Properties of the DFS algorithm
Depth-First Search Algorithm(ideas) • In DFS, edges are explored out of the most recently discovered vertex v. Only edges to unexplored vertices are explored. • When all of v’s incident edges have been explored, search will backtrack to explore edges leaving the vertex from which v is discovered. • The process continues until all the vertices reachable from the original source vertex are discovered. • If any undiscovered vertex remain, then one of them is selected as a new source vertex, and the search repeat from this new source vertex. • The process is repeated until all the vertices are discovered. • The strategy of DFS is to search “deeper” whenever it is possible.
Four Arrays for DFS Algorithm • color[u]: the color of each vertex visited • WHITEundiscovered • GRAY discovered but not finished processing • BLACKfinished processing. • [u]: The predecessor of u, indicating the vertex from which u is first discovered. • d[u]: Discovery time, a counter indicating when vertex u is first discovered. • f[u]: Finishing time, a counter indicating when the processing of vertex u (and the processing of all its descendants ) is finished.
DFS Algorithm • Input: A graph G = (V, E). • Outputs: Four arrays: Color[], [], d[], and f[]. • Note: From [], we can derive the predecessor sub-graph G=(V, E), where E = {([v], v), v V and [v]Nil } The predecessor sub-graph forms a depth-first forest that is composed of depth-first trees.
DFS Algorithm (Example) 1/ a d b c g e f
Example(continued) 1/ a d 2/ b c g e f
Example(continued) 1/ a d 2/ b 3/ c g e f
Example(Continued) 1/ a d 2/ b 3/ c g e f 4/
Example(Continued) 1/ a d 2/ b 3/ c g e f 4/5
Example(Continued) 1/ a d 2/ b 3/6 c g e f 4/5
Example(Continued) 1/ a d 2/7 b 3/6 c g e f 4/5
Example(Continued) 1/8 a d 2/7 b 3/6 c g e f 4/5
Example(Continued) 1/8 9/ a d 2/7 b 3/6 c g e f 4/5
Example(Continued) 1/8 a 9/ d 2/7 b 3/6 c g e 10/ f 4/5
Example(Continued) 1/8 a 9/ d 2/7 b 3/6 c 10/11 g e f 4/5
Example(continued) 1/8 a 9/ d 2/7 b 3/6 12/13 c g e 10/11 f 4/5
1/8 a 9/14 d 2/7 b 12/13 3/6 c 10/11 g e f 4/5 Example(Continued) What happens if we change the order vertices of in each Adj[] or the order of vertices in line 5 of DFS(G)? In a directed graph?
What does DFS do? Given a graph G, it traverses all vertices of G and • Constructed a collection of rooted trees together with a set of source vertices (roots) • Outputs two arrays d[ ]/f[ ] Note: Forest is stored in array [ ], the [ ] value of a root is null.
Running time analysis of DFS DFS-VISIT(u) color[u]=gray; //1 time=time+1; //1 d[u]=time; //1 for each vadj[u] //d(u) do if color[v]=white //d(u) then [v]=u; //w(u) DFS-VISIT(v); //w(u) color[u]=black; //1 time=time+1; //1 f[u]=time; //1 sum: T(u) =6+2d(u)+2w(u) DFS(G) for each u in V//n do color[u]=white; //n [u]=null; //n time=0; //1 for each u in V//n do if color[u]= white //n then DFS-VISIT(u); //t(n) • Sum:5n+1+t(n)=(n) Where t(n) stands for # of DFS-VISIT(u) calls,w(u) stands for # of vertices first discovered by u, and T(u) stands for the time of DFS-VISIT(u) procedure.
Running time analysis of DFS Total running time is And since, So, total running time is: (V+E) Note: Combine t(n) and w(u) together.