540 likes | 597 Views
Time Complexity. Measuring Complexity Worst-case analysis Average-case analysis Define M: a det. Turing machine that halts on all input The running time or time complexity of M is the function f: N N, where f(n) is the maximum number of steps that M uses on any input of length n
E N D
Measuring Complexity • Worst-case analysis • Average-case analysis • Define M: a det. Turing machine that halts on all input • The running time or time complexity of M is the function f: NN, where f(n) is the maximum number of steps that M uses on any input of length n • If f(n) is the running time of M, we say M runs in time f(n) and that M is an f(n) time Turing machine.
Big-O and small-o Notation • Def: • Def: • Def:
The class P P is the class of languages that are decidable in polynomial time on a deterministic single-tape TM In other words,
s t
t s • The Class NP • Hamiltonian path in a directed graph G is a directed path that goes through each node once • Note: • It is not hard to find an exponential time algorithm for the HAMPATH problem • The HAMPATH problem can be verified in polynomial time
Not all problem can be verified in poly. time. • Def: A verifier for a language A is an algorithm V, where A = { | V accepts < ,c> for some string c} • A polynomial time verifier runs in polynomial time in the length of • A language A is polynomially verifiable if it has a polynomial time verifier. • A verifier uses additional information c to verify that a string is member of A • C: certificate or proof of membership in A 12653 = 123 x 111 verifiable in poly. time
Def: NP is the class of languages that have polynomial time verifiers NP: Non-deterministic Polynomial time • Def: • Thm: A language is in NP iff it is decided by some non-deterministic polynomial time Turing machine Pf:
Cor: • Examples of problem in NP • Def: clique is a graph, where every two nodes are connected by an edge • A k-clique is a clique that contains k nodes A graph of 5-clique
Def: • Thm: CLIQUE is in NP pf :
Def: • Thm: SUBSET-SUM is in NP Pf:
P = the class of languages where membership can be decided quickly • NP= the class of languages where membership can be verified quickly • BIG Question!! P = NP ?
NP-completeness • The satisfiability problem is to test whether a Boolean formula is satisfiable
Thm: Cook-Levin Theorem • Def:
Def: A B
Thm: Pf:
Def: A language B is NP-complete if it satisfies two condition: 1. B is in NP 2. Every A in NP is polynomial time reducible to B (with only 2, it is called NP-hard) • Thm: Pf:
Thm: Pf:
Cook-Levin Theorem: SAT is NP-complete. Proof : 1> SAT NP: A NTM can guess an assignment to a given formula and accept if the assignment satisfies . 2> For any A NP and show that A is polynomial time reducible to SAT. Let N be a non-deterministic TM that decides A in nk time for some constant k. A tableau for N on w is an nkXnk table whose rows are the configurations of a branch of the Computation of N on input w. # q0 w1 # start configuration cell # # 2nd configuration # # window # # nkth configuration A tableau is accepting if any row of the tableau is accepting configuration.
Every accepting tableau for N on w corresponds to a computation branch of N on w. * Determine whether N accepts w is equivalent to determine whether an accepting tableau for N on w exists. • If , it means cell[i,j] contains an s. • Design so that a satisfying assignment to the variable does correspond to an accepting tableau for N on w. f:polynomial time reduction from A to SAT. On input w, the reduction produces a formula . Let Q and be the state set and the tape alphabet of N. Let For and each we have a variable such variables. (1) i s j
q1 a b q2 c a Ensure that the first row of the table is the starting configuration of N on w. (3)Guarantees that an accepting configuration occurs in the tableau. (4) This part is more complicated ! guarantee that each row of the table corresponds to a configuration that legally follows the preceding Row’s configuration according to N’s rules. a a # b q1 b a q2 a b c # a a # q2 b b a a (c) (b) (a) q1 q1 a a a b b a a q2 a a legal windows (c) (b) (d) b b b a a b b # a b c b q2 a b a # b
Examples of illegal windows: (c) (b) (a) q1 q1 a a a b a b a q2 q1 b q2 a a a a a Claim:If the top row of the table is the start configuration and every window in the table is legal, each row of the table is a configuration that legally follows the preceding one. a a # # b q1 b a q2 a # b c # a a is a legal window a2 j a1 a3 a4 j+1 a5 a6 i-1 i+1 i Size of : Total number of variables Thus the reduction is poly.
Cor: 3SAT is NP-complete. • Cor: CLIQUE is NP-complete. Proof : 3SAT is in NP. How about
Vertex cover of G: If G is an undirected graph, a vertex cover of G is a subset of the nodes where every edge of G touches one of those nodes. {2,3} is a vertex cover. 1 4 {1,3} is not a vertex cover. 3 VERTEX-COVER={<G,k>|G is an undirected graph that has a k-node vertex cover}.
Variable gadgets Clause gadgets Proof : VERTEX-COVER is in NP. • Thm: VERTEX-COVER is NP-complete. Need to show that is satisfiable iff G has a vertex cover with k nodes. Let have m variables and l clauses. Let k=m+2l. • One true variable from each variable gadget. • two node from each clause gadget. (2) Each of the 3 edges connecting the variable gadgets with each clause gadget is covered.
Proof : SUBSET-SUM is in NP. Let be a boolean formula with variables x1,…,xl and clauses c1,…,ck. • Thm: SUBSET-SUM is NP-complete. 1 2 3 4 ….. l C1 C2 ….. Ck 1 0 0 0 ….. 0 1 0 ….. 0 1 0 0 0 ….. 0 0 0 ….. 0 1 0 ….. 0 ….. 1 …. 1 0 0 ….. 0 0 1 ….. 0 1 0 0 ….. 0 1 0 ….. 0 1 0 ….. 0 ….. 0 …. 1 0 ….. 0 1 1 ….. 0 1 0 ….. 0 0 0 ….. 1 clause cj contains xi 1 0 0 ….. 0 1 0 0 ….. 0 1 0 ….. 0 1 0 ….. 0 1 ….. 0 1 ….. 0 1 1 1 1 1 1 ….. 1 3 3 ….. 3
(1) Suppose is satisfiable. Construct a subset S as follows. If xi is assigned TRUE, select yi else select zi. I.e. for each I, we select either yi or zi. Last k digits add up between 1 and 3. Select enough of g and h numbers to make each of the last k digits up to 3. (2) Suppose a subset of S sums to t. We construct a satisfying assignment to . If the subset contains yi , we assigned xi TRUE else assign xi FALSE. This assignment satisfied ! Why?? The table has size
MAX-SAT: Given a boolean formula , and an integer k, is there a truth assignment that satisfies at least k clauses? • Thm: MAX-SAT is NP-complete Pf: SAT p MAX-SAT = c1 c2 … cm k = m • Thm: DOUBLE-SAT: Given a boolean formula , are there at least 2 truth assignment for ? DOUBLE-SAT is NP-complete
Def: Hamiltonian path in a directed graph G is a directed path that goes through each node once. • HAMPATH: Given a directed graph and its 2 vertices s and t, is there a Hamiltonian path from s to t? • Thm: HAMPATH is NP-complete Pf: (long…later) 3SAT p HAMPATH • UHAMPATH: Given a undirected graph and its 2 vertices s and t, is there a Hamiltonian path from s to t?
Proof: (HAMPATH is NPC) • The following is a 3cnf-formula with k clauses: =(a1∨b1∨c1) ∧ (a2∨b2∨c2) ∧ …∧ (ak∨bk∨ck) Each a,b,c is a literal xi or , and x1,…,xl are l variables of . • xi : ci :
s c1 • Proof conti. c2 cl t
Proof conti. • If there is a satisfying assignment, “select exactly oneof the literals” in a clause and assign it TRUE. • If there is a truth assignment, then there exists an Hamiltonian path from s to t.
Proof conti. • If the Hamiltonian path is normal, that is, it goes through the diamonds in order from the top one to the bottom one, we can easily get the satisfying assignment. • Because each clause node appears on the path, by observing the diamond at which the detour to it is taken, we may determine which of the literals in the corresponding clause is TRUE. • If the path zigzag, we assign corresponding variable TRUE If the path zagzig, we assign FALSE.
Proof conti. • We show that a Hamiltonian path must be normal. c
vout v vmid vin u uin s uout sout umid tin t • Thm: UHAMPATH is NP-complete Pf: HAMPATH p UHAMPATH directed G undirected G’ u V(G) vertices of G’: u uin, umid, uout s sout ; t tin edges of G’: u v E(G) uin– umid – uout – vin– vmid – vout if s u1 u2 … uk t is a Hamiltonian path in G, then sout –u1in– u1mid – u1out –u2in– u2mid – u2out – … –ukin– ukmid – ukout – tin is an undirected Hamiltonian path in G’
Undirected Hamiltonian Cycle • Thm: Undirected Hamiltonian Cycle is NP-comlplete Pf: UHAMPATH p UHAMCYCLE • Traveling Salesman Problem: (TSP) Given an integer n2, an nn distance matrix of some cities, and an integer B0, is there a tour that visits every city exactly once and returns to the starting city by traveling within distance B? • Thm: TSP is NP-complete Pf: UHAMCYCLE p TSP
Longest Cycle: Given a graph and integer k, is there a cycle, with no repeated nodes, of length at least k? • Thm: LONGEST CYCLE is NP-complete • SUBGRAPH ISOMORPHISM: Given 2 undirected graphs G and H, is G a subgraph of H? • Thm: SUBGRAPH ISOMORPHISM is NP-complete
Coping with NP-completeness: Approximation algorithms: • Let x be an instance of an optimization problem • Opt(x): the optimum solution of x • A: a poly. time algorithm for x • : positive real number If A satisfies For all x, then we say A is an -approximation algorithm
Eg. The following is a 1-approximation algorithm for the vertex cover problem. G: C={2,3} Algorithm: 1. C :=; 2. while there is an edge [u,v] in G add u, v to C and delete them from G Let be the optimum vertex cover c={ , , ,…, } 2 4 3 1
Polynomial Time Approximation Scheme(PTAS) We say that an approximation scheme is PTAS, if for any fixed >0, the scheme runs in time polynomial in the size n of its input instance. • Ratio bound: • Inapproximable problem: If there is no -approximation algorithm for them with however large , unless P=NP
Thm: TSP is inapproximable unless P=NP Pf: Let G be a graph with n nodes. UHAMCYCLE p TSP If G has a Hamiltonian cycle, then the optimum cost of a tour is n; otherwise if G has no Hamiltonian cycle, then the optimum cost of a tour > n(1+) 1 2+3 1
If TSP had an -approximation algorithm A, then we would be able to tell whether G has a Hamiltonian cycle. Run A for TSP: i> if the returned cost n(1+)+1: No cycle ii> if the returned cost n(1+): has a cycle Unless P=NP, TSP is inapproximable.
Backtracking and Branch&Bound: A={S0} while A is not empty do choose a subproblem S and delete it from A choose a way of branching out of S , say to subproblems S1, S2,…, Sr for each subproblem Si in this list do if test(Si) returns “solution found” else if test(Si) returns “?” then add Si to A Return “no solution”
Eg. x = T x = F y = T y = F z = T z = F