430 likes | 544 Views
Tight Hardness Results for Some Approximation Problems [Raz,Håstad,...]. Adi Akavia Dana Moshkovitz & Ricky Rosen S. Safra. “Road-Map” for Chapter I. Gap-3-SAT . expander. Gap-3-SAT-7 . Parallel repetition lemma. par[ , k] . X Y is NP-hard. Maximum Satisfaction.
E N D
Tight Hardness Results forSome Approximation Problems[Raz,Håstad,...] Adi Akavia Dana Moshkovitz & Ricky Rosen S. Safra
“Road-Map” for Chapter I Gap-3-SAT expander Gap-3-SAT-7 Parallel repetition lemma par[, k] XY isNP-hard
Maximum Satisfaction Def: Max-SAT • Instance: • A set of variables Y = { Y1, …, Ym } • A set of Boolean-functions (local-tests) over Y = { 1, …, l } • Maximization: • We define () = maximum, over all assignments to Y, of the fraction of i satisfied • Structure: • Various versions of SAT would impose structure properties on Y, Y’s range and
Max-E3-Lin-2 Def: Max-E3-Lin-2 • Instance: a system of linear equationsL = { E1, …, En } over Z2each equation of exactly 3 variables[whose sum is required to equal either 0 or 1] • Problem: Compute (L)
Example x1+x2+x3=1 (mod 2) x4+x5+x6=1 x7+x8+x9=1 x1+x4+x7=0 x2+x5+x8=0 x3+x6+x9=0 • Assigning x1-6=1, x7-9=0 satisfy all but the third equation. • No assignment can satisfy all equation, as the sum of all leftwing of equations equals zero (every variable appears twice) while the rightwing sums to 1. • Therefore, (L)=5/6.
2-Variables Functional SAT Def[ XY ]: over • variables X,Y of range Rx,Ry respectively • each is of the form xy: RxRy an assignment A(XRx, YRy)satisfies xyiffxy(A(x))=A(y) [ Namely, every value to x determines exactly 1 satisfying value for y] Thm: distinguishing between • A satisfies all • A satisfies < fraction of is NP-hard as long as |Rx|,|Ry|> -0(1)
Proof Outline Gap-3-SAT ? Gap-3-SAT-7 Def: 3SAT is SAT where every iisa disjunction of 3 literals. Def: gap-3SAT-7 is gap-3SAT with theadditional restriction, that everyvariable appears in exactly 7 local-tests Theorem: gap-3SAT-7 is NP-hard par[, k] XY isNP-hard
Expanders Gap-3-SAT ? Expanders Gap-3-SAT-7 Def: a graph G(V,E) is a c-expander if for everySV, |S| ½|V|: |N(S)\S| c·|S| [where N(S) denotes the set of neighbors of S] Lemma: For every m, one can construct in poly-time a 3-regular, m-vertices, c-expander, for some constant c>0 Corollary: a cut between S and V\S, for |S| ½|V| must contain > c·|S| edges par[, k] XY isNP-hard
Reduction Using Expanders Assume ’ for which (’) is either 1 or 1-20/c. is ’ with the following changes: • an occurrence of y in i is replaced by a variable xy,i • Let Gy, for every y, be a 3-regular, c-expander over all occurrences xy,iof y • For every edge connecting xy,i to xy,j in Gy, add to the clauses (xy,i xy,j) and (xy,i xy,j) It is easy to see that: • || 10 |’| • Each variable xy,iof appears in exactly 7i constructible by the Lemma ensuring equality
Correctness of the Reduction • is completely satisfiable iff ’ is • In case ’ is unsatisfiable: (’) <1-20/cLet A be an optimal assignment to Let Amajassign xy,i the value assigned by A to the majority, over j, of variables xy,jLet FA and FAmaj be the sets of unsatisfied by A and Amaj respectively: ||·(1-()) = |FA| = |FAFAmaj|+|FA\FAmaj| |FAFAmaj|+½c|FAmaj\FA| ½c|FAmaj| andsince Amaj is in fact an assignment to ’() 1- ½c(1- (’))/10 < 1- ½c(20/c)/10= 1-
Notations Def: For a 3SAT formula over Boolean variables Z, • Let Zk be the set of all k-sequences of ’s variables • Let k be the set of all k-sequences of ’s clauses Def: Let Y=Zk and X=k, and • RYbe the set of all assignments to k variables • RXbe the set of all sat. assignments to k clauses Def: For any set of k variables yZk, and a set of k clauses xk, denote y xy is a choice of one variable of each clause in x.
Parallel SAT Def: for a 3SAT formula over Boolean variables Z, let par[, k] be the following 2-var functional SAT par[, k]’s variables: • Y=Zk (one var. for each k z’s) each y‘s range is RY • Xk (one var. for each k ‘s) each x‘s range is RX par[, k]local-tests:xyfor every y xwhich accepts if x’s valuerestricted to y is y’s value[namely, if the assignments are consistent] |RY|=2k |RX|=7k
Gap Increases with k Gap-3-SAT Gap-3-SAT-7 Parallel repetition lemma par[, k] Note that if () = 1 then (par[, k]) = 1 On the other hand, if is not satisfiable: Lemma: (par[, k]) ()c·kfor some c>0 Proof: first note that1-(par[, 1]) (1-())/3 now, to prove the lemma, apply the Parallel-Repetition lemma [Raz] to par[, 1] XY isNP-hard In any assignment tos variables, any unsatisfied clause in”induces“ at least 1 (out of corresponding 3) unsatisfied par[, 1]
Conclusion: XY isNP-hard • distinguish between: • A satisfies all • A satisfies <p fraction of is NP-hard as long as |Rx|,|Ry| p-0(1)
“Road-Map” for Chapter II XY isNP-hard Long code L () = () LLC-Lemma: (L) = ½+/2 (par[,k]) > 42
Main Theorem Thm: gap-Max-E3-Lin-2(1-, ½+)is NP-hard. That is, for every constant 0<<¼ it is NP-hard to distinguish between the case where 1- of the equations are satisfiable and the case where ½+ are. [ It is therefore NP-Hard to approximateMax-E3-Lin-2 to within factor 2- for any constant 0<<¼]
This bound is tight • A random assignment satisfies half of the equations. • Deciding whether a set of linear equations have a common solution is in P (Gaussian elimination).
XY isNP-hard Long code L LLC-Lemma: (L) = ½+/2 (par[,k]) > 42 Distributional Assignments Let be a SAT instance over variablesZ of range R. Let (R) be all distributions over R Def: a distributional-assignment to is A: Z (R)Denote by () themaximum over distributional-assignments A of theaverage probability for to be satisfied,if variables` values are chosen according to A Clearly() (). Moreover Prop: () ()
Distributional-assignment to 1 1 0 1 0 0 0 1 x1 x3 x2 xn
Restriction and Extension Def: For any yYover RY and xXover RXs.t xy • The natural restrictionof an aRXto RY is denoted a|y • The elevationof a subset FP[RY] to RX is the subset F*P[RX] of all members a of RX for which xy(a) FF* = { a | a|y F }
XY isNP-hard Long-Code Long code L In the long-code the set of legal-words consists of all monotone dictatorships This is the most extensive binary code, as its bits represent all possible binary values over n elements LLC-Lemma: (L) = ½+/2 (par[,k]) > 42
Long-Code • Encoding an element e[n]: • Eelegally-encodes an element e if Ee = fe T F F T T
Long-Code over Range R |BP[R]| = 2|R|-1-1 BP[R] the set of all subsets of R of size ≤½|R| • Our long-code: in our context there’re two types of domains “R”:Rx and Ry . Def: an R-long-code has 1 bit for each F BP[R] Def: a legal-long-code-word encoding an element eR, is ERe: BP[R] {-1, 1} that assigns the Boolean value eF to every subset F BP[R]
Linearity of a Legal-Encoding An assignment A : BP[R] {-1,1}, if legal, is a linear-function, i.e., F, G BP[R]:A(F) A(G) A(FG) Unfortunately, any character is linear as well
XY isNP-hard The Variables of L Long code L Consider (xy) withsmall constant p (to befixed later) L has 2 types of variables: • a variable z[y,F] for every variable y of and a subset F BP[RY] • a variable z[x,F] for every variable x of and a subset F BP[RX] LLC-Lemma: (L) = ½+/2 (par[,k]) > 42
The Distribution Def: denote by the distribution over all subset of Rx, which assigns probability to a subset H as follows:Independently, for each a Rx, let • aH with probability • aH with probability 1- One should think ofas a multiset of subsets in which every subsetHappears with the appropriate probability
Linear equation L‘s linear-equations are the union, over all xy par[,k], of: xy , F, G, H there’s a testz[y, F] + z[x, G] = z[x, F* G H ] (mod 2) ‘F*GH’is the symmetric differenceof the extension of Fto SC, GandH
Multiplicative representation General Fourier Analysis facts Revised Representation Representation by Fourier Basis Multiplicative Representation: • True -1 • False 1 • L: • z[X,*], z[Y,*] {-1, 1} • z[X, F] • z[Y, G] • z[X, F* • G • H ] = 1 Claim 2 Claim 1 Claim 3:The expected success of the distributional assignment on [C,V]par[,k] is at least 4 2 (par[,k]) > 42
XY isNP-hard L Prop: if () = 1 then (L) = 1- Proof:Let A be a satisfying assignment to . Assign all variables of Laccording to the legal encoding of A’s values.A linear equation of L, corresponding to X,Y,F,G,H, would be unsatisfied exactly if A(x)H, which occurs with probability over the choice of H. LLC-Lemma: (L) = ½+/2() > 42 LLC-Lemma: (L) = ½+/2 (par[,k]) > 42 Note: independent of p! (Later we define psmall enough). = 2(L) -1
Hardness of approximating Max-E3-Lin-2 Main Theorem: For any constant >0: gap-Max-E3-Lin-2(1-,½+) is NP-hard. Proof: By proposition(’) = 1 (L’) 1-
Lemma Main Theorem Prop:Let be a constant >0 s.t.:(1-)/(½+/2) 2-Set p < 43Then () < 1 (L’) ½+/2 ½+ Proof: Assume, by way of contradiction, that (L) ½+/2 then:43 > (’)c·k () > 42, which implies that > .Contradiction! of the parallel repetition lemma
An Assignment to L For any variable x of The set z[x,*] of variables ofL represents the long-code of xLet be the Fourier-Coefficient <|z[x,*],s> For any variable Y of The set z[Y,*] of variables ofL represent the long-code of yLetbe the Fourier-Coefficient <|z[y,*],s>
The Distributional Assignment Def: Let be a distributional-assignmentto as follows: • For any variable x • Choose a set SRx with probability , • Uniformly choose a random assignment a S. • For any variable y • Choose a set S Ry with probability , • Uniformly choose a random assignment b S.
Home Assignment • Given an assignment to a Longcode A:BP[R] {-1, 1}, show that for any (constant) > 0:| {e R | (Ee, A) > ½ + ½ } | -2where (A1, A2)is the fraction of bits A1 and A2differ on.
What’s Ahead: We would show that ‘s expected success on xy is > 42 in two steps: First we show (claim 1) that ‘s success probability, for any xy is Then show (claim 3) that value to be 42
General Fourier Analysis facts Claim 1 Multiplicative representation go to claim3 Representation by Fourier Basis Claim 1 Claim 1: The success probability of on xy is Proof:That success probability is at least and there is at least one bS s.t. b|y S’ Claim 2 Claim 3:The expected success of the distributional assignment on [C,V]par[,k] is at least 4 2 (par[,k]) > 42
Lemma’s Proof - Claim 2 (1) go to claim3 General Fourier Analysis facts Multiplicative representation Claim 2: Proof: The test accepts iffz[y, F]•z[x, G]•z[x,F*•G•H] = 1 By our assumption, this happens with probability/2+½. Now, according to the definition of the expectation: Exy, f, g, h [z[y, F]•z[x, G]•z[x, F*•G•H]] = 1•(½+/2) + (-1)•(1 -(½+/2)) = Representation by Fourier Basis Claim 2 Claim 1 Claim 3:The expected success of the distributional assignment on [C,V]par[,k] is at least 4 2 (par[,k]) > 42
Lemma’s Proof - Claim 3 Claim 3: The expected success of the distributional assignment on xy is at least 42 Proof:Claim 1 gives us the initial lower bound for the expected success:
Lemma’s Proof - Claim 3 As we’ve already seen, . Hence, our lower-bound takes the form of Or alternatively, Which allows us to use the known inequality E[x2]E[x]2 and get
Lemma’s Proof - Claim 3 Byauxiliary lemmas(4|S|)-1/2 e-2|S| (1-2)|S|, i.e.|S|-1/2 (4)1/2 ·(1-2)|S|, which yields the following bound That is, Now applying claim 2 results the desired lower bound
Lemma’s Proof -Conclusion We showed that there is an assignment scheme with expected success of at least 42 , There exists an assignment that satisfies at least 42 of the tests in () > 42 Q.E.D.
Home Assignment Show it is NP-hard, for any > 0, given a 3SAT instance , to distinguish between the case where () = 1, and the case in which () < 7/8+ Hint: Let ’s variables be as in L, and ’s clauses to take the formf OR g OR ‘f* + g + h’for f and g chosen in the same way as in L,while h is chosen as follows: • h(b) = 1 for b such that f(b|V) and g(b) are both FALSE • For all other b’s, independently for each b, h(b)=1 with probability , and 0 with probability 1-