420 likes | 603 Views
A 3-Query PCP over integers a.k.a Solving Sparse Linear Systems. Prasad Raghavendra Venkatesan Guruswami. Linear Equations. Given a system of linear equations over reals, Find a solution. Easy, Use Gaussian elimination. Noise?.
E N D
A 3-Query PCP over integersa.k.aSolving Sparse Linear Systems Prasad Raghavendra Venkatesan Guruswami
Linear Equations Given a system of linear equations over reals, Find a solution. . Easy, Use Gaussian elimination.
Noise? Given a set of linear equations for which there is a solution satisfying 99% of the equations, What is the best solution that can be efficiently found? Can we atleast satisfy 1% of the equations?
10 years ago[Håstad STOC97, JACM 01] For any prime p, ε > 0, given a set of linear equations modulo p , it is NP-hard to distinguish between: • (1 – ε) – fraction of the equations can be satisfied. • 1/p + ε– fraction of the equations can be satisfied. All equations are of the form Xi + Xj = Xk + c (mod p)
Håstad’s 3-Query PCP[STOC97, JACM 01] X1 + X2 = X3 + 10 (mod p) X1 + X3 = X5 + 17 (mod p) X9 + X4 = X3 + 23 (mod p) X11 + X2 = X31 + 1 (mod p) X1 + X2 = X7 - 1 (mod p) X1 + X3 = X8 + p-10 (mod p) …….. …….. X9 + X7 = X3 + p/2 (mod p) X5 + X2 = X7 + 10 (mod p) Can be verified by 3 queries It is a 3-Query Probabilistically Checkable Proof system for NP Just have to read values of 3 variables to check an equation. Reals?
NP-hard • [Guruswami-Raghavendra 06,Feldman-Gopalan-Khot-Ponnuswami 06] • For any ε,δ > 0, Given a set of linear equations over reals, it is NP-hard to distinguish between the following two cases: • There is a solution that satisfies 1 – εfraction of the equations. • No solution satisfies more than δ fraction of the equations. Unlike Hastad’s result, equations are not sparse
Sparse Equations? • Solving sparse systems of equations important for many applications. • In the spirit of PCP theorem.. • Sparse equations have important connections to PCPs, linearity testing, Unique Games conjecture.
Sparse Equations over Reals • For any ε,δ > 0, Given a set of sparse linear equations, it is NP-hard to distinguish between: • (1 – ε) – fraction of the equations can be satisfied. • δ – fraction of the equations can be satisfied. X1 + X2 = X3 + 10 X1 + X3 = X5 + 17 … X9 + X4 = X3 + 23 X2 + X6 = X7 + 27 Some fixed constant accuracy, say ±1
Label Cover Problem U, V : set of vertices E : set of edges {1,2… R} : set of labels πe: constraint on edge e An assignment A satisfies an edge e = (u,v) E if πe (A(u)) = A(v) 1 2 3 . . R 1 6 1 2 3 . . R 3 5 3 u 3 πe 2 2 v π e (3)=2 1 5 4 7 U V Find an assignment A that satisfies maximum number of edges
Label Cover with Long Codes 1 6 3 5 3 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 πe 2 2 1 5 4 7 Write Long Codes of the Labels instead of the labels itself
Long Code A long code over a finite field F is a function: Gi : FX F… X FXFF Gi(x1 , x2, … xn ) = xi • n different long codes. • Long code over Fp represented by a table of pn values. • Linear Function.
Extending Hastad’s result to integers Use long code over integers X2 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 1 6 3 5 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 3 G2 (x1 , x2 ) = x2 πe 2 2 1 A Long Code over integers is an infinite object. 5 4 7 Just Truncate the long code!
Core Problem X2 X2 4 4 4 4 4 3 3 3 1 3 2 2 1 2 2 1 1 1 1 1 0 0 0 0 0 ? = 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 X1 X1 G2 (x1 , x2 ) = x2 G1 (x1 , x2 ) = x1 Given two supposed long codes, query 3 locations and test if they are close to some long code If test succeeds, must decode a small set of possible labels
Proof Obstacles • Linearity Testing • Decoding Labels
Linearity Testing [Blum-Luby-Rubinfeld] With G1 = {0,1}n ,G2 = {0,1}, if A is δ- far from linear function, then the test rejects with probability at least δ Given a function from an group G1 to group G2 (both abelian) A : G1 -> G2 Pick x,y uniformly at random from G1 Test if A(x) + A(y) = A(x+y)
Derandomized Linearity Testing For sufficiently large primes p, Linearity testing on truncated long code = Testing modulo p p 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 This should imply a derandomized linearity test. Total randomness used independent of the prime.
Proof Obstacles • Linearity Testing • Decoding Labels
Fourier Analysis Â(ω) = E[ A(x)e-iω•x ] • One Fourier coefficient corresponding to every linear function Pω(x) = ω•x forω =(ω1 , ω2 ,… ωR ) in FpR Â(ω) measures similarity with Pω(x) = ω•x
Hastad’s Decoding Pick a large Fourier coefficient Â(ω) of the long code, randomly pick a nonzero coordinate ωi Decode to label i Not too many large Fourier Coefficients Parseval’s Identity
Obstacle The distribution is not uniform, so a Fourier coefficient that appears is There could be exponentially many large Fourier coefficients!
Function Fourier Transform P(x) A(x) Large values in Fourier spectrum are clustered. P(x)A(x)
Decoding Labels Pick a large Fourier coefficient AP(ω) , randomly pick one of its large coordinate ωi Assign label i to the vertex • All large Fourier coefficients in the same cluster, will yield the same label with high probability. • There are very few clusters, so there are very few possible choices
Conclusion • Sparse linear equations over real numbers are hard to solve even with little noise. • In a weak sense, complete Derandomization of linearity testing is possible. • Two variable linear equations over reals?
Testing an Edge I will just assign 0 to everything! Randomly pick a vector x = (x1,x2,.. xR) Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) Test if a(x o π) = b(x) π A B 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0
I will give something that does not look linear at all Testing an Edge Randomly pick a vector x = (x1,x2,.. xR) Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) A(x o π) = B(x) X2 π 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 A B 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 G2 (x1 , x2 ) = x2 For a long code a, a(x + 1 ) = a(x) + 1 A(x o π – t11) + t1 = B(x – t21) + t2
Testing an Edge π A A(x) = (x1 + x2 + ...xR)/R B Randomly pick a vector x = (x1,x2,.. xR) Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) Randomly pick y = (y1,y2,.. yR) Test if a(x o π + y) – a(y) = b(x) + c 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 Long Code is a linear function! a(x o π + y ) – a(y) = a(x o π)
(ε,δ) – concentrated distribution All Fourier coefficients of P that are 2πδ away from origin are bounded by ε 4πδ ε
Examples • Epsilon Biased Spaces over [0,1]n are (ε, ½) – concentrated. • Epsilon Biased Spaces over Fp are (ε, 1/p) – concentrated. [BenSasson-Sudan-Vadhan-Widgerson] use Epsilon biased spaces to derandomize low degree tests(including linearity) • Any sufficiently slowly decaying probability distribution over integers.
Hardness of Label Cover [Raz 98] There exists γ > 0 such that Given a label cover instance Г =(U,V,E,R,π), it is NP-hard to distinguish between : • Г is completely satisfiable • No assignment satisfies more than 1/Rγ fraction of the edges.
Testing an Edge For a function π : [1,2,.. R] -> [1,2..R] A vector x = (x1,x2,.. xR) Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) a(x o π) = b(x) 1 2 1 2 πe a b πe 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X2 X2 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 A Linear Equation on Long code symbols 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 X1 A (x1 , x2 ) = x2 B (x1 , x2 ) = x1 a(2, 4) = b(4,2)
Hastad’s 3 Query PCP Randomly pick a vector x Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) Randomly pick y Perturb each coordinate of x o π + y independently with probability ε. To perturb just change the value to anything else in Fp Test if a(x o π + y+μ) – a(y) = b(x) + c Long codes/Dictator functions are stable against noise in the coordinates
Arithmetization Define A(x) = ωa(x) = e2πia(x)/p B(y) = ωb(y) = e2πib(y)/p Then : a(x o π + y+μ) – a(y) - b(x) = 0 if and only if 1/p ∑ (A(y)B(x) A(x o π + y+μ) )j = 1
Soundness Argument • As Linearity is tested, a(x) must have some similarity to a linear function. There have to be large Fourier coefficients Â(ω) • As we force a(x + 1) = a(x) + 1 the function a(x) is not similar to constant function. Thus, there are some nonzero ωwith large Â(ω) • There are large Â(ω) with ω having few non-zero labels.
Obstacles X2 Truncated region no more a group. For a constant fraction of x and y, (x+y) is outside the region. 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 G2 (x1 , x2 ) = x2 There could be exponentially many (in the dimension of space) large Fourier coefficients.
Modified Test P, P’ be decaying and (ε,δ)- concentrated distributions Pick x from distribution P Pick y from distribution P’ Perturb each coordinate of x o π + y independently with probability ε. To perturb, just change the value by a random number < M Test if A(x o π + y+μ) – A(y) = B(x) + c P’ P much more flatter than P’ P
Fourier Analysis (continued) Not too many large Fourier Coefficients Parseval’s Identity Inverse Fourier Transform
Fourier Domain Time Domain P(x) = 1
Properties Two dimensional long codes over F5 X2 • For a long code a, a(x + 1 ) = a(x) + 1 • Long codes/Dictator functions are stable against noise in the coordinates. 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 G2 (x1 , x2 ) = x2 X2 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 X1 G1 (x1 , x2 ) = x1