A 3-Query PCP over integers a.k.a Solving Sparse Linear Systems

A 3-Query PCP over integersa.k.aSolving Sparse Linear Systems Prasad Raghavendra Venkatesan Guruswami

Linear Equations Given a system of linear equations over reals, Find a solution. . Easy, Use Gaussian elimination.

Noise? Given a set of linear equations for which there is a solution satisfying 99% of the equations, What is the best solution that can be efficiently found? Can we atleast satisfy 1% of the equations?

10 years ago[Håstad STOC97, JACM 01] For any prime p, ε > 0, given a set of linear equations modulo p , it is NP-hard to distinguish between: • (1 – ε) – fraction of the equations can be satisfied. • 1/p + ε– fraction of the equations can be satisfied. All equations are of the form Xi + Xj = Xk + c (mod p)

Håstad’s 3-Query PCP[STOC97, JACM 01] X1 + X2 = X3 + 10 (mod p) X1 + X3 = X5 + 17 (mod p) X9 + X4 = X3 + 23 (mod p) X11 + X2 = X31 + 1 (mod p) X1 + X2 = X7 - 1 (mod p) X1 + X3 = X8 + p-10 (mod p) …….. …….. X9 + X7 = X3 + p/2 (mod p) X5 + X2 = X7 + 10 (mod p) Can be verified by 3 queries It is a 3-Query Probabilistically Checkable Proof system for NP Just have to read values of 3 variables to check an equation. Reals?

NP-hard • [Guruswami-Raghavendra 06,Feldman-Gopalan-Khot-Ponnuswami 06] • For any ε,δ > 0, Given a set of linear equations over reals, it is NP-hard to distinguish between the following two cases: • There is a solution that satisfies 1 – εfraction of the equations. • No solution satisfies more than δ fraction of the equations. Unlike Hastad’s result, equations are not sparse

Sparse Equations? • Solving sparse systems of equations important for many applications. • In the spirit of PCP theorem.. • Sparse equations have important connections to PCPs, linearity testing, Unique Games conjecture.

Sparse Equations over Reals • For any ε,δ > 0, Given a set of sparse linear equations, it is NP-hard to distinguish between: • (1 – ε) – fraction of the equations can be satisfied. • δ – fraction of the equations can be satisfied. X1 + X2 = X3 + 10 X1 + X3 = X5 + 17 … X9 + X4 = X3 + 23 X2 + X6 = X7 + 27 Some fixed constant accuracy, say ±1

Label Cover Problem U, V : set of vertices E : set of edges {1,2… R} : set of labels πe: constraint on edge e An assignment A satisfies an edge e = (u,v) E if πe (A(u)) = A(v) 1 2 3 . . R 1 6 1 2 3 . . R 3 5 3 u 3 πe 2 2 v π e (3)=2 1 5 4 7 U V Find an assignment A that satisfies maximum number of edges

Label Cover with Long Codes 1 6 3 5 3 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 πe 2 2 1 5 4 7 Write Long Codes of the Labels instead of the labels itself

Long Code A long code over a finite field F is a function: Gi : FX F… X FXFF Gi(x1 , x2, … xn ) = xi • n different long codes. • Long code over Fp represented by a table of pn values. • Linear Function.

Extending Hastad’s result to integers Use long code over integers X2 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 1 6 3 5 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 3 G2 (x1 , x2 ) = x2 πe 2 2 1 A Long Code over integers is an infinite object. 5 4 7 Just Truncate the long code!

Core Problem X2 X2 4 4 4 4 4 3 3 3 1 3 2 2 1 2 2 1 1 1 1 1 0 0 0 0 0 ? = 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 X1 X1 G2 (x1 , x2 ) = x2 G1 (x1 , x2 ) = x1 Given two supposed long codes, query 3 locations and test if they are close to some long code If test succeeds, must decode a small set of possible labels

Proof Obstacles • Linearity Testing • Decoding Labels

Linearity Testing [Blum-Luby-Rubinfeld] With G1 = {0,1}n ,G2 = {0,1}, if A is δ- far from linear function, then the test rejects with probability at least δ Given a function from an group G1 to group G2 (both abelian) A : G1 -> G2 Pick x,y uniformly at random from G1 Test if A(x) + A(y) = A(x+y)

Derandomized Linearity Testing For sufficiently large primes p, Linearity testing on truncated long code = Testing modulo p p 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 This should imply a derandomized linearity test. Total randomness used independent of the prime.

Proof Obstacles • Linearity Testing • Decoding Labels

Fourier Analysis Â(ω) = E[ A(x)e-iω•x ] • One Fourier coefficient corresponding to every linear function Pω(x) = ω•x forω =(ω1 , ω2 ,… ωR ) in FpR Â(ω) measures similarity with Pω(x) = ω•x

Hastad’s Decoding Pick a large Fourier coefficient Â(ω) of the long code, randomly pick a nonzero coordinate ωi Decode to label i Not too many large Fourier Coefficients Parseval’s Identity

Obstacle The distribution is not uniform, so a Fourier coefficient that appears is There could be exponentially many large Fourier coefficients!

Function Fourier Transform P(x) A(x) Large values in Fourier spectrum are clustered. P(x)A(x)

Decoding Labels Pick a large Fourier coefficient AP(ω) , randomly pick one of its large coordinate ωi Assign label i to the vertex • All large Fourier coefficients in the same cluster, will yield the same label with high probability. • There are very few clusters, so there are very few possible choices

Conclusion • Sparse linear equations over real numbers are hard to solve even with little noise. • In a weak sense, complete Derandomization of linearity testing is possible. • Two variable linear equations over reals?

Thank You

Testing an Edge I will just assign 0 to everything! Randomly pick a vector x = (x1,x2,.. xR) Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) Test if a(x o π) = b(x) π A B 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0

I will give something that does not look linear at all Testing an Edge Randomly pick a vector x = (x1,x2,.. xR) Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) A(x o π) = B(x) X2 π 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 A B 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 G2 (x1 , x2 ) = x2 For a long code a, a(x + 1 ) = a(x) + 1 A(x o π – t11) + t1 = B(x – t21) + t2

Testing an Edge π A A(x) = (x1 + x2 + ...xR)/R B Randomly pick a vector x = (x1,x2,.. xR) Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) Randomly pick y = (y1,y2,.. yR) Test if a(x o π + y) – a(y) = b(x) + c 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 Long Code is a linear function! a(x o π + y ) – a(y) = a(x o π)

(ε,δ) – concentrated distribution All Fourier coefficients of P that are 2πδ away from origin are bounded by ε 4πδ ε

Examples • Epsilon Biased Spaces over [0,1]n are (ε, ½) – concentrated. • Epsilon Biased Spaces over Fp are (ε, 1/p) – concentrated. [BenSasson-Sudan-Vadhan-Widgerson] use Epsilon biased spaces to derandomize low degree tests(including linearity) • Any sufficiently slowly decaying probability distribution over integers.

Hardness of Label Cover [Raz 98] There exists γ > 0 such that Given a label cover instance Г =(U,V,E,R,π), it is NP-hard to distinguish between : • Г is completely satisfiable • No assignment satisfies more than 1/Rγ fraction of the edges.

Testing an Edge For a function π : [1,2,.. R] -> [1,2..R] A vector x = (x1,x2,.. xR) Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) a(x o π) = b(x) 1 2 1 2 πe a b πe 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X2 X2 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 A Linear Equation on Long code symbols 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 X1 A (x1 , x2 ) = x2 B (x1 , x2 ) = x1 a(2, 4) = b(4,2)

Hastad’s 3 Query PCP Randomly pick a vector x Define x o π = (xπ(1), xπ(2) , xπ(3) ….xπ(R)) Randomly pick y Perturb each coordinate of x o π + y independently with probability ε. To perturb just change the value to anything else in Fp Test if a(x o π + y+μ) – a(y) = b(x) + c Long codes/Dictator functions are stable against noise in the coordinates

Arithmetization Define A(x) = ωa(x) = e2πia(x)/p B(y) = ωb(y) = e2πib(y)/p Then : a(x o π + y+μ) – a(y) - b(x) = 0 if and only if 1/p ∑ (A(y)B(x) A(x o π + y+μ) )j = 1

Soundness Argument • As Linearity is tested, a(x) must have some similarity to a linear function. There have to be large Fourier coefficients Â(ω) • As we force a(x + 1) = a(x) + 1 the function a(x) is not similar to constant function. Thus, there are some nonzero ωwith large Â(ω) • There are large Â(ω) with ω having few non-zero labels.

Obstacles X2 Truncated region no more a group. For a constant fraction of x and y, (x+y) is outside the region. 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 G2 (x1 , x2 ) = x2 There could be exponentially many (in the dimension of space) large Fourier coefficients.

Modified Test P, P’ be decaying and (ε,δ)- concentrated distributions Pick x from distribution P Pick y from distribution P’ Perturb each coordinate of x o π + y independently with probability ε. To perturb, just change the value by a random number < M Test if A(x o π + y+μ) – A(y) = B(x) + c P’ P much more flatter than P’ P

Fourier Analysis (continued) Not too many large Fourier Coefficients Parseval’s Identity Inverse Fourier Transform

Fourier Domain Time Domain P(x) = 1

Properties Two dimensional long codes over F5 X2 • For a long code a, a(x + 1 ) = a(x) + 1 • Long codes/Dictator functions are stable against noise in the coordinates. 4 4 4 4 4 3 3 3 3 3 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 X1 G2 (x1 , x2 ) = x2 X2 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 X1 G1 (x1 , x2 ) = x1

A 3-Query PCP over integers a.k.a Solving Sparse Linear Systems

A 3-Query PCP over integers a.k.a Solving Sparse Linear Systems

Presentation Transcript

INTEGERS LINEAR ALGEBRA

Solving Systems of Linear Equations

Solution of Sparse Linear Systems

Solving Linear Systems

3-2: Solving Linear Systems

Solving Linear Systems

3.4 “Solving Linear Systems with 3 Variables”

3.1 – Graphically Solving Linear Systems

Solving Large Sparse Linear Systems

Solving three-variable linear systems

Solving Linear Systems Algebraically

Solution of Sparse Linear Systems

Solving Linear Systems: Iterative Methods and Sparse Systems

Solving Linear Systems

Solving Linear Systems by Linear Combinations

Solving Linear Systems by Linear Combinations

Solving Linear Systems Algebraically

Solving Linear Systems

Solving Linear Systems

Solving Linear Systems by Linear Combinations

3-3 Graphing and Solving Systems of Linear Inequalities