Average-case Complexity Luca Trevisan UC Berkeley
Distributional Problem <P,D> • P computational problem, e.g. SAT • D distribution over inputs, e.g. n vars, 10n clauses
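To make the example distribution concrete, here is a minimal sampler sketch in Python (the slide only says "n vars, 10n clauses"; the 3-literal clause shape and the random choice of signs are assumptions for illustration):

```python
import random

def sample_formula(n, seed=None):
    """Sample from the example distribution D: a CNF formula over n variables
    with 10n clauses.  Assumed here: each clause has 3 distinct variables with
    random signs (the slides do not pin this down)."""
    rng = random.Random(seed)
    clauses = []
    for _ in range(10 * n):
        variables = rng.sample(range(1, n + 1), 3)
        clauses.append(tuple(v if rng.random() < 0.5 else -v for v in variables))
    return clauses

# A distributional problem is the pair <SAT, D>:
# an average-case algorithm must be correct on most formulas drawn like this.
# example = sample_formula(100)
```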
Positive Results: • Algorithm that solves P efficiently on most inputs • Interesting when P useful problem, D distribution arising “in practice” Negative Results: • If <assumption>, then no such algorithm • If P useful, D natural: guides algorithm design • If P,D manufactured: still interesting for crypto, derandomization
Holy Grail If there is algorithm A that solves P efficiently on most inputs from D Then there is an efficient worst-case algorithm for [the complexity class] P [belongs to]
Part (1) In which the Holy Grail proves elusive
The Permanent Perm(M) := Σ_σ Π_i M(i,σ(i)) Perm() is #P-complete Lipton (1990): If there is algorithm that solves Perm() efficiently on most random matrices, Then there is an algorithm that solves it efficiently on all matrices (and BPP=#P)
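A minimal sketch pinning down the definition (brute force over all n! permutations, purely illustrative; the point of the slide is that computing Perm is #P-complete):

```python
from itertools import permutations

def permanent(M):
    """Perm(M) = sum over permutations s of prod_i M[i][s(i)].
    Brute-force n!-time evaluation of the definition."""
    n = len(M)
    total = 0
    for s in permutations(range(n)):
        prod = 1
        for i in range(n):
            prod *= M[i][s[i]]
        total += prod
    return total

# permanent([[1, 2], [3, 4]]) == 1*4 + 2*3 == 10
```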
Lipton’s Reduction Suppose operations are over finite field of size >n A is good-on-average algorithm (wrong on < 1/(10(n+1)) fraction of matrices) Given M, pick random X, compute A(M+X), A(M+2X),…,A(M+(n+1)X) Whp the same as Perm(M+X),Perm(M+2X),…,Perm(M+(n+1)X)
Lipton’s Reduction Given Perm(M+X),Perm(M+2X),…,Perm(M+(n+1)X) Find univariate degree-n polynomial p such that p(t) = Perm(M+tX) for all t Output p(0)
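A minimal sketch of the whole reduction (assuming a hypothetical good-on-average oracle A that returns Perm mod p for most matrices, and a prime p > n+1; the rare event that some query is answered incorrectly is not handled here):

```python
import random

def lipton_reduction(M, A, p):
    """Worst-case-to-average-case reduction for the permanent over GF(p).
    M: n x n matrix with entries in {0,...,p-1}
    A: hypothetical oracle, correct on most matrices (A(N) = Perm(N) mod p)
    p: prime with p > n+1
    Returns Perm(M) mod p with high probability over the random shift X."""
    n = len(M)
    # Random shift: for each fixed t != 0, M + t*X is a uniformly random matrix,
    # so A is likely to answer every one of the n+1 queries correctly.
    X = [[random.randrange(p) for _ in range(n)] for _ in range(n)]
    ts = list(range(1, n + 2))
    ys = []
    for t in ts:
        N = [[(M[i][j] + t * X[i][j]) % p for j in range(n)] for i in range(n)]
        ys.append(A(N) % p)
    # q(t) = Perm(M + t*X) is a polynomial of degree n in t, so n+1 values
    # determine it; Lagrange-interpolate and return q(0) = Perm(M).
    result = 0
    for i, (ti, yi) in enumerate(zip(ts, ys)):
        num, den = 1, 1
        for j, tj in enumerate(ts):
            if j != i:
                num = (num * (-tj)) % p
                den = (den * (ti - tj)) % p
        result = (result + yi * num * pow(den, -1, p)) % p
    return result
```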
Improvements / Generalizations • Can handle constant fraction of errors [Gemmell-Sudan] • Works for PSPACE-complete, EXP-complete,… [Feigenbaum-Fortnow, Babai-Fortnow-Nisan-Wigderson] • Encode the problem as a polynomial
Strong Average-Case Hardness • [Impagliazzo, Impagliazzo-Wigderson] Manufacture problems in E, EXP, such that • Size-t circuit correct on ½ + 1/t inputs implies • Size poly(t) circuit correct on all inputs Motivation: [Nisan-Wigderson] P=BPP if there is problem in E of exponential average-case complexity
Strong Average-Case Hardness • [Impagliazzo, Impagliazzo-Wigderson] Manufacture problems in E, EXP, such that • Size-t circuit correct on ½ + 1/t inputs implies • Size poly(t) circuit correct on all inputs Motivation: [Impagliazzo-Wigderson] P=BPP if there is problem in E of exponential worst-case complexity
Open Question 1 • Suppose there are worst-case intractable problems in NP • Are there average-case intractable problems?
Strong Average-Case Hardness • [Impagliazzo, Impagliazzo-Wigderson] Manufacture problems in E, EXP, such that • Size-t circuit correct on ½ + 1/t inputs implies • Size poly(t) circuit correct on all inputs • [Sudan-T-Vadhan] • IW result can be seen as coding-theoretic • Simpler proof by explicitly coding-theoretic ideas
Encoding Approach • Viola proves that error-correcting codes cannot be computed in AC0 • Computing the exponential-size error-correcting encoding is not possible in PH
Problem-specific Approaches? [Ajtai] • Proves that there is a lattice problem such that: • If there is efficient average-case algorithm • There is efficient worst-case approximation algorithm
Ajtai’s Reduction • Lattice Problem • If there is efficient average-case algorithm • There is efficient worst-case approximation algorithm The approximation problem is in NP ∩ coNP, not NP-hard
Holy Grail • Distributional Problem: • If there is efficient average-case algorithm • P=NP (or NP in BPP, or NP has poly-size circuits,…) Already seen: no “encoding” approach works Can extensions of Ajtai’s approach work?
A Class of Approaches • L problem in NP, D distribution of inputs • R reduction of SAT to <L,D>: • Given instance f of SAT, • R produces instances x1,…,xk of L, each distributed according to D • Given L(x1),…,L(xk), R is able to decide f If there is good-on-average algorithm for <L,D>, we solve SAT in polynomial time [cf. Lipton’s work on Permanent]
A Class of Approaches • L,W problems in NP, D (samplable) distribution of inputs • R reduction of W to <L,D> • Given instance w of W, • R produces instances x1,…,xk of L, each distributed according to D • Given L(x1),…,L(xk), R is able to decide w If there is good-on-average algorithm for <L,D>, we solve W in polynomial time; Can W be NP-complete?
A Class of Approaches • Given instance w of W, • R produces instances x1,…,xk of L, each distributed according to D • Given L(x1),…,L(xk), R is able to decide w Given good-on-average algorithm for <L,D>, we solve W in polynomial time; If we have such reduction, and W is NP-complete, we have Holy Grail! Feigenbaum-Fortnow: W is in “coNP”
Feigenbaum-Fortnow • Given instance w of W, • R produces instances x1,…,xk of L, each distributed according to D • Given L(x1),…,L(xk), R is able to decide w • Using R, Feigenbaum-Fortnow design a 2-round interactive proof with advice for coW • Given w, Prover convinces Verifier that R rejects w after seeing L(x1),…,L(xk)
Feigenbaum-Fortnow (simple case) • Given instance w of W, R produces an instance x of L distributed as in D • w in W iff x in L Suppose we know Pr_D[ x in L ] = ½ Protocol between Prover P and Verifier V: • V runs R(w) independently m times, obtaining x1, x2, …, xm, and sends them to P • P replies, for each xi, with either (Yes, witness wi) or No • V accepts iff all m simulations of R reject and m/2 ± sqrt(m) of the answers are certified Yes
Feigenbaum-Fortnow (general case) • Given instance w of W, let p := Pr[ xi in L ] • R produces instances x1,…,xk of L, each distributed according to D • Given L(x1),…,L(xk), R is able to decide w Protocol between Prover P and Verifier V: • V runs R(w) independently m times, obtaining queries x1^(1),…,xk^(1), …, x1^(m),…,xk^(m), and sends all km of them to P • P replies, for each query, with either (Yes, witness) or No • V accepts iff • pkm ± sqrt(pkm) of the answers are YES with valid certificates, and • R rejects in each of the m simulated runs
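A minimal sketch of the verifier's final check, written for the single-query case of the previous slide (all names and interfaces here are hypothetical; the real protocol handles k queries per run and relies on the advice value p):

```python
import math

def ff_verifier_accepts(answers, p, certificate_ok, run_rejects):
    """Sketch of the Feigenbaum-Fortnow acceptance predicate (single-query case).

    answers        -- prover's reply for each sampled instance x_i:
                      ("YES", certificate) or ("NO", None)
    p              -- the advice value Pr_D[ x in L ] (1/2 on the previous slide)
    certificate_ok -- callable: does the i-th certificate really prove x_i in L?
    run_rejects    -- callable: does the i-th simulation of R reject w,
                      given the claimed answer for x_i?
    """
    m = len(answers)
    certified_yes = sum(1 for i, (claim, cert) in enumerate(answers)
                        if claim == "YES" and certificate_ok(i, cert))
    # The prover cannot forge YES answers (certificates are checked), and it
    # cannot hide too many true YESes behind NOs: the certified-YES count has
    # to land near its expectation p*m.
    if abs(certified_yes - p * m) > math.sqrt(m):
        return False
    # Accept the claim "R rejects w" only if every simulated run rejects.
    return all(run_rejects(i, claim) for i, (claim, _) in enumerate(answers))
```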
Generalizations • Bogdanov-Trevisan: arbitrary non-adaptive reductions • Main Open Question: What happens with adaptive reductions?
Open Question 1 Prove the following: Suppose: W,L are in NP, D is samplable distribution, R is poly-time reduction such that • If A solves <L,D> on 1-1/poly(n) frac of inputs • Then R with oracle A solves W on all inputs Then W is in “coNP”
By the Way • Probably impossible by current techniques: If NP not contained in BPP There is a samplable distribution D and an NP problem L Such that <L,D> is hard on average
By the Way • Probably impossible by current techniques: If NP not contained in BPP There is a samplable distribution D and an NP problem L Such that for every efficient A A makes many mistakes solving L on D
By the Way • Probably impossible by current techniques: If NP not contained in BPP There is a samplable distribution D and an NP problem L Such that for every efficient A A makes many mistakes solving L on D • [Gutfreund-Shaltiel-Ta-Shma] Prove: If NP not contained in BPP For every efficient A There is a samplable distribution D Such that A makes many mistakes solving SAT on D
Part (2) In which we amplify average-case complexity and we discuss a short paper
Revised Goal • Proving “If NP contains worst-case intractable problems, then NP contains average-case intractable problems” might be impossible • Average-case intractability comes in different quantitative degrees • Equivalence?
Average-Case Hardness What does it mean for <L,D> to be hard-on-average? Suppose A is an efficient algorithm, and sample x ~ D. Then A(x) is noticeably likely to be wrong. How noticeably?
Average-Case Hardness Amplification Ideally: • If there is <L,Uniform>, L in NP, such that every poly-time algorithm (poly-size circuit) makes > 1/poly(n) mistakes • Then there is <L’,Uniform>, L’ in NP, such that every poly-time algorithm (poly-size circuit) makes > ½ - 1/poly(n) mistakes
Amplification “Classical” approach: Yao’s XOR Lemma Suppose: for every efficient A Pr_D[ A(x) = L(x) ] < 1 - d Then: for every efficient A’ Pr_D[ A’(x1,…,xk) = L(x1) xor … xor L(xk) ] < ½ + (1-2d)^k + negligible
Yao’s XOR Lemma Suppose: for every efficient A Pr_D[ A(x) = L(x) ] < 1 - d Then: for every efficient A’ Pr_D[ A’(x1,…,xk) = L(x1) xor … xor L(xk) ] < ½ + (1-2d)^k + negligible Note: computing L(x1) xor … xor L(xk) need not be in NP, even if L is in NP
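For a sense of the parameters (a back-of-the-envelope calculation, not from the slides): with d = 1/n and k = n^2 copies, (1-2d)^k = (1 - 2/n)^(n^2) ≈ e^(-2n), so the bound becomes ½ + e^(-2n) + negligible. Starting from mild hardness (error rate only 1/n), the XOR of polynomially many independent answers is essentially unpredictable.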
O’Donnell Approach Suppose: for every efficient A Pr_D[ A(x) = L(x) ] < 1 - d Then: for every efficient A’ Pr_D[ A’(x1,…,xk) = g(L(x1), …, L(xk)) ] < ½ + small(k, d) For carefully chosen monotone function g Now computing g(L(x1),…, L(xk)) is in NP, if L is in NP
Amplification (Circuits) Ideally: • If there is <L,Uniform>, L in NP, such that every poly-time algorithm (poly-size circuit) makes > 1/poly(n) mistakes • Then there is <L’,Uniform>, L’ in NP, such that every poly-time algorithm (poly-size circuit) makes > ½ - 1/poly(n) mistakes Achieved by [O’Donnell, Healy-Vadhan-Viola] for poly-size circuits
Amplification (Algorithms) • If there is <L,Uniform>, L in NP, such that every poly-time algorithm makes > 1/poly(n) mistakes • Then there is <L’,Uniform>, L’ in NP, such that every poly-time algorithm makes > ½ - 1/polylog(n) mistakes [T] [Impagliazzo-Jaiswal-Kabanets-Wigderson] ½ - 1/poly(n) but for P^NP_||
Open Question 2 Prove: • If there is <L,Uniform>, L in NP, such that every poly-time algorithm makes > 1/poly(n) mistakes • Then there is <L’,Uniform>, L’ in NP, such that every poly-time algorithm makes > ½ - 1/poly(n) mistakes
Completeness • Suppose we believe there is L in NP, D distribution, such that <L,D> is hard • Can we point to a specific problem C such that <C,Uniform> is also hard?
Completeness • Suppose we believe there is L in NP, D distribution, such that <L,D> is hard • Can we point to a specific problem C such that <C,Uniform> is also hard? Must put restriction on D, otherwise assumption is the same as P != NP
Side Note Let K be the distribution in which x has probability proportional to 2^{-K(x)} (K(x) = Kolmogorov complexity of x) Suppose A solves <L,K> on 1-1/poly(n) fraction of inputs of length n Then A solves L on all but finitely many inputs Exercise: prove it
Completeness • Suppose we believe there is L in NP, D samplable distribution, such that <L,D> is hard • Can we point to a specific problem C such that <C,Uniform> is also hard?
Completeness • Suppose we believe there is L in NP, D samplable distribution, such that <L,D> is hard • Can we point to a specific problem C such that <C,Uniform> is also hard? Yes we can! [Levin, Impagliazzo-Levin]
Levin’s Completeness Result • There is an NP problem C, such that • If there is L in NP, D computable distribution, such that <L,D> is hard • Then <C,Uniform> is also hard
Reduction Need to define reduction that preserves efficiency on average (Note: we haven’t yet defined efficiency on average) R is a (Karp) average-case reduction from <A,D_A> to <B,D_B> if • x in A iff R(x) in B • R(D_A) is “dominated” by D_B: Pr[ R(D_A)=y ] < poly(n) * Pr[ D_B=y ]
Reduction R is an average-case reduction from <A,D_A> to <B,D_B> if • x in A iff R(x) in B • R(D_A) is “dominated” by D_B: Pr[ R(D_A)=y ] < poly(n) * Pr[ D_B=y ] Suppose we have good algorithm for <B,D_B> Then algorithm also good for <B,R(D_A)> Solving <A,D_A> reduces to solving <B,R(D_A)>
Reduction If Pr[ Y=y ] < poly(n) * Pr[ D_B=y ] and we have good algorithm for <B,D_B> Then algorithm also good for <B,Y> Reduction works for any notion of average-case tractability for which above is true.
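Spelling out why domination suffices (one line of calculation, consistent with the slide but not written there): if the algorithm errs exactly on a set S, then Pr_Y[ S ] = Σ_{y in S} Pr[ Y=y ] < poly(n) * Σ_{y in S} Pr[ D_B=y ] = poly(n) * Pr_{D_B}[ S ]. So an algorithm with error probability e under D_B has error probability at most poly(n)*e under Y = R(D_A), and composing with R gives a good-on-average algorithm for <A,D_A>.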
Levin’s Completeness Result Follow presentation of [Goldreich] • If <BH,Uniform> is easy on average • Then for every L in NP, every D computable distribution, <L,D> is easy on average BH is non-deterministic Bounded Halting: given <M,x,1^t>, does M(x) accept within t steps?
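One remark on the key step, hedged (following the standard presentation of this result, e.g. in Goldreich's notes, and not spelled out on this slide): since D is computable, its cumulative distribution function can be evaluated, and each x can be encoded by roughly min(|x|, log 1/Pr_D[x]) bits read off that CDF. The reduction packages this encoding into a Bounded Halting instance, so inputs that are likely under D map to short instances, and the image distribution is dominated by the uniform distribution in the sense of the previous slides.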