Beating Brute Force Search for Formula SAT and QBF SAT

Beating Brute Force Search forFormula SAT and QBF SAT RahulSanthanam University of Edinburgh

Plan of the Talk • Introduction • A New Upper Bound for Formula SAT • The Algorithm • The Analysis • Other Applications of Technique • New Upper Bounds for QBF SAT • Future Directions

Motivation • When can we beat brute-force search for NP-hard problems? • In practice, we do need to solve SAT and QBF SAT instances in various contexts (planning, verification etc.) • What can we say formally about algorithms solving for these problems?

Satisfiability Variants • k-SAT: Satisfiability of k-CNFs • CNF SAT: Satisfiability of formulae in conjunctive normal form • Formula SAT: Satisfiability of arbitrary Boolean formulae • Circuit SAT: Satisfiability of arbitrary Boolean circuits • Constant-depth Circuit SAT: Satisfiability of constant-depth circuits • Versions of above where variables can be existentially or universally quantified (PSPACE-complete)

Satisfiability Variants Circuit SAT Constant-depth Circuit SAT Formula SAT CNF SAT k-SAT

Algorithms for Satisfiability • m: input size; n: number of variables • Brute-force algorithm runs in time 2npoly(m) • We are interested in algorithms running in time 2n-f(n)poly(m), for f(n) asymptotically as large as possible • We call f the savings of the algorithm

Main Algorithmic Paradigms • DLL: Search for a solution by iteratively setting variables, and backtracking if a full assignment does not yield a solution • Random Walk: Start with an arbitrary assignment, and iteratively modify it to satisfy random unsatisfied clauses

Algorithms for Satisfiability: State of the Art • 3-SAT: ~ 1.3n [R06] • k-SAT: Savings Ω(n/k) [PPSZ98, S99] • CNF SAT: Savings Ω(n/log(m/n)) [S05] • Constant-depth Circuit SAT: Savings Ω(n) for m = O(n) [CIP09] • Formula SAT: ? • Circuit SAT: ? • QBF SAT: ?

Our New Upper Bounds • Theorem 1: Formula SAT can be solved in time 2n-Ω(n) on formulae of linear size • Theorem 2: QBF SAT can be solved in time 2n-Ω(n/log(n)) on bounded-read formulae (i.e., each variable occurring bounded number of times) • Theorem 3: QBF SAT can be solved in time 2n-Ω(n) on “structured” bounded-read formulae

Barriers to Improved SAT Algorithms • Exponential-Time Hypothesis (ETH) formulated by [IP99,IPZ01]: 3-SAT cannot be solved in time 2o(n) • Under ETH, we cannot achieve savings n-o(n) on k-SAT or Formula SAT/Circuit SAT on linear size • [W09] shows that savings of ω(log(n)) for Circuit SAT for superpoly m would imply NEXP not in SIZE(poly) (and analogously for Formula SAT)

The Upper Bound for Formula SAT • Theorem 1 (restated): There is a constant k > 2 for which there is a deterministic algorithm solving Formula SAT with savings n/ck on formulae of size cn • Note: if we could achieve k = 0.1, then by [W09] we would have new formula size lower bounds for ENP

The Algorithm • Search(φ) • Simplify φ according to simplification rules • If φ ↔ 1, return “yes” and halt • If φ ↔ 0, return “no” • Let x be the variable with max no of occurrences • Search (φ|x=0) • Search (φ|x=1) • Return “no”

Simplification Rules • 1 Λφ → φ • 1 V φ→ 1 • 0 Λφ→ 0 • 0 V φ→φ • x V φ→ x V φ|x=0 • x Λφ→ x Λφ|x=1 1-simplification rules 0-simplification rules Variable simplification rules

An Example (x V y) Λ (x V (x Λ y’ Λ z))

An Example (x V y) Λ (x V (0 Λ y’ Λ z)) (applying variable simpl. rule)

An Example (x V y) Λ (x V 0) (applying 0-simpl. rule)

An Example (x V y) Λ x (applying 0-simpl. rule)

An Example (x V y) Λ x (0 V y) Λ 0 1st recursive call

An Example (x V y) Λ x y Λ 0

An Example (x V y) Λ x 0

An Example (x V y) Λ x 0 (1 V y) Λ 1 2nd recursive call

An Example (x V y) Λ x 0 1 Λ 1

An Example (x V y) Λ x 0 1 Success!

Analyzing Recursion Tree Size • Typically done by solving a recurrence on m and n • Instead, we derive inspiration from the method of random restrictions (though our algorithm itself is deterministic) • A random restriction is a probability distribution on partial assignments to variables

Pure Random Restrictions • Let 0 < p < 1 be a parameter. Given n input variables, choose each one independently to be 1 w.p. (1-p)/2, 0 w.p. (1-p)/2 and free w.p p • Note that the choice of which variables to set is uniform, as well as the choice of which value to set a given variable to

Formula Size Lower Bounds via Pure Random Restrictions • [S61] proved that when a formula of size m is hit by a random restriction with parameter p, expected size of simplified formula is O(p1.5m) • Implies Parity requires formulae of size Ω(n1.5), by choosing p = O(1/n) • [H98] proved optimal result: expected size of simplified formula is O(p2m) • Implies best known formula size lower bound of n3-o(1) for a function in P

Formula Size Lower Bounds via Pure Random Restrictions • [S61] proved that when a formula of size m is hit by a random restriction with parameter p, expected size of simplified formula is O(p1.5m) • [H98] proved optimal result: expected size of simplified formula is O(p2m) • Note that for either result, if m = O(n), there is constant p such that expected size of simplified formula << pn

Adaptively Random Restrictions • Choice of which variable to set next is not uniform • Indeed, in our algorithm, setting of variables is deterministic, according to number of occurrences • Choice of value, however, is uniformly random • Greedy a.r.r: Variables are set sequentially in decreasing order of no. of occurrences • [S61] and [H98] results hold also for (1-p)n -step greedy a.r.r

Random Restrictions and Recursion Tree Size: Basic Idea • The simplified formulae at depth d of the recursion tree correspond to d-step greedy a.r.r • Lemma: After (1-p)n steps of greedy a.r.r, size of simplified formula << pn with prob. 1-2-Ω(n) (strong concentration version of Subbotovskaya’s result) • This impliesnon-trivial bound on size of recursion tree

Why a Concentration Bound Helps . . . . (1-p)n Good node: Simplified formula at node has size < pn/2 Bad node: Simplified formula has size >= pn/2

Why a Concentration Bound Helps . . . . (1-p)n Say we could show that fraction of bad nodes at depth (1-p)n is at most q. Then size of decision tree is at most 2n-pn/2 + q2n, which is 2n-Ω(n) if q=2-Ω(n)

Beating Brute Force Search for Exact Count • Count(φ;n) • Simplify φ according to simplification rules • If φ ↔ 1, return 2n • If φ ↔ 0, return 0 • Let x be the variable with max no of occurrences • Return Count(φ|x=0;n-1) + Count(φ|x=1;n-1) • Analysis same as before, giving same runtime

Detour: Decision Trees x1 x2 0 x3 0 0 1 Φ = x1 Λ x2 Λ x3

Average Case Lower Bounds for Formula Size • Recursion tree of Search algorithm yields decision tree for function computed by input formula • Proof of Theorem 1 shows that formulae of linear size have decision trees of size 2n-Ω(n)

Average Case Lower Bounds for Formula Size • Proof of Theorem 1 shows that formulae of linear size have decision trees of size 2n-Ω(n) • Let advantage of a decision tree T on a function f be Pr(T=f) – Pr(T≠f) • Lemma: Any decision tree of size s has advantage at most s/2n on Parity • Corollary : Any formula of linear size has advantage 2-Ω(n) on Parity

Analysis First, Algorithm Afterwards • Could other random restriction results be used to get new upper bounds? • Hastadhas a famous result showing that constant-depth circuits simplify under (pure) random restrictions • From this, we “extract” a randomized algorithm solving Constant-depth Circuit SAT with savings Ω(n1/(d+1)), where d is depth

A New Upper Bound for QBF SAT • Theorem 2 (re-stated): There is an algorithm running in time 2n-Ω(n/log(n)) solving QBF SAT on bounded-read formulae

Proof Idea for Theorem 2 • We would like to use random restriction method again, but we have no control over order in which variables are to be set • Let k be an upper bound on number of occurrences for any variable • By fixing all but t variables, our new formula will have size at most kt • But how does this help?

Proof Idea for Theorem 2 • Idea: Memoization • When t<n/(5k log(n)), simplified formula has size at most n/(5 log(n)), and hence can be represented by < n/4 bits • We can pre-compute answers to all such small QBF SAT questions in time 2n/2 and store them in random-access memory • Now, given an instance φ, we need only do exhaustive search over first n-t variables, replacing the rest of the search by a memory access

Structured Instances • Theorem 2 only gives Ω(n/log(n)) savings for bounded-read formulae • Can we get Ω(n) savings? • A set S of instances is structured if every instance of length n in S has a description of size o(n) from which it can be recovered efficiently • Eg., set of all sparse graphs is structured

Linear Savings for Structured Instances • Theorem 3 (re-stated): There is an algorithm for QBF SAT which has savings Ω(n) on any set of structured bounded-read formulae • Proof Idea: Formula obtained by fixing the first εn quantified variables of a QBF is also “somewhat structured” • In the memoization phase, we don’t need to store answers to all small formulae, but only for reasonably structured ones

Future Directions • Using the method of random restrictions in other settings or to get better parameters • More connections between upper bounds and lower bounds • Better upper bounds for QBF SAT

Thank You!

Beating Brute Force Search for Formula SAT and QBF SAT

Beating Brute Force Search for Formula SAT and QBF SAT

Presentation Transcript

SAT

COSC 3100 Brute Force and Exhaustive Search

SAT

Beating Brute Force Search for QBF Satisfiability

SAT

Local Search Methods for SAT

SAT

SAT

SAT

Challenges for SAT and QBF

sat

SAT

sat

SAT

SAT ,SAT preparation , SAT overview

Brute Force

Brute Force Search

Local Search for Distributed SAT