A primer on SAT

The puzzling SAT puzzle

You have a bunch of switches: $x_1, \dots, x_k$
Each switch turn on/off different lights.
You want to turn at least one light on in each line:


+		+	-
+	-	-	+
-	+		+
+			+

This problem is known as the SAT problem.

SAT as Propositional Logic

Each switch is a Boolean variable xx
- true is switch on
- false is switch off
Each cell is a literal: either $x$ or $\neg x$
Each row is a clause, ie a disjunction of literal: “at least one”
The game = a conjunction of clauses, aka a CNF Formula

$x$	$y$	$z$	$w$
+		+	-	$x \vee z \vee \neg w$
+	-	-	+	$x \vee \neg y \vee \neg z \vee w$
-	+		+	$\neg x \vee y \vee w$
+			+	$x \vee w$

$(x \vee z \vee \neg w) \wedge (x \vee \neg y \vee \neg z \vee w) \wedge (\neg x \vee y \vee w) \wedge (x \vee w)$

SAT is Hard

Obvious method for SAT: try every combination of switches.
Takes $2^n$ tries for $n$ switches. Does not scale.
Surely there is a better algorithm?
- in practice yes (stay tunned)
- in theory, not sure… every known algorithm needs $2^{n - o(1)}$ steps.

Strong Exponential Time Hypothesis: we believe SAT cannot be solved in time $2^{\alpha n}$ for $\alpha < 1$ in the worst case.

SAT as a “Programming Language”

A reason for the hardness of SAT: its expressivity.

We can encode many things with SAT.
Sudoku example:
- $x_{i,j}^k$ is true if entry $i,j$ is equal to $k$
- $\neg x_{i,j}^1 \vee \neg x_{i,j}^2$ : entry $(i,j)$ cannot be both set to $1$ and $2$
- $x_{i,j}^1 \vee x_{i,j}^2 \vee \dots \vee x_{i,j}^9$ : entry $(i,j)$ has at least one value.
- $\neg x_{1,j}^k \vee \neg x_{1,p}^k$ for $p<j$ : entries $(1,j)$ and $(1,p)$ cannot be both set at value $k$ (because they are on the same line)
- $\dots$
A solution to a Sudoku = a solution to the CNF formula.

We can “efficiently” encode many types of problem in a CNF formula.

Modeling reasoning

Sudoku: specify constraints of the system and let the solver figures out an answer.
Many systems are described through constraints.
Configuration problems:
- Define a set of “possible” products with options.
- Physical, industrial, marketing constraints.
- E.g.: bikes, computers, software (compilation options).
- “Color gold only possible if fancy frame and fancy bell are choosen”: $(\neg x_{gold} \vee x_{fancyframe}) \wedge (\neg x_{gold} \vee x_{francybell})$
We can use a more expressive language (“Constraint Satisfaction Problem”), see e.g. xcsp, which is often “compiled” into SAT before solving.

Modeling computation

We can actually encode any non-deterministic Turing machine:

0	0	1	0	0	1	0 1	0	0	0	0	0	0	0	1	0	0
					↥	↥	↥
					$q'$	$q$	$q'$
1	2	3	4	5	6	7	8	9	10	11	12	13	14	15	16	17
					$x_{q''}^{6,t+1}$	$x_{q}^{7,t}$	$x_{q'}^{8,t+1}$

$x_{q,0}^{7,t}$ : at step $t$ , the head is a position $7$ in state $q$ .
$r_0^{7,t}$ : the position 7 at time $t$ contains a $0$ .
Transitions
- $\tau_1 = (q,0): (q', 1, \rightarrow)$
- $\tau_2 = (q,0): (q'', 0, \leftarrow)$
$(x_{q}^{7,t} \wedge r_0^{7,t}) \Rightarrow$ $(x_{q'}^{8,t+1} \wedge r_1^{7,t+1}) \vee$ $(x_{q''}^{6,t+1} \wedge r_0^{7,t+1})$

NP-completeness

This shows that SAT is NP-complete! (Cook, Levin, 1973)

For any NP problem (“resonable”), we can write a formula that is SAT if and only if the problem has a solution.
Most people believes that this cannot be done in polynomial time.
Why should we try to solve such a hard problem?
- Efficient algorithms for SAT in practice = efficient algorithm for many other problems
- Very simple structure: allows for low level optimization.

We need SAT solvers!

A Branching algorithm

$F = (x_1 \vee x_4) \wedge (\neg x_1 \vee \neg x_2) \wedge (x_1 \vee x_3 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)$

(x_1 \vee x_4) \wedge (\neg x_1 \vee \neg x_2) \wedge (x_1 \vee x_3 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)

\langle \rangle

(x_1 \vee x_4) \wedge (\neg x_1 \vee \neg x_2) \wedge (x_1 \vee x_3 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)

\langle x_1 \mapsto 1 \rangle

(x_2 \vee x_4) \wedge (x_2 \vee \neg x_4) \wedge (\neg x_2)

\langle x_1 \mapsto 1 \rangle

(x_2 \vee x_4) \wedge (x_2 \vee \neg x_4) \wedge (\neg x_2)

\langle x_1 \mapsto 1,x_2 \mapsto 1 \rangle

()

\langle x_1 \mapsto 1,x_2 \mapsto 1 \rangle

(x_2 \vee x_4) \wedge (x_2 \vee \neg x_4) \wedge (\neg x_2)

\langle x_1 \mapsto 1,x_2 \mapsto 0 \rangle

(\neg x_4) \wedge (x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0 \rangle

(\neg x_4) \wedge (x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_4 \mapsto 0 \rangle

()

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_4 \mapsto 0 \rangle

(\neg x_4) \wedge (x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_4 \mapsto 1 \rangle

()

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_4 \mapsto 1 \rangle

(x_1 \vee x_4) \wedge (\neg x_1 \vee \neg x_2) \wedge (x_1 \vee x_3 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)

\langle x_1 \mapsto 0 \rangle

(\neg x_2 \vee \neg x_3 \vee x_4) \wedge (x_3 \vee \neg x_4) \wedge (x_4)

\langle x_1 \mapsto 0 \rangle

(\neg x_2 \vee \neg x_3 \vee x_4) \wedge (x_3 \vee \neg x_4) \wedge (x_4)

\langle x_1 \mapsto 0,x_4 \mapsto 1 \rangle

(x_3)

\langle x_1 \mapsto 0,x_4 \mapsto 1 \rangle

(x_3)

\langle x_1 \mapsto 0,x_3 \mapsto 1,x_4 \mapsto 1 \rangle

()

\langle x_1 \mapsto 0,x_3 \mapsto 1,x_4 \mapsto 1 \rangle

Adding Unit Propagation

(x_1 \vee x_4) \wedge (\neg x_1 \vee \neg x_2) \wedge (x_1 \vee x_3 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)

\langle \rangle

(x_1 \vee x_4) \wedge (\neg x_1 \vee \neg x_2) \wedge (x_1 \vee x_3 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)

\langle x_1 \mapsto 1 \rangle

(x_2 \vee x_4) \wedge (x_2 \vee \neg x_4) \wedge

(\neg x_2)

\langle x_1 \mapsto 1 \rangle

(x_2 \vee x_4) \wedge (x_2 \vee \neg x_4) \wedge

(\neg x_2)

\langle x_1 \mapsto 1,

x_2 \mapsto 0

\rangle

(\neg x_4) \wedge (x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0 \rangle

(\neg x_4) \wedge (x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_4 \mapsto 0 \rangle

()

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_4 \mapsto 0 \rangle

(x_1 \vee x_4) \wedge (\neg x_1 \vee \neg x_2) \wedge (x_1 \vee x_3 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee \neg x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)

\langle x_1 \mapsto 0 \rangle

(\neg x_2 \vee \neg x_3 \vee x_4) \wedge (x_3 \vee \neg x_4) \wedge (x_4)

\langle x_1 \mapsto 0 \rangle

(\neg x_2 \vee \neg x_3 \vee x_4) \wedge (x_3 \vee \neg x_4) \wedge (x_4)

\langle x_1 \mapsto 0,x_4 \mapsto 1 \rangle

(x_3)

\langle x_1 \mapsto 0,x_4 \mapsto 1 \rangle

(x_3)

\langle x_1 \mapsto 0,x_3 \mapsto 1,x_4 \mapsto 1 \rangle

()

\langle x_1 \mapsto 0,x_3 \mapsto 1,x_4 \mapsto 1 \rangle

DPLL

If every clause are satisfied, return $1$ .
If one clause is refuted, return $0$ .

UNIT PROPAGATION: If there is a clause with one variable not set, pick it and satisfy it, e.g $C = \neg x$ , pick $x=0,b=0$

PURE LITERAL ELIMINATION: If there is a variable $x$ appearing only positively (resp negatively), pick $x$ and $b=1$ (resp. $b=0$ ).

Otherwise, pick a variable $x$ and $b \in \{0,1\}$ using a good heuristic
Try to recursively find a solution to $F[x \gets b]$ .
If no solution, backtrack and try $F[x \gets 1-b]$ .

Problem: Backtracks are dealt with independently, some insights are lost.

CDCL Solvers

Conflict Driven Clause Learning: try to learn why.

$(x \vee y) \wedge (\neg x \vee u \neg z_1) \wedge (z_1 \vee \neg z_2) \wedge (z_2 \vee \neg y) \vee (z_2 \vee z_1 \vee y)$
Set $x=1, u=0$ : Unit Propagation of $z_1=0, z_2=0, y=0$ : conflict found.
We know that, independently of the value of $z_1,z_2,y$ , $(x=1,u=0)$ cannot be part of a solution.
Add clause $\neg x \vee u$ to catch it sooner.
- If we later set $x=1$ , we have UP on $u=1$ !
- Speed up later branches.
- Reduce overhead of recursion.

CDCL-solvers (+good heuristics, +good data structures, +“agressive restarts”):

really efficient on many instances
bruteforce at its finest
naturally unravel hidden structures in the encoding

#SAT

Counting the number of solutions


+		+	-
+	-	-	+
-	+		+
+			+

With 1: {1,2}, {1,2,3}, {1,2,3,4}, {1,2,4}, {1,4}, {1,3,4}
Without 1 and with 2: {2,3,4}
Without 1,2 and with 3: {3,4}

This problem is known as #SAT: given a CNF $F$ , return $\#F$ , its number of solutions.

Why counting?

Modeling constraints: counting enables better “reasoning”.

Configuration problem:
- Find some options compatible with a lot of product
- Measure “probability” of an option (number of product having it)
- Counting (with weights) = probabilistic reasoning
Encode other reasoning tasks such as Bayesian inference

How hard is it?

Counting is way harder than finding a solution
Intution: we have to explore the entire search space
Complexity theory: #P-complete problem.
Toda’s Theorem: one oracle call to #P allows to solve the whole polynomial hierarchy.

Counting by enumeration

$(x_1 \vee x_3 \vee \neg x_4) \wedge (x_1 \vee x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)$

(x_1 \vee x_3 \vee \neg x_4) \wedge (x_1 \vee x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)

\langle \rangle

(x_1 \vee x_3 \vee \neg x_4) \wedge (x_1 \vee x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)

\langle x_1 \mapsto 1 \rangle

(x_2 \vee x_4)

\langle x_1 \mapsto 1 \rangle

(x_2 \vee x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 1 \rangle

()

\langle x_1 \mapsto 1,x_2 \mapsto 1 \rangle

(x_2 \vee x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0 \rangle

(x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0 \rangle

(x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_4 \mapsto 1 \rangle

()

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_4 \mapsto 1 \rangle

(x_1 \vee x_3 \vee \neg x_4) \wedge (x_1 \vee x_4) \wedge (\neg x_1 \vee x_2 \vee x_4) \wedge (x_1 \vee \neg x_2 \vee \neg x_3 \vee x_4)

\langle x_1 \mapsto 0 \rangle

(\neg x_2 \vee \neg x_3 \vee x_4) \wedge (x_3 \vee \neg x_4) \wedge (x_4)

\langle x_1 \mapsto 0 \rangle

(\neg x_2 \vee \neg x_3 \vee x_4) \wedge (x_3 \vee \neg x_4) \wedge (x_4)

\langle x_1 \mapsto 0,x_4 \mapsto 1 \rangle

(x_3)

\langle x_1 \mapsto 0,x_4 \mapsto 1 \rangle

(x_3)

\langle x_1 \mapsto 0,x_3 \mapsto 1,x_4 \mapsto 1 \rangle

()

\langle x_1 \mapsto 0,x_3 \mapsto 1,x_4 \mapsto 1 \rangle

Counting by enumeration

$\#F = \#F[x=0]+\#F[x=1]$

Run DPLL with a counter.
When a model is found: add $2^Y$ to the counter where $Y$ are unassigned variables.
Keep backtracking until every branch are explored.
Use unit propagation: $\#F = \#F[x=1]$ if there is a clause $(x)$ .
Pure literal cannot be eliminated: $(x \vee y) \wedge (x \vee \neg z)$ has solutions with $x=0$ !

Improvements possible: sometimes, we are redoing expensive computation.

Caching in Exhaustive DPLL

Cache precomputed values
Example: F=(x1∨x2∨x3)∧(¬x1∨x2∨x3)∧(x1∨¬x4)∧(¬x1∨¬x4)F = (x_1 \vee x_2 \vee x_3) \wedge (\neg x_1 \vee x_2 \vee x_3) \wedge (x_1 \vee \neg x_4) \wedge (\neg x_1 \vee \neg x_4)
- $F[x_1=1] = (x_2 \vee x_3) \wedge \neg x_4$ has 3 solutions
- $F[x_1=0]=$ $(x_2 \vee x_3) \wedge \neg x_4 = F[x_1=1]$ , also has $3$ solutions!
6 solutions in total.

Exploiting SAT solver efficiency

Exhaustive DPLL for $F = (\neg x_1 \vee x_2 \vee x_3) \wedge (\neg x_2 \vee \neg x_5) \wedge (x_2 \vee x_4 \vee x_5) \wedge (\neg x_3 \vee \neg x_5) \wedge (x_1) \wedge (\neg x_2 \vee \neg x_4) \wedge (\neg x_3 \vee \neg x_4)$

(\neg x_1 \vee x_2 \vee x_3) \wedge (\neg x_2 \vee \neg x_5) \wedge (x_2 \vee x_4 \vee x_5) \wedge (\neg x_3 \vee \neg x_5) \wedge (x_1) \wedge (\neg x_2 \vee \neg x_4) \wedge (\neg x_3 \vee \neg x_4)

\langle \rangle

(\neg x_1 \vee x_2 \vee x_3) \wedge (\neg x_2 \vee \neg x_5) \wedge (x_2 \vee x_4 \vee x_5) \wedge (\neg x_3 \vee \neg x_5) \wedge (x_1) \wedge (\neg x_2 \vee \neg x_4) \wedge (\neg x_3 \vee \neg x_4)

\langle x_1 \mapsto 1 \rangle

(\neg x_3 \vee \neg x_5) \wedge (\neg x_2 \vee \neg x_4) \wedge (x_2 \vee x_3) \wedge (\neg x_3 \vee \neg x_4) \wedge (\neg x_2 \vee \neg x_5) \wedge (x_2 \vee x_4 \vee x_5)

\langle x_1 \mapsto 1 \rangle

(\neg x_3 \vee \neg x_5) \wedge (\neg x_2 \vee \neg x_4) \wedge (x_2 \vee x_3) \wedge (\neg x_3 \vee \neg x_4) \wedge (\neg x_2 \vee \neg x_5) \wedge (x_2 \vee x_4 \vee x_5)

\langle x_1 \mapsto 1,x_5 \mapsto 0 \rangle

(x_2 \vee x_4) \wedge (\neg x_2 \vee \neg x_4) \wedge (x_2 \vee x_3) \wedge (\neg x_3 \vee \neg x_4)

\langle x_1 \mapsto 1,x_5 \mapsto 0 \rangle

(x_2 \vee x_4) \wedge (\neg x_2 \vee \neg x_4) \wedge (x_2 \vee x_3) \wedge (\neg x_3 \vee \neg x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 1,x_5 \mapsto 0 \rangle

(\neg x_4) \wedge (\neg x_3 \vee \neg x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 1,x_5 \mapsto 0 \rangle

(\neg x_4) \wedge (\neg x_3 \vee \neg x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 1,x_4 \mapsto 0,x_5 \mapsto 0 \rangle

()

\langle x_1 \mapsto 1,x_2 \mapsto 1,x_4 \mapsto 0,x_5 \mapsto 0 \rangle

(x_2 \vee x_4) \wedge (\neg x_2 \vee \neg x_4) \wedge (x_2 \vee x_3) \wedge (\neg x_3 \vee \neg x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_5 \mapsto 0 \rangle

(x_3) \wedge (\neg x_3 \vee \neg x_4) \wedge (x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_5 \mapsto 0 \rangle

(x_3) \wedge (\neg x_3 \vee \neg x_4) \wedge (x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_3 \mapsto 1,x_5 \mapsto 0 \rangle

(\neg x_4) \wedge (x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_3 \mapsto 1,x_5 \mapsto 0 \rangle

(\neg x_4) \wedge (x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_3 \mapsto 1,x_4 \mapsto 0,x_5 \mapsto 0 \rangle

()

\langle x_1 \mapsto 1,x_2 \mapsto 0,x_3 \mapsto 1,x_4 \mapsto 0,x_5 \mapsto 0 \rangle

(\neg x_3 \vee \neg x_5) \wedge (\neg x_2 \vee \neg x_4) \wedge (x_2 \vee x_3) \wedge (\neg x_3 \vee \neg x_4) \wedge (\neg x_2 \vee \neg x_5) \wedge (x_2 \vee x_4 \vee x_5)

\langle x_1 \mapsto 1,x_5 \mapsto 1 \rangle

(\neg x_2 \vee \neg x_4) \wedge (x_2 \vee x_3) \wedge (\neg x_3 \vee \neg x_4) \wedge (\neg x_3) \wedge (\neg x_2)

\langle x_1 \mapsto 1,x_5 \mapsto 1 \rangle

(\neg x_2 \vee \neg x_4) \wedge (x_2 \vee x_3) \wedge (\neg x_3 \vee \neg x_4) \wedge (\neg x_3) \wedge (\neg x_2)

\langle x_1 \mapsto 1,x_3 \mapsto 0,x_5 \mapsto 1 \rangle

(x_2) \wedge (\neg x_2 \vee \neg x_4) \wedge (\neg x_2)

\langle x_1 \mapsto 1,x_3 \mapsto 0,x_5 \mapsto 1 \rangle

(x_2) \wedge (\neg x_2 \vee \neg x_4) \wedge (\neg x_2)

\langle x_1 \mapsto 1,x_2 \mapsto 1,x_3 \mapsto 0,x_5 \mapsto 1 \rangle

() \wedge (\neg x_4)

\langle x_1 \mapsto 1,x_2 \mapsto 1,x_3 \mapsto 0,x_5 \mapsto 1 \rangle

Spend time maintaining caching in dead branches
Call to SAT solver at each level:
- cut conflict branches
- CDCL : learn some conflict that can be used
- E.g., SAT call learns $\neg x_5$ in the first call.

Connected components

Observation: $F = F_1 \wedge F_2$ with $var(F_1) \cap var(F_2) = \emptyset$ then

$\#F = \#F_1 \times \#F_2$

because solutions of $F_1$ and $F_2$ can be recombined independently to form a solution of $F$ .

Add connected components detection in exhaustive DPLL.
Reduces the number of operations.

Example with connected components

$(x_1 \vee x_2) \wedge (x_1 \vee x_3)$	$\wedge$	$(x_4 \vee x_5) \wedge (x_4 \vee x_6)$	$\wedge$	$(x_7 \vee x_9) \wedge (\neg x_7 \vee x_8)$
5 solutions	$\times$	5 solution	$\times$	4 solutions

$5 \times 5 \times 4 = 100$

(x_1 \vee x_2) \wedge (x_1 \vee x_3) \wedge (x_4 \vee x_5) \wedge (x_4 \vee x_6) \wedge (x_7 \vee x_9) \wedge (\neg x_7 \vee x_8)

\langle \rangle

(x_4 \vee x_6) \wedge (x_1 \vee x_2) \wedge (x_4 \vee x_5) \wedge (x_7 \vee x_9) \wedge (\neg x_7 \vee x_8) \wedge (x_1 \vee x_3)

\langle \rangle

(x_4 \vee x_5) \wedge (x_4 \vee x_6)

\langle \rangle

(x_4 \vee x_5) \wedge (x_4 \vee x_6)

\langle x_4 \mapsto 1 \rangle

()

\langle x_4 \mapsto 1 \rangle

(x_4 \vee x_5) \wedge (x_4 \vee x_6)

\langle x_4 \mapsto 0 \rangle

(x_5) \wedge (x_6)

\langle x_4 \mapsto 0 \rangle

(x_5) \wedge (x_6)

\langle x_4 \mapsto 0 \rangle

(x_5)

\langle x_4 \mapsto 0 \rangle

(x_5)

\langle x_4 \mapsto 0,x_5 \mapsto 1 \rangle

(x_5) \wedge (x_6)

\langle x_4 \mapsto 0 \rangle

(x_6)

\langle x_4 \mapsto 0 \rangle

(x_6)

\langle x_4 \mapsto 0,x_6 \mapsto 1 \rangle

(x_4 \vee x_6) \wedge (x_1 \vee x_2) \wedge (x_4 \vee x_5) \wedge (x_7 \vee x_9) \wedge (\neg x_7 \vee x_8) \wedge (x_1 \vee x_3)

\langle \rangle

(x_1 \vee x_3) \wedge (x_1 \vee x_2)

\langle \rangle

(x_1 \vee x_3) \wedge (x_1 \vee x_2)

\langle x_1 \mapsto 1 \rangle

(x_1 \vee x_3) \wedge (x_1 \vee x_2)

\langle x_1 \mapsto 0 \rangle

(x_3) \wedge (x_2)

\langle x_1 \mapsto 0 \rangle

(x_3) \wedge (x_2)

\langle x_1 \mapsto 0 \rangle

(x_3)

\langle x_1 \mapsto 0 \rangle

(x_3)

\langle x_1 \mapsto 0,x_3 \mapsto 1 \rangle

(x_3) \wedge (x_2)

\langle x_1 \mapsto 0 \rangle

(x_2)

\langle x_1 \mapsto 0 \rangle

(x_2)

\langle x_1 \mapsto 0,x_2 \mapsto 1 \rangle

(x_4 \vee x_6) \wedge (x_1 \vee x_2) \wedge (x_4 \vee x_5) \wedge (x_7 \vee x_9) \wedge (\neg x_7 \vee x_8) \wedge (x_1 \vee x_3)

\langle \rangle

(\neg x_7 \vee x_8) \wedge (x_7 \vee x_9)

\langle \rangle

(\neg x_7 \vee x_8) \wedge (x_7 \vee x_9)

\langle x_8 \mapsto 1 \rangle

(x_7 \vee x_9)

\langle x_8 \mapsto 1 \rangle

(x_7 \vee x_9)

\langle x_8 \mapsto 1,x_9 \mapsto 1 \rangle

(x_7 \vee x_9)

\langle x_8 \mapsto 1,x_9 \mapsto 0 \rangle

(x_7)

\langle x_8 \mapsto 1,x_9 \mapsto 0 \rangle

(x_7)

\langle x_7 \mapsto 1,x_8 \mapsto 1,x_9 \mapsto 0 \rangle

(\neg x_7 \vee x_8) \wedge (x_7 \vee x_9)

\langle x_8 \mapsto 0 \rangle

(x_7 \vee x_9) \wedge (\neg x_7)

\langle x_8 \mapsto 0 \rangle

(x_7 \vee x_9) \wedge (\neg x_7)

\langle x_7 \mapsto 0,x_8 \mapsto 0 \rangle

(x_9)

\langle x_7 \mapsto 0,x_8 \mapsto 0 \rangle

(x_9)

\langle x_7 \mapsto 0,x_8 \mapsto 0,x_9 \mapsto 1 \rangle

Exhaustive DPLL: summary

Run SAT solver to prune unsatisfiable branches. Exploit learnt clauses.
$\#F = \#F_1 \times \#F_2$ when $F_1$ and $F_2$ have disjoint variables.
$\#F = \#F[x \gets 0]+\#F[x \gets 1]$ otherwise.
When $\#G$ is computed, cache its value in case $G$ is encountered again.
Many tools based on this idea: d4, SharpSAT, SharpSAT-TD, etc.

Choosing variables

Complexity of exhaustive DPLL depends on the variable picked for branching
For SAT solver:
- try to quickly converge to conflicts to learn
- try to find one solution
- Many heuristics for this: VSIDS, Bohm, Maximum occurrence etc.
Very different for #SAT solvers: aim at factorizing the search space.
Find small separator: treewidth guided heuristics.

#SAT

A primer on SAT

The puzzling SAT puzzle

SAT as Propositional Logic

SAT is Hard

SAT as a “Programming Language”

Modeling reasoning

Modeling computation

NP-completeness

A Branching algorithm

Adding Unit Propagation

DPLL

CDCL Solvers

#SAT

Counting the number of solutions

Why counting?

How hard is it?

Counting by enumeration

Counting by enumeration

Caching in Exhaustive DPLL

Exploiting SAT solver efficiency

Connected components

Example with connected components

Exhaustive DPLL: summary

Choosing variables

Compile !

Trace of counting algorithm

Syntactic restriction of Boolean circuits

Limits of DPLL

Bottom up compilation

OBDD example

Conclusion

Slides you could have seen today

More general Circuits

Complexity of DPLL

Approximating #SAT

Lower bounds

Implementation

More queries

Applications

$(x_1 \oplus x_2 = 0)$	$\wedge$	$(x_3 \oplus x_4 = 0)$	$\wedge$	$(x_1 \vee x_4)$
	$\wedge$		$\wedge$	$x_1 \vee x_4$
			$\wedge$