* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Proof translation for CVC3
Turing's proof wikipedia , lookup
Axiom of reducibility wikipedia , lookup
Truth-bearer wikipedia , lookup
Model theory wikipedia , lookup
Gödel's incompleteness theorems wikipedia , lookup
Sequent calculus wikipedia , lookup
Laws of Form wikipedia , lookup
Naive set theory wikipedia , lookup
Georg Cantor's first set theory article wikipedia , lookup
List of first-order theories wikipedia , lookup
Foundations of mathematics wikipedia , lookup
Mathematical logic wikipedia , lookup
Boolean satisfiability problem wikipedia , lookup
Natural deduction wikipedia , lookup
Proof translation
from CVC3 to Hol light
Yeting Ge
Acsys
Mar 5, 2008
CVC3: a SMT solver
CVC3 is complicated
SAT, decision procedures, ……
About 400k lines of code in all
Are the results from CVC3 correct?
Extremely difficult to verify CVC3 is correct
Check the proofs from CVC3
CVC3 can produce a “proof” for a unsat case
Proofs are big and a proof checker is needed
Is the proof checker correct?
Have to check hundreds of proof rules
Outline
SMT solvers and CVC3
HOL and Hol light
Features
Proofs in HOL
Translation from CVC3 into Hol light
SMT example
Proofs in CVC3
Boolean resolution
Theory proof rules
SMT LIB benchmarks certification
SMT solver
Satisfiability Modulo Theories
Arithmetic, bit vector, array, equality,……
Is plus (a, b) plus (b, a) satifisabile?
Abstraction
Theory solver
SAT solver
Equality
Arithmetic
……
SMT example
To prove (a b) ( f (a) f (b)) is unsatisfiable
Abstraction
(a b) ( f (a) f (b))
b1 : (a b)
b2 : f (a) f (b)
Theory solver
SAT solver
unsat
b1 b2
No more
{ b1 T , b2 F }
ab
f (a ) f (b)
T unsat
Proofs in CVC3
Proofs from theory solvers
Proofs from the SAT solver
Modern SAT solvers can dump proofs
A tree of boolean resolutions
To prove ~A \/ B, ~A \/ ~B, A |- F
A:BOOLEAN;
B:BOOLEAN;
ASSERT(NOT A OR B);
ASSERT((NOT A) OR (NOT B));
ASSERT(A);
QUERY(FALSE);
DUMP_PROOF;
Boolean resolution
~A \/ B, ~A \/ ~B, A |- F
Dumped proof from minisat
5
6
9
10
11
12
13
I
I
I
I
I
D
D
: +2 -1 -3 :
: +1 :
: -2 -3 -4 :
: +4 :
: +3 :
: +2 : 5 -1 6 -3 11
: : 9 -2 12 -3 11 -4 10
1
2
3
4
:
:
:
:
5I
6I
9I
10 I
11 I
12 D
(B \/ ~A)
B
A
(~A \/ ~B)
:
:
:
:
:
:
B, ~(B \/ ~A), ~A
(B \/ ~A)
~B, ~A, ~(~A \/ ~B)
(~A \/ ~B)
A
B:
B, ~A : 5 6
B : 11
13 D : :
~A, ~(~A \/ ~B) : 9 12
~(~A \/ ~B) : 11
: : 10
The proof from CVC3
Proof(minisat_proof(FALSE,
bool_resolution(NOT (NOT A OR NOT B),
bool_resolution(NOT A,
bool_resolution(NOT B,
CNF("or_final", (NOT A OR NOT B), (NOT A OR NOT B), 0),
bool_resolution(NOT A,
bool_resolution(NOT (B OR NOT A),
CNF("or_final", (B OR NOT A), (B OR NOT A), 0),
cnf_add_unit((B OR NOT A),
iff_mp((NOT A OR B), (B OR NOT A), assump_23,
rewrite_or((NOT A OR B), (B OR NOT A))))),
cnf_add_unit(A, assump_25))),
cnf_add_unit(A, assump_25)),
cnf_add_unit((NOT A OR NOT B), assump_24))))
Proofs from theory solvers
Proof rules are much more complicated than
boolean resolution
Over 400 proof rules in CVC3
Example: mult_eqn
|- (x = y) <=> (x * z = y * z)
A proof checker must make sure that z is not
equivalent to 0 , which is not a easy job
Ideal proof checker for SMT
solvers
CNF clauses in CVC3
Orginal clauses (assumptions)
CNF translation clauses
Theory clauses
Tautologies (not always)
Extra clauses asserted by theory solvers
Can check boolean resolution and tautologies
Can handle all theory proof rules
Theory specific calculations
HOL family of proof assistants
Based on higher order logic (lambda calculus)
Powerful, can formalize most mathematics
Simple and small core
Definitional extension
only four kinds of terms
All theories (even /\ \/ ) are defined
All theorems must be created in a constructive way
Soundness is guaranteed if the core is correct
Implemented in ML
Programmable, easy to extend and include new
decision procedures
Hol light
Minimized core
10 inference rules on equality
3 axioms (axiom of choice, infinity)
about 400 lines of Ocaml
Chosen for a number of projects
Verification of float point algorithm at Intel
Kepler Conjecture
A group of experts spent five years, unable to verify the proof
Formalize the proof in Hol light
Includes theory of arithmetic
Proofs in Hol light
All theorem are constructed by using Hol
proof rules
#ASSUME `a:bool`;;
val it : thm = a |- a
Derived proof rules are just Ocaml functions
let PROVE_HYP ath bth =
if exists (aconv (concl ath)) (hyp bth)
then EQ_MP (DEDUCT_ANTISYM_RULE ath
bth) ath
else bth;;
Translate proofs into HOL light
Instead of a proof checker, we propose a
translator of the proofs from CVC3 into Hol
light
Proof checking is done by Hol Light
If the translation is successful, then the same
theorem is proved in Hol light
If a theorem is proved in Hol light, we are
more confident that the theorem is true
Translation into Hol light
Hol light and CVC3 are connected through C
interface of Ocaml and CVC3
CVC3 terms are translated into Hol terms
CVC3 uninterpreted functions are translated into
combination
For each CVC3 proof rules, we write a Ocaml
function
Prove a higher order theorem, then instantiate it
Translate boolean resolution
Suppose two theorems, corresponding two CNF
clauses, have been proved in HOL
(1) … |- A1 \/ (A2 \/ (A3 \/ ……)))
(2) … |- B1 \/ (~A2 \/ (B3 \/ ……)))
The desired theorem is:
(3) …|- A1 \/ A3 \/ B1 \/ B3 \/ ……
The proof of (3) is time consuming
Duplicated terms in the (3) must be removed
Change the representation
(1)’
(2)’
… ~A1 , ~A2 ,~A3 …… |- F
… ~B1 , A2 , ~B3 …… |- F
hole5
Translate theory proof rules
|- (x = y) <=> (x * z = y * z)
let x = translate_term vc (child expr 1) in
let y = translate_term vc (child expr 2) in
let z = translate_term vc (child expr 3) in
let znz = prove_DIV_NOT_EQ_0 z in
SPECL[x;y] (MATCH_MP REAL_NZ_RMUL znz)
# REAL_NZ_RMUL;;
val it : thm = |- !x y z. ~(z = &0) ==> (x = y <=> x * z = y * z)
A problem
CVC3 proves a theoem Tcvc 3
Tcvc 3 is translated into Hol light that produces
a theorem Thol
Are Thol and Tcvc 3 the same theorem?
A tentative solution:
Dump Tcvc 3 and Thol into some canonical form
Compare the canonized theorems in syntax
Dump Thol from Hol light
Translate Thol back into CVC3 and dump it from CVC3
SMT LIB benchmarks certification
SMT LIB
SMT COMP
A collection of smt benchmarks
Arithmetic, Bit vector, array, unintepreted function,……
The ‘status’ in each case shows whether it is sat, unsat
or unknown
Annual competition for SMT solvers
Are the answers from SMT solvers correct?
Are the ‘status’ fields in SMT LIB benchmarks show the
correct results
We propose to prove these benchmarks in Hol light
A certificate to show a case is proved
Future work
Prove more cases in Hol light
Support more proof rules
Define new theories in Hol light
theory of array are defined by a new axiom