UNSAT Solver Synthesis via Monte Carlo Forest Search

Chris Cameron*, Jason Hartford, Taylor Lundy, Tuan Truong, Alan Milligan, Rex Chen, Kevin Leyton-Brown

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

We introduce Monte Carlo Forest Search (MCFS), a class of reinforcement learning (RL) algorithms for learning policies in tree MDPs, for which policy execution involves traversing an exponential-sized tree. Examples of such problems include proving unsatisfiability of a SAT formula; counting the number of solutions of a satisfiable SAT formula; and finding the optimal solution to a mixed-integer program. MCFS algorithms can be seen as extensions of Monte Carlo Tree Search (MCTS) to cases where, rather than finding a good path (solution) within a tree, the problem is to find a small tree within a forest of candidate trees. We instantiate and evaluate our ideas in an algorithm that we dub Knuth Synthesis, an MCFS algorithm that learns DPLL branching policies for solving the Boolean satisfiability (SAT) problem, with the objective of achieving good average-case performance on a given distribution of unsatisfiable problem instances. Knuth Synthesis is the first RL approach to avoid the prohibitive costs of policy evaluations in an exponentially-sized tree, leveraging two key ideas: first, we estimate tree size by randomly sampling paths and measuring their lengths, drawing on an unbiased approximation due to Knuth (1975); second, we query a strong solver at a user-defined depth rather than learning a policy across the whole tree, to focus our policy search on early decisions that offer the greatest potential for reducing tree size. We matched or exceeded the performance of a strong baseline on three well-known SAT distributions, facing problems that were two orders of magnitude more challenging than those addressed in previous RL studies.
Original languageEnglish
Title of host publicationIntegration of Constraint Programming, Artificial Intelligence, and Operations Research
Subtitle of host publication21st International Conference, CPAIOR 2024, Uppsala, Sweden, May 28–31, 2024, Proceedings, Part I
EditorsBistra Dilkina
Place of PublicationCham
PublisherSpringer Cham
Pages170-189
Number of pages20
ISBN (Electronic)9783031605970
ISBN (Print)9783031605963
DOIs
Publication statusPublished - 28 Mar 2024
EventInternational Conference on the Integration of Constraint Programming, Artificial Intelligence, and Operations Research - Uppsala, Sweden
Duration: 28 May 202431 May 2024

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume14742
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on the Integration of Constraint Programming, Artificial Intelligence, and Operations Research
Abbreviated titleCPAIOR 2024
Country/TerritorySweden
CityUppsala
Period28/05/2431/05/24

Fingerprint

Dive into the research topics of 'UNSAT Solver Synthesis via Monte Carlo Forest Search'. Together they form a unique fingerprint.

Cite this