
ClauseSMT (ASE'2025): Artefact of "Improving NLSAT for Nonlinear Real Arithmetic"

Author: Zhonghan Wang (https://yogurt-shadow.github.io/)

Work done while the author was a master's student.

Email: [email protected] (personal)

Zenodo: https://zenodo.org/records/16887913

Structure of the Artefact

This artefact is intended to compare the sequential performance of our solver clauseSMT with other mainstream SMT solvers.

  1. binary_solvers: The pre-compiled binary files of different versions of our solver and other existing solvers.
  2. experiment_data: Our experimental results on the QF_NRA benchmark.
  3. script: The scripts for generating the file path list and running the experiments.
  4. source_code: The source code of different versions of our solver.
  5. README: This file.
  6. LICENSE: MIT License of the artefact.

Step 0. Build Docker

We recommend conducting the experiments on Linux or on Windows via WSL.

To fully reproduce our experimental results, we strongly recommend using an external server with many CPU cores for parallel execution.

Run the following command in the project root directory (where the Dockerfile is located) to build the Docker image:

docker build -t clausesmt_env .

After building the image, start an interactive container with your project directory mounted inside:

docker run -it --rm -v $(pwd):/app clausesmt_env

Now you can execute all commands from the README inside the container without worrying about missing dependencies on your host system.

Step 1. Preparing Benchmark

The benchmark used in our paper is the SMT-LIB QF_NRA benchmark (2023 release): https://zenodo.org/records/10607722/files/QF_NRA.tar.zst?download=1

You are free to browse other versions of SMT-LIB: https://zenodo.org/communities/smt-lib/records?q=&l=list&p=1&s=10

wget https://zenodo.org/records/10607722/files/QF_NRA.tar.zst?download=1 -O QF_NRA.tar.zst
tar -I zstd -xvf QF_NRA.tar.zst
mv non-incremental/QF_NRA .
rm -rf non-incremental
rm -rf QF_NRA.tar.zst

After this step, the folder structure should look like this:

(screenshot: folder structure)

Step 2. Generate File Path List

To evaluate our solver on the whole benchmark, we first generate a list of the absolute paths of all instances.

cd script
python3 generate_list.py ../QF_NRA/

The generated absolute paths are stored in QF_NRA/list.txt, as shown below.

(screenshot: list.txt contents)
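If you prefer standard tools, or want to build a smaller subset list by hand, an equivalent list can be produced with `find`. This is a sketch under the assumption that list.txt simply holds one absolute instance path per line; check the real list.txt produced by generate_list.py before relying on it:

```shell
# Hypothetical stand-in for generate_list.py: collect the absolute paths of
# all .smt2 instances into list.txt (one path per line is an assumption --
# compare against the output of the real script).
find "$(pwd)/QF_NRA" -name '*.smt2' | sort > QF_NRA/list.txt
wc -l QF_NRA/list.txt   # number of instances found
```

The same command with a narrower starting directory (e.g. a single benchmark family under QF_NRA/) yields a subset list for smaller test runs.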

Step 3. Compile and Build

There are two ways to use our SMT solver: compile it from source, or use the pre-compiled binary files.

3.1 Compilation from Source Code

To compile the source code, run the following commands:

cd script
bash compile.sh <solver_name> <num_thread>

Here, solver_name is one of the solver versions under source_code/ (for example clauseSMT, nlsat, or static-look-ahead), and num_thread is the number of parallel compilation jobs.

3.2 Binary Files

We also provide the pre-compiled binary files of our solver and other SMT solvers in binary_solvers/.

3.3 Test the Binary Files

We provide a simple SMT instance, example.smt2, for testing the binaries. Run the following commands to confirm that the tool was built successfully.

  1. Compiled binary files:
./source_code/clauseSMT/build/z3 example.smt2
  2. Pre-compiled binary files:
./binary_solvers/clauseSMT example.smt2
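example.smt2 ships with the artefact and is not reproduced here, but for orientation, a minimal QF_NRA instance in the same SMT-LIB format looks like the following (this is an illustrative instance we wrote, not the artefact's file):

```smt2
; A tiny QF_NRA instance: is there a real x with x^2 > 2?
(set-logic QF_NRA)
(declare-fun x () Real)
(assert (> (* x x) 2.0))
(check-sat)
; a complete solver should report: sat
```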

Step 4. Run clauseSMT on the Benchmark

To obtain the full experimental results, we provide a script script/parallel_run.cpp that runs our solver on all benchmark instances using multiple parallel processes. We strongly recommend using an external server for this step.

mkdir self_data
cd script
g++ -O3 -o parallel_run parallel_run.cpp
./parallel_run [instance_list_path] [solver_path] [output_path] [time_limit] [memory_limit] [max_process_num]

where:

  • instance_list_path: the path to the list file of test cases
  • solver_path: the path to the solver binary file
  • output_path: the directory where the results are collected
  • time_limit: the time limit (seconds) for each instance
  • memory_limit: the memory limit (MB) for each instance
  • max_process_num: the maximum number of processes to run in parallel

A recommended setting for the external server is:

./parallel_run ../QF_NRA/list.txt ../source_code/clauseSMT/build/z3 ../self_data/ 1200 30720 120

or

./parallel_run ../QF_NRA/list.txt ../binary_solvers/clauseSMT ../self_data/ 1200 30720 120

if you would like to use the pre-compiled binary files.

In SMT-COMP, the standard time limit is 1200 seconds and the memory limit is 30720 MB per instance.
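Note that these limits interact with max_process_num: in the worst case, every process hits its memory cap at the same time. A quick back-of-envelope check before choosing a setting (the numbers below are the SMT-COMP limits from above, not a measurement):

```shell
# Worst-case aggregate memory if all parallel processes hit the 30720 MB cap.
procs=120
limit_mb=30720
echo "worst case: $((procs * limit_mb)) MB (~$((procs * limit_mb / 1024)) GB)"
```

If your machine has less memory than this worst case, lower max_process_num (or memory_limit) accordingly.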

If you cannot access a powerful external server and would like to narrow the test set, manually change instance_list_path to a smaller list file containing a subset of the benchmark.

If everything goes well, the self_data folder will contain one txt file per instance, each holding the solver's output and statistics.

(screenshot: self_data contents)
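Before moving on to Step 5, a quick sanity check can be done directly on the result files. This sketch assumes each per-instance .txt contains the solver's verdict (sat/unsat) on a line of its own; inspect one of your files first and adjust the patterns if the format differs:

```shell
# Count instances whose result file contains an exact "sat" / "unsat" line.
# The output format is an assumption -- inspect a file in self_data/ first.
sat=$(grep -lx 'sat' self_data/*.txt | wc -l)
unsat=$(grep -lx 'unsat' self_data/*.txt | wc -l)
echo "sat=$sat unsat=$unsat solved=$((sat + unsat))"
```

grep -x matches whole lines only, so a file containing "unsat" is not also counted as "sat".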

Step 5. Collecting Results

We provide a script script/collect.py that collects the results of all solvers on all test cases and generates a CSV file. The script takes two arguments:

  • folder_path: the path to the folder containing the results (for example: ../self_data/)
  • output_file: the path to the output csv file (for example: csv/clauseSMT.csv)
cd script
mkdir -p csv
python3 collect.py ../self_data/ csv/clauseSMT.csv

Then csv/clauseSMT.csv will contain the results of our solver on all test cases, including the instance name, the solver's output, and the time and memory consumption.

(screenshot: CSV contents)
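As a quick way to summarize the CSV, the verdict column can be tallied with awk. The column layout assumed below (instance, result, time, memory) is our reading of the description above, so check the actual header of csv/clauseSMT.csv and adjust the field index if needed:

```shell
# Tally solver verdicts from the collected CSV (field 2 is assumed to be
# the result column -- verify against the real header line first).
awk -F, 'NR > 1 { n[$2]++ } END { for (v in n) print v, n[v] }' csv/clauseSMT.csv
```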

Step 6. Comparison with Our Results

We provide the experimental results of our solver on the whole benchmark in experiment_data/. The user can compare their results with ours to see the performance of our solver.

Step 7. Other SMT Solvers

We also provide the source code and binary files of other SMT solvers, including NLSAT, Z3, CVC5, Yices2, dReal, and MathSAT. The user can use these solvers to compare their results with ours, following the steps in the previous sections.

Appendix: Description of Solvers and Experimental Results

Comparison with Existing SMT Solvers

| Solver | Path | Data | Usage | Sat | Unsat | Solved |
| --- | --- | --- | --- | --- | --- | --- |
| NLSAT | NLSAT | NLSAT_result | `./NLSAT <*.smt2>` | 5541 | 5191 | 10732 |
| Z3 | z3 | z3_result | `./z3 <*.smt2>` | 5569 | 5379 | 10948 |
| CVC5 | cvc5 | cvc5_result | `./cvc5 <*.smt2>` | 5475 | 5809 | 11284 |
| Yices2 | yices2 | yices2_result | `./yices2 <*.smt2>` | 5372 | 5612 | 10984 |
| dReal (delta=0.001) | dReal | dReal_result | `./dReal --precision 0.001 <*.smt2>` | 4811 | 4294 | 9105 |
| MathSAT | mathsat | mathsat_result | `./mathsat <*.smt2>` | 2772 | 4583 | 7355 |
| clauseSMT (Ours) | clauseSMT | clauseSMT_result | `./clauseSMT <*.smt2>` | 5608 | 5397 | 11005 |

Effect of Proposed Techniques (Ablation Study)

Effect of Look-Ahead Mechanism

| Solver | Description | Path | Data | Usage | Sat | Unsat | Solved |
| --- | --- | --- | --- | --- | --- | --- |
| NLSAT | Decide Lowest Degree Literal | NLSAT | NLSAT_result | `./NLSAT <*.smt2> -st` | 5541 | 5191 | 10732 |
| random_decide | Decide Random Literal | random_decide | random_decide_result | `./random_decide <*.smt2> -st` | 5505 | 5147 | 10652 |
| static-look-ahead | Feasible-set based Look-Ahead | static-look-ahead | static-look-ahead_result | `./static-look-ahead <*.smt2> -st` | 5555 | 5223 | 10778 |

Effect of Clause-Level Propagation based Branching Heuristic

| Solver | Description | Path | Data | Usage | Sat | Unsat | Solved |
| --- | --- | --- | --- | --- | --- | --- |
| static-look-ahead | Static order based on degree | static-look-ahead | static-look-ahead_result | `./static-look-ahead <*.smt2> -st` | 5555 | 5223 | 10778 |
| vsids-look-ahead | Dynamic order based on VSIDS | vsids-look-ahead | vsids-look-ahead_result | `./vsids-look-ahead <*.smt2> -st` | 5599 | 5321 | 10920 |
| clauseSMT (Ours) | Dynamic order based on clause-level propagation | clauseSMT | clauseSMT_result | `./clauseSMT <*.smt2> -st` | 5608 | 5397 | 11005 |
