Benchmarking CPAchecker

CPAchecker provides a command-line flag --benchmark that should be used for all kinds of benchmarks. It improves performance by disabling assertions and optional features such as extended statistics and output files. Note that you can re-enable specific output files individually if desired.

In addition, care needs to be taken to specify meaningful resource limits and some other options depending on the specifics of the verification task, as described in the following.

Specifying Machine Model and Further Program Characteristics

It is important to specify the correct values for the following options according to the characteristics of the input files to avoid wrong answers:

Machine model (32 vs. 64 bit, x86 vs. ARM): A 32-bit model for Linux on x86 (ILP32) is assumed by default. For 64-bit Linux on x86 (LP64), specify --64 on the command line.
Whether malloc may return null or not: If the program assumes malloc never returns null and you are using a predicate-based analysis, set cpa.predicate.memoryAllocationsAlwaysSucceed=true. Note that this assumption is true for all SV-Comp files, thus this option is already set in all --svcomp* configurations.

Specifying Resource Limits

When benchmarking CPAchecker, it is important to correctly specify the time and memory limits. The examples in the following are based on benchmark definitions of BenchExec.

CPU Time. Firstly, in order to get statistics even in case of a timeout, is is important to specify different "soft" and "hard" CPU-time limits like in this example: <benchmark timelimit="900s" hardtimelimit="1000s" ... The soft time limit is automatically passed as parameter to CPAchecker, so there is no need to specify the --timelimit option manually.
Memory. The memory limit is specified in SI units (i.e., 1 MB = 1,000,000 Bytes) with the attribute memlimit in the <benchmark> tag of the benchmark definition XML file. Example: <benchmark ... memlimit="8000 MB"> This limit will be enforced by the OS and CPAchecker will be killed if it needs more memory.

Additionally, it is important to specify the amount of memory that Java uses for its own heap with the --heap command-line parameter. This value needs to be lower than the external limit. Setting it too low will hurt the performance due to increased garbage collection and provoke OutOfMemoryError, setting it too high limits the memory that is available to native libraries such as MathSAT. For analyses without MathSAT, start to experiment with 1000 MB less than the external limit. IMPORTANT: Java does not use SI units here, but IEC units (factor 1024). 7000M here are 5% more than 7000 MB for the memory limit above! Example: <option name="--heap">7000M</option>

Summary: For correct and useful benchmarks, choose a memory limit (e.g., 8000MB), a Java heap size (e.g., 7000MiB), and a timelimit (e.g., 900s). Then specify them as follows:

<benchmark ... timelimit="900s" hardtimelimit="1000s" memlimit="8000 MB">
  <option name="--heap">7000M</option>
  ...
</benchmark>

Benchmark Execution

For benchmarking, it is recommended to use BenchExec. The file ./examples/benchmark-cpachecker.xml can be used as base for own benchmarks. Several useful benchmark configurations are in test/test-sets/*.xml. Commented examples for these XML files are given in the BenchExec documentation.

An extended version of BenchExec that also supports benchmarking CPAchecker remotely via BenchCloud is provided as scripts/benchmark.py.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmark.md

Benchmark.md

Benchmarking CPAchecker

Specifying Machine Model and Further Program Characteristics

Specifying Resource Limits

Benchmark Execution

Files

Benchmark.md

Latest commit

History

Benchmark.md

File metadata and controls

Benchmarking CPAchecker

Specifying Machine Model and Further Program Characteristics

Specifying Resource Limits

Benchmark Execution