Parallel Emptiness Checks Comparisons (TACAS'15)

Description

This benchmark provides a set of 3268 random formulas over 10 models extracted from the BEEM benchmark (http://anna.fi.muni.cz/models/). We compute formula such that for each models the NDFS of Gaiser and Schwoon takes 2 hours of computation for verified formula and 2 hours for violated formulas. All formulas where selected such that they take between 10 seconds and 30 minutes (for violated formula there exist a transition ordering that satisfies this constraint).

Replay

A README explaining how to launch the setups and interpret results here.

All experiments where performed using a fork of Spot avalaible here.

The benchmark for Spot can be found here.

The benchmark for Ltsmin can be found here.

The benchmark for Divine can be found here.

The resulting CSV can be found here.

How to interpret this page

We postfix algorithm names with _number-of-threads:

What does the field names stand for?

In the folowing tables we use descriptors that allows to compare each emptiness check variations.

Tools

Inputs

Cumulative Summary

Each column shows a sum over all inputs processed by one tool. The second column shows the count of inputs processed. Green and red highlight minimum and maximum values per column.

Average Summary

Line-Comparison

Cross-Comparison

Compare and in case of equality .

Tool vs. Tool Scatter

Compare against on and color by .