runtime-benchmarks

Benchmarks to compare the performance of async runtimes / executors.

An interactive view of the full results dataset is available at: https://fleetcode.com/runtime-benchmarks/

Results summary table of a single configuration:

Runtime	libfork	TooManyCooks	tbb	taskflow	cppcoro	coros	HPX	concurrencpp	libcoro
Mean Ratio to Best (lower is better)	1.00x	1.11x	2.79x	2.95x	3.00x	4.41x	164.68x	172.02x	2247.44x
skynet(8)	39509 us	46285 us	141389 us	205437 us	171084 us	104557 us	15275347 us	12211548 us	155806778 us
fib(39)	67773 us	82517 us	269588 us	200510 us	264781 us	172050 us	14422928 us	18555453 us	304651430 us
nqueens(14)	78595 us	83610 us	163150 us	166061 us	173162 us	883629 us	4522909 us	8142602 us	42437681 us
matmul(2048)	41751 us	41608 us	64036 us	63297 us	64771 us	50476 us	72353 us	67167 us	459776 us
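
The "Mean Ratio to Best" row is not defined above. A minimal sketch of one plausible reading, assuming it is the arithmetic mean of each runtime's time divided by the fastest time for that benchmark: the function name `mean_ratio_to_best` is hypothetical and is not taken from the repository's scripts. Only three runtimes are included, which is safe here because libfork and TooManyCooks hold the best time on every benchmark in the table. Under that assumption the sketch reproduces the published 1.00x, 1.11x and 2.79x.

```python
# Timings in microseconds, copied from the summary table
# (three of the nine runtimes, to keep the example short).
TIMINGS_US = {
    "libfork": {
        "skynet(8)": 39509, "fib(39)": 67773,
        "nqueens(14)": 78595, "matmul(2048)": 41751,
    },
    "TooManyCooks": {
        "skynet(8)": 46285, "fib(39)": 82517,
        "nqueens(14)": 83610, "matmul(2048)": 41608,
    },
    "tbb": {
        "skynet(8)": 141389, "fib(39)": 269588,
        "nqueens(14)": 163150, "matmul(2048)": 64036,
    },
}


def mean_ratio_to_best(timings):
    """Assumed formula: average over benchmarks of (time / best time)."""
    benchmarks = list(next(iter(timings.values())).keys())
    # Fastest time recorded for each benchmark across all runtimes.
    best = {b: min(t[b] for t in timings.values()) for b in benchmarks}
    return {
        runtime: sum(t[b] / best[b] for b in benchmarks) / len(benchmarks)
        for runtime, t in timings.items()
    }


if __name__ == "__main__":
    for runtime, ratio in mean_ratio_to_best(TIMINGS_US).items():
        print(f"{runtime}: {ratio:.2f}x")
```

Because the ratio is averaged per benchmark, a runtime that is very slow on a single benchmark is penalized heavily, which is consistent with the large values at the right-hand end of the table.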

Machine configuration used in the summary table:

Processor: EPYC 7742 64-core processor
Worker Thread Count: 64 (no SMT)
OS: Debian 13 Server
Compiler: Clang 21.1.7 Release (-O3 -march=native)
CPU boost enabled / schedutil governor
Linked against libtcmalloc_minimal.so.4

What's covered?

Currently, the suite covers only C++ frameworks and several recursive fork-join benchmarks:

recursive fibonacci (forks x2)
skynet, based on the original benchmark but increased to 100M tasks (forks x10)
nqueens (forks up to x14)
matmul (forks x4)
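
To make the fork counts above concrete, here is a sequential Python sketch of the recursion shape of three of the benchmarks. It is only an illustration under assumptions: the real benchmarks are written in C++ against each runtime's API, and the function names, parameters, and exact formulations here are hypothetical. Each recursive call marks a point where a fork-join runtime would spawn a child task and later join on its result.

```python
def fib(n):
    """Recursive Fibonacci: every non-leaf call forks 2 children."""
    if n < 2:
        return n
    return fib(n - 1) + fib(n - 2)


def skynet(base, depth, max_depth, fanout=10):
    """Skynet: every non-leaf call forks `fanout` children.

    Leaves return their own index and parents sum their children, so
    max_depth=8 with fanout=10 yields 10**8 (100M) leaf tasks.
    """
    if depth == max_depth:
        return base
    stride = fanout ** (max_depth - depth - 1)
    return sum(
        skynet(base + i * stride, depth + 1, max_depth, fanout)
        for i in range(fanout)
    )


def nqueens(n, row=0, cols=0, diag1=0, diag2=0):
    """Count N-Queens solutions: each row forks up to n children,
    one per column that is not attacked by an earlier queen."""
    if row == n:
        return 1
    total = 0
    for col in range(n):
        c = 1 << col
        d1 = 1 << (row + col)
        d2 = 1 << (row - col + n - 1)
        if cols & c or diag1 & d1 or diag2 & d2:
            continue  # square is attacked, no task would be forked
        total += nqueens(n, row + 1, cols | c, diag1 | d1, diag2 | d2)
    return total


if __name__ == "__main__":
    # Small sizes only; the benchmark sizes are far too slow in Python.
    print(fib(10), skynet(0, 0, 3), nqueens(6))
```

The tasks in these workloads do very little work each, so the measured time is dominated by how cheaply a runtime can create, schedule, and join tasks.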

Benchmark problem sizes were chosen to balance two goals: keeping the total runtime of a full sweep tolerable (especially on weaker hardware with slower runtimes), and being large enough to show meaningful differences between the faster runtimes.

How to build and run the benchmarks yourself

Install Dependencies:

The build+bench script uses python3. The only Python dependency is libyaml.
CMake and Clang 18 or newer.
libfork and TooManyCooks depend on the hwloc library.
TBB benchmarks depend on a system-installed TBB. See the TBB installation guide for the newest version, or you may be able to find the older 'libtbb-dev' package in your system package manager.
boost::cobalt requires Boost 1.82 or newer. You may need to build Boost from source, since cobalt is currently not included in distro packages.
A high performance allocator (tcmalloc, jemalloc, or mimalloc) is also recommended. The build script will dynamically link to any of these if they are available.
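
Before building, it can help to check which of these dependencies are visible on the machine. The following is a minimal sketch using only the Python standard library. The helper name `check_dependencies` and the exact tool and library names probed are assumptions for illustration, and this is not how the build script itself detects them. Note that `find_library` only looks for shared libraries, so a hit does not guarantee the development headers are installed.

```python
import shutil
from ctypes.util import find_library

# Names are assumptions based on the dependency list; adjust for your system.
TOOLS = ["python3", "cmake", "clang++"]
LIBRARIES = ["hwloc", "tbb", "tcmalloc_minimal", "jemalloc", "mimalloc"]


def check_dependencies(tools=TOOLS, libraries=LIBRARIES):
    """Return a dict mapping each dependency to its location, or None."""
    found = {}
    for tool in tools:
        # shutil.which returns the executable path, or None if not on PATH.
        found[tool] = shutil.which(tool)
    for lib in libraries:
        # find_library returns a library name or path, or None if not found.
        found["lib" + lib] = find_library(lib)
    return found


if __name__ == "__main__":
    for name, location in check_dependencies().items():
        status = location if location else "NOT FOUND"
        print(f"{name:22s} {status}")
```

Only one of the three allocators needs to be present, and a missing optional library simply means the corresponding benchmarks are skipped.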

On Debian/Ubuntu: sudo apt-get install cmake hwloc libhwloc-dev intel-oneapi-tbb-devel libtcmalloc-minimal4

On macOS: brew install cmake gperftools hwloc libyaml tbb

Get Quick Results (uses threads = #CPUs):

NOTE: If a particular library or benchmark fails to build or run, don't worry - its output will simply be ignored.

python3 ./build_and_bench_all.py

Results will appear in the RESULTS.md and RESULTS.csv files.

Get Full Results (sweeps threads from 1 to #CPUs):

python3 ./build_and_bench_all.py full

Results will also appear in the RESULTS.json file, which can be parsed by the interactive benchmarks site. A locally viewable HTML version of the chart will be generated as well.

Future Plans

Frameworks to come:

(C#) .NET thread pool
(Rust) tokio
(Golang) goroutines
Facebook Folly
PhotonLibOS https://github.com/alibaba/PhotonLibOS

Benchmarks to come:

Lots of good inspiration here

