39-problem benchmark

Emerging

1papers using it

2026first seen

The '39-problem benchmark' is a dataset used to evaluate the performance of formal reasoning systems by testing their ability to solve a set of predefined logical problems.

🔎 Find this dataset

39-problem benchmark — datasets — time-series