← all datasets

39-problem benchmark

Emerging
1papers using it
2026first seen

The '39-problem benchmark' is a dataset used to evaluate the performance of formal reasoning systems by testing their ability to solve a set of predefined logical problems.

39-problem benchmark β€” datasets β€” time-series