← all datasets

MATH

Canonical
43papers using it
2024first seen

The 'MATH' dataset is a benchmark that contains a collection of mathematical problems used to evaluate the performance of large language models in solving mathematical reasoning tasks.

Papers using MATH (43)

MATH β€” datasets β€” llm-papers