← all datasets

GSM8K

Emerging
3papers using it
2026first seen

The 'GSM-8K' dataset is a benchmark that contains a collection of mathematical reasoning problems used to evaluate the performance of large language models in solving such problems.

GSM8K β€” datasets β€” graph-learning