
GSM8K

Emerging
6 papers using it
First seen in 2024

GSM8K (Grade School Math 8K) is a benchmark dataset of roughly 8,500 grade-school math word problems, each requiring several steps of arithmetic reasoning. It is used to evaluate the mathematical reasoning capabilities of language models.
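As a rough illustration of how such an evaluation is often scored: reference solutions in the publicly released GSM8K data end with a line of the form `#### <answer>`, so a common approach is to extract that final number and compare it with the model's final number. The sketch below assumes the model output follows the same `####` convention; the helper names `extract_final_answer` and `exact_match_accuracy` and the toy strings are hypothetical, not part of any official tooling.

```python
import re
from typing import List, Optional

# Matches the number that follows the "####" marker, e.g. "#### 1,250" or "#### -3.5".
_ANSWER_RE = re.compile(r"####\s*(-?[\d,]*\.?\d+)")


def extract_final_answer(solution: str) -> Optional[str]:
    """Return the number after the first '####' marker, or None if there is none."""
    match = _ANSWER_RE.search(solution)
    if match is None:
        return None
    # Drop thousands separators so "1,250" and "1250" compare equal.
    return match.group(1).replace(",", "")


def exact_match_accuracy(model_outputs: List[str], references: List[str]) -> float:
    """Fraction of items whose extracted final answers are numerically equal.

    Assumes (hypothetically) that model outputs use the same '####' convention
    as the reference solutions; outputs without a marker count as wrong.
    """
    if len(model_outputs) != len(references):
        raise ValueError("model_outputs and references must have the same length")
    correct = 0
    for output, reference in zip(model_outputs, references):
        predicted = extract_final_answer(output)
        expected = extract_final_answer(reference)
        if predicted is not None and expected is not None:
            if float(predicted) == float(expected):
                correct += 1
    return correct / len(references) if references else 0.0


if __name__ == "__main__":
    # Toy strings for illustration only; these are not items from the dataset.
    refs = ["3 + 4 = 7\n#### 7", "5 * 250 = 1250\n#### 1,250"]
    outs = ["The total is 7.\n#### 7", "I get 1200.\n#### 1200"]
    print(exact_match_accuracy(outs, refs))
```

Exact match on the final number is only one possible scoring choice; it ignores whether the intermediate reasoning steps are correct.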

Papers using GSM8K (6)
