GSM8K

Canonical
104 papers using it
2024 first seen

The GSM8K dataset is a benchmark of grade-school math word problems designed to evaluate how well large language models (LLMs) perform multi-step mathematical reasoning. The problems are text-only, so the benchmark is not specific to multimodal models.

Papers using GSM8K (104)
