GSM8K

Emerging
37 papers using it
2022 first seen

GSM8K is a benchmark dataset of grade-school math word problems, used to evaluate how well language models perform multi-step mathematical reasoning.

Papers using GSM8K (37)