GSM8K
Emerging · 3 papers using it
First seen: 2026
GSM8K (also written GSM-8K) is a benchmark of grade-school math word problems used to evaluate how well large language models perform multi-step mathematical reasoning.