APPS
Emerging2papers using it
2026first seen
The 'APPS' dataset is a benchmark used to evaluate the reasoning capabilities of language models through a collection of programming problems.
The 'APPS' dataset is a benchmark used to evaluate the reasoning capabilities of language models through a collection of programming problems.