← all datasets

APPS

Emerging

2papers using it

2026first seen

The 'APPS' dataset is a benchmark used to evaluate the reasoning capabilities of language models through a collection of programming problems.

🔎 Find this dataset

Papers using APPS (2)

PERSA: Reinforcement Learning for Professor-Style Personalized Feedback with LLMs2026

R$^2$PO: Decoupling Training Trajectories from Inference Responses for LLM Reasoning2026

APPS — datasets — reinforcement-learning