Countdown
Emerging
7 papers using it
First seen: 2025
Papers using Countdown (7)
- Discrete Tilt Matching
- Escaping the Verifier: Learning to Reason via Demonstrations
- TRE: Encouraging Exploration in the Trust Region
- Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective
- RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs
- To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning
- How Does RL Post-training Induce Skill Composition? A Case Study on Countdown