TAU-bench
Emerging5papers using it
2025first seen
Papers using TAU-bench (5)
- Self-Challenging Language Model AgentsMulti-Turn Reinforcement Learning for Tool-Calling Agents with Iterative Reward CalibrationCM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool UseGeneralizable End-to-End Tool-Use RL with Synthetic CodeGymRobust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors