#ModelSuccess ratePaper
1Hera: Learning Long-Horizon Coordination for Device-Cloud Collaborative LLM Agents92.50β€”
2What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents90.00β€”
3Hindsight Credit Assignment for Long-Horizon LLM Agents13.80β€”
4SKILLC: Learning Autonomous Skill Internalization in LLM Agents via Contrastive Credit Assignment5.50β€”
ALFWorld alfworld Leaderboard