ALFWorld alfworld Leaderboard
Auto-discovered from papers reporting ALFWorld (Success rate). Β· Metric: Success rate (higher is better)
| # | Model | Success rate | Paper |
|---|---|---|---|
| 1 | Hera: Learning Long-Horizon Coordination for Device-Cloud Collaborative LLM Agents | 92.50 | β |
| 2 | What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents | 90.00 | β |
| 3 | Hindsight Credit Assignment for Long-Horizon LLM Agents | 13.80 | β |
| 4 | SKILLC: Learning Autonomous Skill Internalization in LLM Agents via Contrastive Credit Assignment | 5.50 | β |