WebShop webshop Leaderboard
Auto-discovered from papers reporting WebShop (Success rate). Β· Metric: Success rate (higher is better)
| # | Model | Success rate | Paper |
|---|---|---|---|
| 1 | What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents | 80.10 | β |
| 2 | Hindsight Credit Assignment for Long-Horizon LLM Agents | 7.70 | β |
| 3 | SKILLC: Learning Autonomous Skill Internalization in LLM Agents via Contrastive Credit Assignment | 4.40 | β |