Yuxiao Dong
20 papers · 4530 citations
Most-cited papers
- GLM-130B: An Open Bilingual Pre-trained Model2022 · 1258 citations
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding2023 · 1169 citations
- Agentbench: Evaluating Llms As Agents2023 · 710 citations
- Lvbench: An Extreme Long Video Understanding Benchmark2024 · 304 citations
- Agenttuning: Enabling Generalized Agent Abilities For Llms2023 · 296 citations
- Agenttuning: Enabling Generalized Agent Abilities For Llms2023 · 21 citations
- Agentbench: Evaluating Llms As Agents2023
- Computerrl: Scaling End-to-end Online Reinforcement Learning For Computer Use Agents2025
- Logicgame: Benchmarking Rule-based Reasoning Abilities Of Large Language Models2024
Topics