Yuxuan Wang
18 papers Β· 0 citations
Most-cited papers
- Video-salmonn: Speech-enhanced Audio-visual Large Language Models2024 Β· 90 citations
- Hawkeye: Training Video-text Llms For Grounding Text In Videos2024 Β· 80 citations
- Halo: Estimation And Reduction Of Hallucinations In Open-source Weak Large Language Models2023 Β· 46 citations
- Llama Rider: Spurring Large Language Models To Explore The Open World2023 Β· 26 citations
- Efficient Temporal Extrapolation Of Multimodal Large Language Models With Temporal Grounding Bridge2024 Β· 16 citations
- Qwen3-vl Technical Report2025
- Hunyuan3d Studio: End-to-end AI Pipeline For Game-ready 3D Asset Generation2025
- Omnivideobench: Towards Audio-visual Understanding Evaluation For Omni Mllms2025
- Sounding That Object: Interactive Object-aware Image To Audio Generation2025
- Physcodebench: Benchmarking Physics-aware Symbolic Simulation Of 3D Scenes Via Self-corrective Multi-agent Refinement2026
Topics