Xiangyu Yue
19 papers · 0 citations
Most-cited papers
- Onellm: One Framework To Align All Modalities With Language2023 · 231 citations
- Imagebind-llm: Multi-modality Instruction Tuning2023 · 174 citations
- Lumina-next: Making Lumina-t2x Stronger And Faster With Next-dit2024 · 123 citations
- Chemllm: A Chemical Large Language Model2024 · 106 citations
- Onellm: One Framework To Align All Modalities With Language2023 · 79 citations
- Fira: Can We Achieve Full-rank Training Of Llms Under Low-rank Constraint?2024 · 38 citations
- Ditctrl: Exploring Attention Control In Multi-modal Diffusion Transformer For Tuning-free Multi-prompt Longer Video Generation2024 · 8 citations
- Training Matting Models Without Alpha Labels2024 · 2 citations
- Screencoder: Advancing Visual-to-code Generation For Front-end Automation Via Modular Multimodal Agents2025
- Scalecua: Scaling Open-source Computer Use Agents With Cross-platform Data2025
- Mmbench-gui: Hierarchical Multi-platform Evaluation Framework For GUI Agents2025
- Exploring Reasoning Reward Model For Agents2026
- Onethinker: All-in-one Reasoning Model For Image And Video2026
- Onethinker: All-in-one Reasoning Model For Image And Video2026
- Onethinker: All-in-one Reasoning Model For Image And Video2026
Topics