Siyuan Huang
23 papers · 2 citations
Most-cited papers
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models2023 · 288 citations
- SPHINX-X: Scaling Data And Parameters For A Family Of Multi-modal Large Language Models2024 · 149 citations
- Advancing 3D Scene Understanding With Mv-scanqa Multi-view Reasoning Evaluation And Tripalign Pre-training Dataset2025 · 1 citations
- Persistent Visual Memory: Sustaining Perception For Deep Generation In Lvlms2026
Topics