Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Lewei Lu

10 papers · 0 citations
Most-cited papers
  • How Far Are We To GPT-4V? Closing The Gap To Commercial Multimodal Models With Open-source Suites
    2024 · 339 citations
  • Fuseformer: Fusing Fine-grained Information In Transformers For Video Inpainting
    2021 · 143 citations
  • Visionllm V2: An End-to-end Generalist Multimodal Large Language Model For Hundreds Of Vision-language Tasks
    2024 · 5 citations
  • Synergen-vl: Towards Synergistic Image Understanding And Generation With Vision Experts And Token Folding
    2024 · 4 citations
  • PVC: Progressive Visual Token Compression For Unified Image And Video Processing In Large Vision-language Models
    2024 · 2 citations
  • From Pixels To Words -- Towards Native Vision-language Primitives At Scale
    2025
  • Streamline Without Sacrifice -- Squeeze Out Computation Redundancy In LMM
    2025
  • Spatial Preference Rewarding For Mllms Spatial Understanding
    2025
  • Scaling Spatial Intelligence With Multimodal Foundation Models
    2025
  • Sensenova-mars: Empowering Multimodal Agentic Reasoning And Search Via Reinforcement Learning
    2025
Topics
Vision-Language ModelsVisual LanguageVideo UnderstandingImage Generation3D VisionImage RestorationObject DetectionVideo-LanguageInstruction TuningBenchmarks

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.