Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Song Han

15 papers · 0 citations
Most-cited papers
  • Efficient Streaming Language Models With Attention Sinks
    2023 · 1654 citations
  • VILA: On Pre-training For Visual Language Models
    2023 · 803 citations
  • Longlora: Efficient Fine-tuning Of Long-context Large Language Models
    2023 · 253 citations
  • Duoattention: Efficient Long-context LLM Inference With Retrieval And Streaming Heads
    2024 · 211 citations
  • Scaling Vision Pre-training To 4K Resolution
    2025 · 3 citations
  • Streamingvlm: Real-time Understanding For Infinite Video Streams
    2025
  • DC-AE 1.5: Accelerating Diffusion Model Convergence With Structured Latent Space
    2025
  • Dc-gen: Post-training Diffusion Acceleration With Deeply Compressed Latent Space
    2025
  • Sana-video: Efficient Video Generation With Block Linear Diffusion Transformer
    2025
  • Nemotron 3 Nano Omni: Efficient And Open Multimodal Intelligence
    2026
  • Sparse Videogen2: Accelerate Video Generation With Sparse Attention Via Semantic-aware Permutation
    2026
  • Sparse Videogen2: Accelerate Video Generation With Sparse Attention Via Semantic-aware Permutation
    2026
  • EGM: Efficient Visual Grounding Language Models
    2026
Topics
EfficiencyModel ArchitectureTraining TechniquesVideo UnderstandingVisual LanguageIn-Context LearningUncategorizedVision-Language ModelsVision-LanguageFine-Tuning

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.