Awesome Speech Audio

Hongxu Yin

13 papers · 86 citations
Most-cited papers
  • RegionGPT: Towards Region Understanding Vision Language Model
    2024 · 48 citations
  • LITA: Language Instructed Temporal-localization Assistant
    2024 · 35 citations
  • Scaling Vision Pre-training To 4K Resolution
    2025 · 3 citations
  • Nemotron 3 Nano Omni: Efficient And Open Multimodal Intelligence
    2026
Topics
  • Visual Language
  • Video Understanding
  • 3D Vision
  • Object Detection

