Awesome Speech Audio

Bei Yu

10 papers · 0 citations
Most-cited papers
  • Visionthink: Smart And Efficient Vision Language Model Via Reinforcement Learning
    2025
  • Unimoco: Unified Modality Completion For Robust Multi-modal Embeddings
    2025
  • Rtime-qa: A Benchmark For Atomic Temporal Event Understanding In Large Multi-modal Models
    2025
  • Visionreasoner: Unified Reasoning-integrated Visual Perception Via Reinforcement Learning
    2025
  • Visurf: Visual Supervised-and-reinforcement Fine-tuning For Large Vision-and-language Models
    2025
Topics
  • Vision-Language Models
  • Video-Language
  • Image-Text Retrieval
  • Benchmarks
  • Instruction Tuning
  • Visual QA & Reasoning

© 2026 Awesome Papers.