Awesome Speech Audio
πŸ“„Papers🧭TopicsπŸ‘₯AuthorsπŸ”₯TrendingπŸ—ΊοΈMapπŸ†LeaderboardsπŸ“šPacksπŸ› οΈToolsπŸ“BlogsπŸ€–Ask AIβœ‰οΈNewsletterπŸš€Pro
+ Add Paper

← authors Β· overview

Sihan Yang

11 papers Β· 0 citations
Most-cited papers
  • RICO: Improving Accuracy And Completeness In Image Recaptioning Via Visual Reconstruction
    2025
  • Small-large Collaboration: Training-efficient Concept Personalization For Large VLM Using A Meta Personalized Small VLM
    2025
  • Unictokens: Boosting Personalized Understanding And Generation Via Unified Concept Tokens
    2025
  • Vidbridge-r1: Bridging QA And Captioning For Rl-based Video Understanding Models With Intermediate Proxy Tasks
    2025
  • GRAN-TED: Generating Robust, Aligned, And Nuanced Text Embedding For Diffusion Models
    2025
Topics
Vision-Language ModelsVisual QA & ReasoningImage-Text RetrievalInstruction TuningVideo-LanguageBenchmarks

Stay Updated

E-Mail Digest

Submit a paper Β· Privacy Β· Terms

Β© 2026 Awesome Papers.