Awesome Speech Audio

Yuanxing Zhang

22 papers · 0 citations
Most-cited papers
  • Scone: Bridging Composition And Distinction In Subject-driven Image Generation Via Unified Understanding-generation Modeling
    2025
  • RICO: Improving Accuracy And Completeness In Image Recaptioning Via Visual Reconstruction
    2025
  • Small-large Collaboration: Training-efficient Concept Personalization For Large VLM Using A Meta Personalized Small VLM
    2025
  • Diadem: Advancing Dialogue Descriptions In Audiovisual Video Captioning For Multimodal Large Language Models
    2026
Topics
  • Vision-Language Models
  • Visual QA & Reasoning
  • Benchmarks
  • Image-Text Retrieval
  • Instruction Tuning
  • Audio-Visual
  • Video-Language
