Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Yi Yu

12 papers · 0 citations
Most-cited papers
  • Deep Cross-modal Correlation Learning For Audio And Lyrics In Music Retrieval
    2017 · 74 citations
  • Deep Triplet Neural Networks With Cluster-cca For Audio-visual Cross-modal Retrieval
    2019 · 47 citations
  • Audio-visual Embedding For Cross-modal Musicvideo Retrieval Through Supervised Deep CCA
    2019 · 38 citations
  • Variational Autoencoder With CCA For Audio-visual Cross-modal Retrieval
    2021 · 20 citations
  • Anchor-aware Deep Metric Learning For Audio-visual Retrieval
    2024 · 6 citations
  • Semantic Frame Aggregation-based Transformer For Live Video Comment Generation
    2025
  • TEAR: Temporal-aware Automated Red-teaming For Text-to-video Models
    2025
  • SAVER: Mitigating Hallucinations In Large Vision-language Models Via Style-aware Visual Early Revision
    2025
  • Memverse: Multimodal Memory For Lifelong Learning Agents
    2025
  • Reallocating Attention Across Layers To Reduce Multimodal Hallucination
    2025
Topics
Cross-Modal HashingImage RetrievalVideo-LanguageVision-Language ModelsVisual QA & ReasoningBenchmarksEmbodied & Agents

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.