Yi Yu
12 papers · 0 citations
Most-cited papers
- Deep Cross-modal Correlation Learning For Audio And Lyrics In Music Retrieval · 2017 · 74 citations
- Deep Triplet Neural Networks With Cluster-CCA For Audio-visual Cross-modal Retrieval · 2019 · 47 citations
- Audio-visual Embedding For Cross-modal Music Video Retrieval Through Supervised Deep CCA · 2019 · 38 citations
- Variational Autoencoder With CCA For Audio-visual Cross-modal Retrieval · 2021 · 20 citations
- Anchor-aware Deep Metric Learning For Audio-visual Retrieval · 2024 · 6 citations
- Semantic Frame Aggregation-based Transformer For Live Video Comment Generation · 2025
- TEAR: Temporal-aware Automated Red-teaming For Text-to-video Models · 2025
- SAVER: Mitigating Hallucinations In Large Vision-language Models Via Style-aware Visual Early Revision · 2025
- Memverse: Multimodal Memory For Lifelong Learning Agents · 2025
- Reallocating Attention Across Layers To Reduce Multimodal Hallucination · 2025
Topics