Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Chen Sun

5 papers · 815 citations
Most-cited papers
  • Multi-modal Transformer For Video Retrieval
    2020 · 393 citations
  • Composing Text And Image For Image Retrieval - An Empirical Odyssey
    2018 · 311 citations
  • Multiview Transformers For Video Recognition
    2022 · 259 citations
  • REVEAL: Retrieval-augmented Visual-language Pre-training With Multi-source Multimodal Knowledge Memory
    2022 · 65 citations
  • Learning Audio-video Modalities From Image Captions
    2022 · 46 citations
  • How Can Objects Help Action Recognition?
    2023 · 21 citations
  • Motif: Making Text Count In Image Animation With Motion Focal Loss
    2024 · 1 citations
  • Steerable Equivariant Representation Learning
    2023
  • EIMC: Efficient Instance-aware Multi-modal Collaborative Perception
    2026
Topics
Image RetrievalVideo UnderstandingCross-Modal HashingObject DetectionImage GenerationVisual LanguageAutonomous Driving3D Vision

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.