Chen Sun
5 papers · 815 citations
Most-cited papers
- Multi-modal Transformer For Video Retrieval2020 · 393 citations
- Composing Text And Image For Image Retrieval - An Empirical Odyssey2018 · 311 citations
- Multiview Transformers For Video Recognition2022 · 259 citations
- REVEAL: Retrieval-augmented Visual-language Pre-training With Multi-source Multimodal Knowledge Memory2022 · 65 citations
- Learning Audio-video Modalities From Image Captions2022 · 46 citations
- How Can Objects Help Action Recognition?2023 · 21 citations
- Motif: Making Text Count In Image Animation With Motion Focal Loss2024 · 1 citations
- Steerable Equivariant Representation Learning2023
- EIMC: Efficient Instance-aware Multi-modal Collaborative Perception2026
Topics