David Harwath
9 papers · 342 citations
Most-cited papers
- Everything At Once -- Multi-modal Fusion Transformer For Video Retrieval · 2021 · 126 citations
- Jointly Discovering Visual Objects And Spoken Words From Raw Sensory Input · 2018 · 79 citations
- Multimodal Clustering Networks For Self-supervised Learning From Unlabeled Videos · 2021 · 58 citations
- AVLnet: Learning Audio-visual Language Representations From Instructional Videos · 2020 · 51 citations
- Why Is Winoground Hard? Investigating Failures In Visuolinguistic Compositionality · 2022 · 11 citations