Awesome Speech Audio

David Harwath

9 papers · 342 citations
Most-cited papers
  • Everything At Once -- Multi-modal Fusion Transformer For Video Retrieval
    2021 · 126 citations
  • Jointly Discovering Visual Objects And Spoken Words From Raw Sensory Input
    2018 · 79 citations
  • Multimodal Clustering Networks For Self-supervised Learning From Unlabeled Videos
    2021 · 58 citations
  • AVLnet: Learning Audio-visual Language Representations From Instructional Videos
    2020 · 51 citations
  • Why Is Winoground Hard? Investigating Failures In Visuolinguistic Compositionality
    2022 · 11 citations
Topics
Image Retrieval · Uncategorized · Supervised Hashing
