Awesome Multimodal
π
Papers
π§
Topics
π₯
Trending
πΊοΈ
Map
π
Leaderboards
π
Learn
π€
Ask AI
β―
More
π₯
Authors
π
Reading Packs
π
Datasets
π οΈ
Tools
π°
News
π
Blogs
βοΈ
Newsletter
π
Saved
+ Add Paper
βΎ
β
β all datasets
WavCaps
Emerging
3
papers using it
2024
first seen
π Find this dataset
Papers using WavCaps (3)
AC/DC: LLM-based Audio Comprehension via Dialogue Continuation
2025
Representation Learning for Semantic Alignment of Language, Audio, and Visual Modalities
2025
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation
2024
π€
Ask AI
WavCaps β datasets β multimodal