
AudioCaps

Emerging
11 papers using it
2023 first seen

AudioCaps is a dataset used to evaluate video-to-audio generation methods by providing audio clips paired with descriptive captions.

Papers using AudioCaps (11)
