ClothoV-2
Emerging1papers using it
2024first seen
ClothoV-2 is an audio-caption dataset that contains pairs of audio recordings and their corresponding textual descriptions, used to evaluate the performance of audio retrieval systems.
ClothoV-2 is an audio-caption dataset that contains pairs of audio recordings and their corresponding textual descriptions, used to evaluate the performance of audio retrieval systems.