Clotho
Emerging
1 paper using it
First seen: 2024
Clotho is a benchmark dataset for audio-text retrieval. It pairs audio clips with text captions and is used to evaluate how well a model retrieves the relevant caption for a given audio clip (audio-to-text) and the relevant audio clip for a given caption (text-to-audio).
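As a rough illustration of how retrieval in both directions might be scored, the sketch below computes Recall@K from a matrix of audio-caption similarity scores. This entry does not name a metric, so the choice of Recall@K is an assumption. The helper names (`recall_at_k`, `transpose`) and the toy scores are hypothetical and do not come from Clotho or any specific library.

```python
def recall_at_k(similarity, ground_truth, k):
    """Fraction of queries whose top-k candidates contain a correct match.

    similarity: list of rows, one per query; each row holds one score per
                candidate (higher means more similar).
    ground_truth: one set of correct candidate indices per query.
    """
    hits = 0
    for scores, truth in zip(similarity, ground_truth):
        # Rank candidate indices from highest to lowest score.
        ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
        if truth & set(ranked[:k]):
            hits += 1
    return hits / len(similarity)


def transpose(matrix):
    """Swap queries and candidates so the other direction can be scored."""
    return [list(column) for column in zip(*matrix)]


# Toy scores (made up): rows are audio clips, columns are captions,
# and caption i is assumed to be the correct match for clip i.
audio_to_text = [
    [0.9, 0.1, 0.2],
    [0.3, 0.2, 0.8],
    [0.1, 0.7, 0.6],
]
truth = [{0}, {1}, {2}]

r1_audio_to_text = recall_at_k(audio_to_text, truth, k=1)
r2_audio_to_text = recall_at_k(audio_to_text, truth, k=2)
r1_text_to_audio = recall_at_k(transpose(audio_to_text), truth, k=1)
```

Each query carries a set of correct indices rather than a single index, so the same function would still apply if a clip has several valid captions.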