Clotho
Emerging
5 papers using it
First seen: 2024
Papers using Clotho (5)
- Enclap: Combining Neural Audio Codec And Audio-text Joint Embedding For Automated Audio Captioning
- GLAP: General contrastive audio-text pretraining across domains and languages
- AISTAT lab system for DCASE2025 Task6: Language-based audio retrieval
- EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning
- Language-based Audio Retrieval with Co-Attention Networks