← all datasets

ACAVCaps

Emerging
2papers using it
2025first seen

ACAVCaps is a large-scale, fine-grained, and multi-faceted audio captioning dataset designed to evaluate general audio understanding in large audio-language models.

Papers using ACAVCaps (2)

ACAVCaps β€” datasets β€” speech-audio