ACAVCaps
Emerging2papers using it
2025first seen
ACAVCaps is a large-scale, fine-grained, and multi-faceted audio captioning dataset designed to evaluate general audio understanding in large audio-language models.
ACAVCaps is a large-scale, fine-grained, and multi-faceted audio captioning dataset designed to evaluate general audio understanding in large audio-language models.