MMSU
Emerging6papers using it
2025first seen
Papers using MMSU (6)
- DIFFA-2: A Practical Diffusion Large Language Model for General Audio UnderstandingALARM: Audio-Language Alignment for Reasoning ModelsClosing the Modality Reasoning Gap for Speech Large Language ModelsMiMo-Audio: Audio Language Models are Few-Shot LearnersTASU: Text-Only Alignment for Speech UnderstandingDIFFA: Large Language Diffusion Models Can Listen and Understand