AISHELL-4
Emerging6papers using it
2022first seen
AISHELL-4 is a dataset used to evaluate speaker diarization and recognition performance in large audio-language models.
Papers using AISHELL-4 (6)
- Speaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASRJoint Learning Global-Local Speaker Classification to Enhance End-to-End Speaker Diarization and RecognitionLightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic TransformAsobo: Attentive Beamformer Selection For Distant Speaker Diarization In MeetingsToken-level Speaker Change Detection Using Speaker Difference and Speech
Content via Continuous Integrate-and-fireASoBO: Attentive Beamformer Selection for Distant Speaker Diarization in
Meetings