The CHiME-7 DASR Challenge: Distant Meeting Transcription With Multiple Devices In Diverse Scenarios
2023 · Samuele Cornell, Matthew Wiesner, Shinji Watanabe, et al.
Abstract
The CHiME challenges have played a significant role in the development and evaluation of robust automatic speech recognition (ASR) systems. We introduce the CHiME-7 distant ASR (DASR) task within the 7th CHiME challenge. This task comprises joint ASR and diarization in far-field settings with multiple, and possibly heterogeneous, recording devices. Unlike previous challenges, we evaluate systems on three diverse scenarios: CHiME-6, DiPCo, and Mixer 6. The goal is for participants to devise a single system that can generalize across different array geometries and use cases with no a priori information. Another departure from earlier CHiME iterations is that participants are allowed to use open-source pre-trained models and datasets. In this paper, we describe the challenge design, motivation, and fundamental research questions in detail. We also present the baseline system, which is fully array-topology agnostic and features multi-channel diarization, channel selection, guided sour
Related papers
- Automatic Channel Selection And Spatial Feature Integration For Multi-channel Speech Recognition Across Various Array Topologies (2023)
- NTT Speaker Diarization System For CHiME-7: Multi-domain, Multi-microphone End-to-end And Vector Clustering Diarization (2023)
- The Second DIHARD Diarization Challenge: Dataset, Task, And Baselines (2019)
- Towards A Competitive End-to-end Speech Recognition For CHiME-6 Dinner Party Transcription (2020)
- The Royalflush Automatic Speech Diarization And Recognition System For In-car Multi-channel Automatic Speech Recognition Challenge (2024)
- Summary On The ICASSP 2022 Multi-channel Multi-party Meeting Transcription Grand Challenge (2022)
- Multi-microphone Automatic Speech Segmentation In Meetings Based On Circular Harmonics Features (2023)
- DISPLACE Challenge: Diarization Of Speaker And Language In Conversational Environments (2023)