Royalflush Speaker Diarization System For ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge
2022 Β· Jingguang Tian, Xinhui Hu, Xinkang Xu
Abstract
This paper describes the Royalflush speaker diarization system submitted to the Multi-channel Multi-party Meeting Transcription Challenge(M2MeT). Our system comprises speech enhancement, overlapped speech detection, speaker embedding extraction, speaker clustering, speech separation and system fusion. In this system, we made three contributions. First, we propose an architecture of combining the multi-channel and U-Net-based models, aiming at utilizing the benefits of these two individual architectures, for far-field overlapped speech detection. Second, in order to use overlapped speech detection model to help speaker diarization, a speech separation based overlapped speech handling approach, in which the speaker verification technique is further applied, is proposed. Third, we explore three speaker embedding methods, and obtained the state-of-the-art performance on the CNCeleb-E test set. With these proposals, our best individual system significantly reduces DER from 15.25% to 6.40%,
Authors
(none)
Tags
Stats
Related papers
- The CUHK-TENCENT Speaker Diarization System For The ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (2022)7.81
- The Volcspeech System For The ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (2022)5.84
- The Ustc-ximalaya System For The ICASSP 2022 Multi-channel Multi-party Meeting Transcription (m2met) Challenge (2022)6.34
- The Xmuspeech System For Multi-channel Multi-party Meeting Transcription Challenge (2022)0.00
- The Royalflush Automatic Speech Diarization And Recognition System For In-car Multi-channel Automatic Speech Recognition Challenge (2024)0.00
- Microsoft Speaker Diarization System For The Voxceleb Speaker Recognition Challenge 2020 (2020)11.93
- Cross-channel Attention-based Target Speaker Voice Activity Detection: Experimental Results For M2met Challenge (2022)10.07
- Summary On The ICASSP 2022 Multi-channel Multi-party Meeting Transcription Grand Challenge (2022)10.35