The HUAWEI Speaker Diarisation System For The Voxceleb Speaker Diarisation Challenge
2020 Β· Renyu Wang, Ruilin Tong, Yu Ting Yeung, et al.
Abstract
This paper describes system setup of our submission to speaker diarisation track (Track 4) of VoxCeleb Speaker Recognition Challenge 2020. Our diarisation system consists of a well-trained neural network based speech enhancement model as pre-processing front-end of input speech signals. We replace conventional energy-based voice activity detection (VAD) with a neural network based VAD. The neural network based VAD provides more accurate annotation of speech segments containing only background music, noise, and other interference, which is crucial to diarisation performance. We apply agglomerative hierarchical clustering (AHC) of x-vectors and variational Bayesian hidden Markov model (VB-HMM) based iterative clustering for speaker clustering. Experimental results demonstrate that our proposed system achieves substantial improvements over the baseline system, yielding diarisation error rate (DER) of 10.45%, and Jacard error rate (JER) of 22.46% on the evaluation set.
Authors
(none)
Tags
Stats
Related papers
- Microsoft Speaker Diarization System For The Voxceleb Speaker Recognition Challenge 2020 (2020)11.93
- The DKU-MSXF Diarization System For The Voxceleb Speaker Recognition Challenge 2023 (2023)5.24
- The BUCEA Speaker Diarization System For The Voxceleb Speaker Recognition Challenge 2022 (2022)0.00
- Analysis Of The BUT Diarization System For Voxconverse Challenge (2020)8.82
- North America Bixby Speaker Diarization System For The Voxceleb Speaker Recognition Challenge 2021 (2021)0.00
- Gist-aiter Speaker Diarization System For Voxceleb Speaker Recognition Challenge (voxsrc) 2023 (2023)0.00
- The Newsbridge -telecom Sudparis Voxceleb Speaker Recognition Challenge 2022 System Description (2023)0.00
- The Dku-duke-lenovo System Description For The Third DIHARD Speech Diarization Challenge (2021)0.00