Analysis Of The BUT Diarization System For Voxconverse Challenge
2020 · Federico Landini, Ondřej Glembek, Pavel Matějka, et al.
Abstract
This paper describes the system developed by the BUT team for the fourth track of the VoxCeleb Speaker Recognition Challenge, focusing on diarization on the VoxConverse dataset. The system consists of signal pre-processing, voice activity detection, speaker embedding extraction, an initial agglomerative hierarchical clustering followed by diarization using a Bayesian hidden Markov model, a reclustering step based on per-speaker global embeddings and overlapped speech detection and handling. We provide comparisons for each of the steps and share the implementation of the most relevant modules of our system. Our system scored second in the challenge in terms of the primary metric (diarization error rate) and first according to the secondary metric (Jaccard error rate).
Authors
(none)
Tags
Stats
Related papers
- The BUCEA Speaker Diarization System For The Voxceleb Speaker Recognition Challenge 2022 (2022)0.00
- The HUAWEI Speaker Diarisation System For The Voxceleb Speaker Diarisation Challenge (2020)0.00
- The DKU-MSXF Diarization System For The Voxceleb Speaker Recognition Challenge 2023 (2023)5.24
- BUT System Description For DIHARD Speech Diarization Challenge 2019 (2019)0.00
- North America Bixby Speaker Diarization System For The Voxceleb Speaker Recognition Challenge 2021 (2021)0.00
- Microsoft Speaker Diarization System For The Voxceleb Speaker Recognition Challenge 2020 (2020)11.93
- The Dku-duke-lenovo System Description For The Third DIHARD Speech Diarization Challenge (2021)0.00
- Gist-aiter Speaker Diarization System For Voxceleb Speaker Recognition Challenge (voxsrc) 2023 (2023)0.00