The Dku-duke-lenovo System Description For The Third DIHARD Speech Diarization Challenge
2021 Β· Weiqing Wang, Qingjian Lin, Danwei Cai, et al.
Abstract
In this paper, we present the submitted system for the third DIHARD Speech Diarization Challenge from the DKU-Duke-Lenovo team. Our system consists of several modules: voice activity detection (VAD), segmentation, speaker embedding extraction, attentive similarity scoring, agglomerative hierarchical clustering. In addition, the target speaker VAD (TSVAD) is used for the phone call data to further improve the performance. Our final submitted system achieves a DER of 15.43% for the core evaluation set and 13.39% for the full evaluation set on task 1, and we also get a DER of 21.63% for core evaluation set and 18.90% for full evaluation set on task 2.
Authors
(none)
Tags
Stats
Related papers
- DIHARD II Is Still Hard: Experimental Results And Discussions From The DKU-LENOVO Team (2020)6.34
- The Hitachi-jhu DIHARD III System: Competitive End-to-end Neural Diarization And X-vector Clustering Systems Combined By Dover-lap (2021)0.00
- The DKU-MSXF Diarization System For The Voxceleb Speaker Recognition Challenge 2023 (2023)5.24
- UWB-NTIS Speaker Diarization System For The DIHARD II 2019 Challenge (2019)4.52
- The HUAWEI Speaker Diarisation System For The Voxceleb Speaker Diarisation Challenge (2020)0.00
- The Speed Submission To DIHARD II: Contributions & Lessons Learned (2019)0.00
- The Second DIHARD Diarization Challenge: Dataset, Task, And Baselines (2019)15.00
- Analysis Of The BUT Diarization System For Voxconverse Challenge (2020)8.82