Shanerun System Description To Voxceleb Speaker Recognition Challenge 2020
2020 Β· Shen Chen
Abstract
In this report, we describe the submission of ShaneRun's team to the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2020. We use ResNet-34 as encoder to extract the speaker embeddings, which is referenced from the open-source voxceleb-trainer. We also provide a simple method to implement optimum fusion using t-SNE normalized distance of testing utterance pairs instead of original negative Euclidean distance from the encoder. The final submitted system got 0.3098 minDCF and 5.076 % ERR for Fixed data track, which outperformed the baseline by 1.3 % minDCF and 2.2 % ERR respectively.
Authors
(none)
Tags
Stats
Related papers
- The Xx205 System For The Voxceleb Speaker Recognition Challenge 2020 (2020)0.00
- Tongji University Team For The Voxceleb Speaker Recognition Challenge 2020 (2020)0.00
- Tongji University Undergraduate Team For The Voxceleb Speaker Recognition Challenge2020 (2020)0.00
- The Kriston AI System For The Voxceleb Speaker Recognition Challenge 2022 (2022)0.00
- Clova Baseline System For The Voxceleb Speaker Recognition Challenge 2020 (2020)0.00
- Beijing ZKJ-NPU Speaker Verification System For Voxceleb Speaker Recognition Challenge 2021 (2021)0.00
- The Phonexia Voxceleb Speaker Recognition Challenge 2021 System Description (2021)0.00
- Gist-aiter Speaker Diarization System For Voxceleb Speaker Recognition Challenge (voxsrc) 2023 (2023)0.00