Robust Text-dependent Speaker Verification Via Character-level Information Preservation For The SdSV Challenge 2020
2020 · Sung Hwan Mun, Woo Hyun Kang, Min Hyun Han, et al.
Abstract
This paper describes our submission to Task 1 of the Short-duration Speaker Verification (SdSV) challenge 2020. Task 1 is a text-dependent speaker verification task, in which both the speaker and the phrase must be verified. The submitted systems were composed of TDNN-based and ResNet-based front-end architectures, in which the frame-level features were aggregated with various pooling methods (e.g., statistical, self-attentive, GhostVLAD pooling). Although conventional pooling methods provide embeddings with a sufficient amount of speaker-dependent information, our experiments show that these embeddings often lack phrase-dependent information. To mitigate this problem, we propose new pooling and score compensation methods that leverage a CTC-based automatic speech recognition (ASR) model to take the lexical content into account. Both methods showed improvement over the conventional techniques, and the best performance was achieved by fusing all the experimented systems, wh
Related papers
- The SVASR System For Text-dependent Speaker Verification (TdSV) AAIC Challenge 2024 (2024)
- Short-duration Speaker Verification (SdSV) Challenge 2021: The Challenge Evaluation Plan (2019)
- Integrating Frequency Translational Invariance In TDNNs And Frequency Positional Information In 2D ResNets To Enhance Speaker Verification (2021)
- Memory-efficient Training For Text-dependent SV With Independent Pre-trained Models (2024)
- Asymmetric And Trial-dependent Modeling: The Contribution Of LIA To SdSV Challenge Task 2 (2024)
- A Text-dependent Speaker Verification Application Framework Based On Chinese Numerical String Corpus (2023)
- Deep Speaker Embedding Learning With Multi-level Pooling For Text-independent Speaker Verification (2019)
- NPU Speaker Verification System For INTERSPEECH 2020 Far-field Speaker Verification Challenge (2020)