Deep Speaker Embeddings For Far-field Speaker Recognition On Short Utterances
2020 Β· Aleksei Gusev, Vladimir Volokhov, Tseren Andzhukaev, et al.
Abstract
Speaker recognition systems based on deep speaker embeddings have achieved significant performance in controlled conditions according to the results obtained for early NIST SRE (Speaker Recognition Evaluation) datasets. From the practical point of view, taking into account the increased interest in virtual assistants (such as Amazon Alexa, Google Home, AppleSiri, etc.), speaker verification on short utterances in uncontrolled noisy environment conditions is one of the most challenging and highly demanded tasks. This paper presents approaches aimed to achieve two goals: a) improve the quality of far-field speaker verification systems in the presence of environmental noise, reverberation and b) reduce the system qualitydegradation for short utterances. For these purposes, we considered deep neural network architectures based on TDNN (TimeDelay Neural Network) and ResNet (Residual Neural Network) blocks. We experimented with state-of-the-art embedding extractors and their training procedu
Authors
(none)
Tags
Stats
Related papers
- STC Speaker Recognition Systems For The Voices From A Distance Challenge (2019)7.81
- Deep Speaker Embedding Learning With Multi-level Pooling For Text-independent Speaker Verification (2019)0.00
- Length- And Noise-aware Training Techniques For Short-utterance Speaker Recognition (2020)0.00
- NPU Speaker Verification System For INTERSPEECH 2020 Far-field Speaker Verification Challenge (2020)7.50
- How To Improve Your Speaker Embeddings Extractor In Generic Toolkits (2018)9.76
- How To Leverage Dnn-based Speech Enhancement For Multi-channel Speaker Verification? (2022)0.00
- On Deep Speaker Embeddings For Text-independent Speaker Recognition (2018)11.93
- DNN Based Speaker Recognition On Short Utterances (2016)0.00