An Empirical Study On Text-independent Speaker Verification Based On The GE2E Method
2020 Β· Soroosh Tayebi Arasteh
Abstract
While many researchers in the speaker recognition area have started to replace the former classical state-of-the-art methods with deep learning techniques, some of the traditional i-vector-based methods are still state-of-the-art in the context of text-independent speaker verification. Google's Generalized End-to-End Loss for Speaker Verification (GE2E), a deep learning-based technique using long short-term memory units, has recently gained a lot of attention due to its speed in convergence and generalization. In this study, we aim at further studying the GE2E method and comparing different scenarios in order to investigate all of its aspects. Various experiments including the effects of a random sampling of test and enrollment utterances, test utterance duration, and the number of enrollment utterances are discussed in this article. Furthermore, we compare the GE2E method with the baseline state-of-the-art i-vector-based methods for text-independent speaker verification and show that
Authors
(none)
Tags
Stats
Related papers
- Generalized End-to-end Loss For Speaker Verification (2017)20.58
- End-to-end Trainable Self-attentive Shallow Network For Text-independent Speaker Verification (2020)0.00
- Joint Speaker Encoder And Neural Back-end Model For Fully End-to-end Automatic Speaker Verification With Multiple Enrollment Utterances (2022)0.00
- Investigation Of Using VAE For I-vector Speaker Verification (2017)0.00
- Deep Speaker Verification: Do We Need End To End? (2017)7.50
- End-to-end DNN Based Speaker Recognition Inspired By I-vector And PLDA (2017)10.35
- Comparison Of Multiple Features And Modeling Methods For Text-dependent Speaker Verification (2017)0.00
- End-to-end Attention Based Text-dependent Speaker Verification (2017)14.87