Comparison Of Multiple Features And Modeling Methods For Text-dependent Speaker Verification
2017 Β· Yi Liu, Liang He, Yao Tian, et al.
Abstract
Text-dependent speaker verification is becoming popular in the speaker recognition society. However, the conventional i-vector framework which has been successful for speaker identification and other similar tasks works relatively poorly in this task. Researchers have proposed several new methods to improve performance, but it is still unclear that which model is the best choice, especially when the pass-phrases are prompted during enrollment and test. In this paper, we introduce four modeling methods and compare their performance on the newly published RedDots dataset. To further explore the influence of different frame alignments, Viterbi and forward-backward algorithms are both used in the HMM-based models. Several bottleneck features are also investigated. Our experiments show that, by explicitly modeling the lexical content, the HMM-based modeling achieves good results in the fixed-phrase condition. In the prompted-phrase condition, GMM-HMM and i-vector/HMM are not as successful.
Authors
(none)
Tags
Stats
Related papers
- A Text-independent Speaker Verification Model: A Comparative Analysis (2017)8.60
- On Bottleneck Features For Text-dependent Speaker Verification Using X-vectors (2020)0.00
- Spoken Pass-phrase Verification In The I-vector Space (2018)0.00
- Deep Neural Network Based I-vector Mapping For Speaker Verification Using Short Utterances (2018)0.00
- Speaker Recognition With Random Digit Strings Using Uncertainty Normalized Hmm-based I-vectors (2019)8.82
- An Empirical Study On Text-independent Speaker Verification Based On The GE2E Method (2020)0.00
- End-to-end DNN Based Speaker Recognition Inspired By I-vector And PLDA (2017)10.35
- End-to-end Attention Based Text-dependent Speaker Verification (2017)14.87