Deep Neural Network Based I-vector Mapping For Speaker Verification Using Short Utterances
2018 · Jinxi Guo, Ning Xu, Kailun Qian, et al.
Abstract
Text-independent speaker recognition using short utterances is a highly challenging task due to the large variation and content mismatch between short utterances. I-vector based systems have become the standard in speaker verification applications, but they are less effective with short utterances. In this paper, we first compare two state-of-the-art universal background model training methods for i-vector modeling using full-length and short utterance evaluation tasks. The two methods are Gaussian mixture model (GMM) based and deep neural network (DNN) based methods. The results indicate that the I-vector_DNN system outperforms the I-vector_GMM system under various durations. However, the performances of both systems degrade significantly as the duration of the utterances decreases. To address this issue, we propose two novel nonlinear mapping methods which train DNN models to map the i-vectors extracted from short utterances to their corresponding long-utterance i-vectors. The mapped
Related papers
- End-to-end DNN Based Speaker Recognition Inspired By I-vector And PLDA (2017) · 10.35
- DNN Based Speaker Recognition On Short Utterances (2016) · 0.00
- Text-independent Speaker Verification Based On Deep Neural Networks And Segmental Dynamic Time Warping (2018) · 3.58
- I-vector Transformation Using Conditional Generative Adversarial Networks For Short Utterance Speaker Verification (2018) · 8.35
- Multi-task Learning With High-order Statistics For X-vector Based Text-independent Speaker Verification (2019) · 8.35
- System Combination For Short Utterance Speaker Recognition (2016) · 5.84
- Bayesian X-vector: Bayesian Neural Network Based X-vector System For Speaker Verification (2020) · 6.77
- Linear Regression For Speaker Verification (2018) · 0.00