Investigation Of Using VAE For I-vector Speaker Verification
2017 Β· Timur Pekhovsky, Maxim Korenevsky
Abstract
New system for i-vector speaker recognition based on variational autoencoder (VAE) is investigated. VAE is a promising approach for developing accurate deep nonlinear generative models of complex data. Experiments show that VAE provides speaker embedding and can be effectively trained in an unsupervised manner. LLR estimate for VAE is developed. Experiments on NIST SRE 2010 data demonstrate its correctness. Additionally, we show that the performance of VAE-based system in the i-vectors space is close to that of the diagonal PLDA. Several interesting results are also observed in the experiments with \(\beta\)-VAE. In particular, we found that for \(\beta\ll 1\), VAE can be trained to capture the features of complex input data distributions in an effective way, which is hard to obtain in the standard VAE (\(\beta=1\)).
Authors
(none)
Tags
Stats
Related papers
- End-to-end DNN Based Speaker Recognition Inspired By I-vector And PLDA (2017)10.35
- Vae-based Regularization For Deep Speaker Embedding (2019)8.09
- Discriminatively Re-trained I-vector Extractor For Speaker Recognition (2018)5.84
- Vae-based Domain Adaptation For Speaker Verification (2019)7.50
- Factorization Of Discriminatively Trained I-vector Extractor For Speaker Recognition (2019)0.00
- I-vector Transformation Using Conditional Generative Adversarial Networks For Short Utterance Speaker Verification (2018)8.35
- Generative X-vectors For Text-independent Speaker Verification (2018)7.16
- Deep Neural Network Based I-vector Mapping For Speaker Verification Using Short Utterances (2018)0.00