On Bottleneck Features For Text-dependent Speaker Verification Using X-vectors
2020 Β· Achintya Kumar Sarkar, Zheng-Hua Tan
Abstract
Applying x-vectors for speaker verification has recently attracted great interest, with the focus being on text-independent speaker verification. In this paper, we study x-vectors for text-dependent speaker verification (TD-SV), which remains unexplored. We further investigate the impact of the different bottleneck (BN) features on the performance of x-vectors, including the recently-introduced time-contrastive-learning (TCL) BN features and phone-discriminant BN features. TCL is a weakly supervised learning approach that constructs training data by uniformly partitioning each utterance into a predefined number of segments and then assigning each segment a class label depending on their position in the utterance. We also compare TD-SV performance for different modeling techniques, including the Gaussian mixture models-universal background model (GMM-UBM), i-vector, and x-vector. Experiments are conducted on the RedDots 2016 challenge database. It is found that the type of features has
Authors
(none)
Tags
Stats
Related papers
- Time-contrastive Learning Based DNN Bottleneck Features For Text-dependent Speaker Verification (2017)9.92
- Time-contrastive Learning Based Deep Bottleneck Features For Text-dependent Speaker Verification (2019)9.92
- Comparison Of Multiple Features And Modeling Methods For Text-dependent Speaker Verification (2017)0.00
- Vocal Tract Length Perturbation For Text-dependent Speaker Verification With Autoregressive Prediction Coding (2020)8.09
- Generative X-vectors For Text-independent Speaker Verification (2018)7.16
- Gaussian Speaker Embedding Learning For Text-independent Speaker Verification (2020)0.00
- P-vectors: A Parallel-coupled Tdnn/transformer Network For Speaker Verification (2023)5.84
- Probing The Information Encoded In X-vectors (2019)13.23