Cross-lingual Speaker Verification With Deep Feature Learning
2017 Β· Lantian Li, Dong Wang, Askar Rozi, et al.
Abstract
Existing speaker verification (SV) systems often suffer from performance degradation if there is any language mismatch between model training, speaker enrollment, and test. A major cause of this degradation is that most existing SV methods rely on a probabilistic model to infer the speaker factor, so any significant change on the distribution of the speech signal will impact the inference. Recently, we proposed a deep learning model that can learn how to extract the speaker factor by a deep neural network (DNN). By this feature learning, an SV system can be constructed with a very simple back-end model. In this paper, we investigate the robustness of the feature-based SV system in situations with language mismatch. Our experiments were conducted on a complex cross-lingual scenario, where the model training was in English, and the enrollment and test were in Chinese or Uyghur. The experiments demonstrated that the feature-based system outperformed the i-vector system with a large margin
Authors
(none)
Tags
Stats
Related papers
- Feature Enhancement With Deep Feature Losses For Speaker Verification (2019)10.61
- Speaker Verification In Multi-speaker Environments Using Temporal Feature Fusion (2022)0.00
- Tackling The Score Shift In Cross-lingual Speaker Verification By Exploiting Language Information (2021)3.58
- Deep Speaker Feature Learning For Text-independent Speaker Verification (2017)12.54
- Coupling A Generative Model With A Discriminative Learning Framework For Speaker Verification (2021)5.24
- Deep Speaker Verification: Do We Need End To End? (2017)7.50
- How To Leverage Dnn-based Speech Enhancement For Multi-channel Speaker Verification? (2022)0.00
- Efficient Black-box Speaker Verification Model Adaptation With Reprogramming And Backend Learning (2023)0.00