Exploring The Use Of An Unsupervised Autoregressive Model As A Shared Encoder For Text-dependent Speaker Verification
2020 Β· Vijay Ravi, Ruchao Fan, Amber Afshan, et al.
Abstract
In this paper, we propose a novel way of addressing text-dependent automatic speaker verification (TD-ASV) by using a shared-encoder with task-specific decoders. An autoregressive predictive coding (APC) encoder is pre-trained in an unsupervised manner using both out-of-domain (LibriSpeech, VoxCeleb) and in-domain (DeepMine) unlabeled datasets to learn generic, high-level feature representation that encapsulates speaker and phonetic content. Two task-specific decoders were trained using labeled datasets to classify speakers (SID) and phrases (PID). Speaker embeddings extracted from the SID decoder were scored using a PLDA. SID and PID systems were fused at the score level. There is a 51.9% relative improvement in minDCF for our system compared to the fully supervised x-vector baseline on the cross-lingual DeepMine dataset. However, the i-vector/HMM method outperformed the proposed APC encoder-decoder system. A fusion of the x-vector/PLDA baseline and the SID/PLDA scores prior to PID fu
Authors
(none)
Tags
Stats
Related papers
- Data Generation Using Pass-phrase-dependent Deep Auto-encoders For Text-dependent Speaker Verification (2021)0.00
- The SVASR System For Text-dependent Speaker Verification (tdsv) AAIC Challenge 2024 (2024)0.00
- Vocal Tract Length Perturbation For Text-dependent Speaker Verification With Autoregressive Prediction Coding (2020)8.09
- Joint Training Or Not: An Exploration Of Pre-trained Speech Models In Audio-visual Speaker Diarization (2023)0.00
- Large-scale Self-supervised Speech Representation Learning For Automatic Speaker Verification (2021)15.25
- Memory-efficient Training For Text-dependent SV With Independent Pre-trained Models (2024)0.00
- Attention Back-end For Automatic Speaker Verification With Multiple Enrollment Utterances (2021)10.21
- Deep Speaker Embedding Learning With Multi-level Pooling For Text-independent Speaker Verification (2019)0.00