Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild
2020 · Phani Sankar Nidadavolu, Saurabh Kataria, Paola García-Perera, et al.
Abstract
We investigated an enhancement and a domain adaptation approach to make speaker verification systems robust to perturbations of far-field speech. In the enhancement approach, using paired (parallel) reverberant-clean speech, we trained a supervised Generative Adversarial Network (GAN) along with a feature mapping loss. For the domain adaptation approach, we trained a Cycle Consistent Generative Adversarial Network (CycleGAN), which maps features from far-field domain to the speaker embedding training domain. This was trained on unpaired data in an unsupervised manner. Both networks, termed Supervised Enhancement Network (SEN) and Domain Adaptation Network (DAN) respectively, were trained with multi-task objectives in (filter-bank) feature domain. On a simulated test setup, we first note the benefit of using feature mapping (FM) loss along with adversarial loss in SEN. Then, we tested both supervised and unsupervised approaches on several real noisy datasets. We observed relative improv
Authors
(none)
Tags
Stats
Related papers
- Feature Enhancement With Deep Feature Losses For Speaker Verification (2019)10.61
- Unsupervised Feature Enhancement For Speaker Verification (2019)5.84
- How To Leverage Dnn-based Speech Enhancement For Multi-channel Speaker Verification? (2022)0.00
- Generative Adversarial Speaker Embedding Networks For Domain Robust End-to-end Speaker Verification (2018)0.00
- Parameterized Channel Normalization For Far-field Deep Speaker Verification (2021)3.58
- Multi-channel Speaker Verification For Single And Multi-talker Speech (2020)0.00
- Adaptive Data Augmentation With Naturalspeech3 For Far-field Speaker Verification (2025)0.00
- Improved Far-field Speech Recognition Using Joint Variational Autoencoder (2022)0.00