PAS: Partial Additive Speech Data Augmentation Method For Noise Robust Speaker Verification
2023 Β· Wonbin Kim, Hyun-Seo Shin, Ju-Ho Kim, et al.
Abstract
Background noise reduces speech intelligibility and quality, making speaker verification (SV) in noisy environments a challenging task. To improve the noise robustness of SV systems, additive noise data augmentation method has been commonly used. In this paper, we propose a new additive noise method, partial additive speech (PAS), which aims to train SV systems to be less affected by noisy environments. The experimental results demonstrate that PAS outperforms traditional additive noise in terms of equal error rates (EER), with relative improvements of 4.64% and 5.01% observed in SE-ResNet34 and ECAPA-TDNN. We also show the effectiveness of proposed method by analyzing attention modules and visualizing speaker embeddings.
Authors
(none)
Tags
Stats
Related papers
- Data Augmentation Enhanced Speaker Enrollment For Text-dependent Speaker Verification (2020)0.00
- Adaptive Data Augmentation With Naturalspeech3 For Far-field Speaker Verification (2025)0.00
- A Joint Noise Disentanglement And Adversarial Training Framework For Robust Speaker Verification (2024)6.34
- Diff-sv: A Unified Hierarchical Framework For Noise-robust Speaker Verification Using Score-based Diffusion Probabilistic Models (2023)6.34
- Unsupervised Feature Enhancement For Speaker Verification (2019)5.84
- Diffusion-based Adversarial Purification For Speaker Verification (2023)6.34
- Obovox Far Field Speaker Recognition: A Novel Data Augmentation Approach With Pretrained Models (2024)0.00
- Exploring Voice Conversion Based Data Augmentation In Text-dependent Speaker Verification (2020)0.00