Analysis Of DNN Speech Signal Enhancement For Robust Speaker Recognition
2018 · Ondrej Novotny, Oldrich Plchot, Ondrej Glembek, et al.
Abstract
In this work, we present an analysis of a DNN-based autoencoder for speech enhancement, dereverberation and denoising. The target application is a robust speaker verification (SV) system. We begin by carefully designing a data augmentation process to cover a wide range of acoustic conditions and obtain rich training data for various components of our SV system. We augment several well-known databases used in SV with artificially noised and reverberated data, and we use them to train a denoising autoencoder (mapping noisy and reverberated speech to its clean version) as well as an x-vector extractor, which is currently considered state-of-the-art in SV. Later, we use the autoencoder as a preprocessing step for a text-independent SV system. We compare results achieved with autoencoder enhancement, multi-condition PLDA training and their simultaneous use. We present a detailed analysis with various conditions of NIST SRE 2010, 2016, PRISM and with re-transmitted data. We conclud
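The abstract describes building (noisy and reverberated, clean) training pairs for the denoising autoencoder. The sketch below illustrates one common way such a pair could be generated: convolve clean speech with a room impulse response, then add noise scaled to a target signal-to-noise ratio. It is not the authors' pipeline. The function names, the toy impulse response, and the choice to measure SNR against the reverberated signal are all assumptions made for illustration.

```python
import math
import random


def convolve(signal, impulse_response):
    """Full linear convolution; simulates passing speech through a room."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def power(x):
    """Mean squared amplitude of a sequence of samples."""
    return sum(v * v for v in x) / len(x)


def add_noise_at_snr(speech, noise, snr_db):
    """Scale noise so the speech-to-noise power ratio equals snr_db, then add it."""
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    target_noise_power = power(speech) / (10 ** (snr_db / 10))
    gain = math.sqrt(target_noise_power / power(noise))
    return [s + gain * n for s, n in zip(speech, noise)]


def make_training_pair(clean, noise, rir, snr_db):
    """Return (degraded input, clean target) for a denoising autoencoder.

    Assumption: reverberation is applied first, and the SNR is defined
    relative to the reverberated speech.
    """
    reverberated = convolve(clean, rir)[:len(clean)]
    noisy = add_noise_at_snr(reverberated, noise, snr_db)
    return noisy, clean


# Toy usage with synthetic signals (hypothetical values, not real speech).
rng = random.Random(0)
clean = [math.sin(2 * math.pi * 5 * t / 200) for t in range(200)]
noise = [rng.gauss(0.0, 1.0) for _ in range(200)]
rir = [1.0, 0.0, 0.5, 0.0, 0.25]  # toy impulse response with two echoes
noisy, target = make_training_pair(clean, noise, rir, snr_db=10.0)
```

In a real system the signals would be speech waveforms or spectral features, and the noise recordings and impulse responses would be drawn from a corpus so that many acoustic conditions are covered.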