On The Use Of DNN Autoencoder For Robust Speaker Recognition
2018 · Ondrej Novotny, Oldrich Plchot, Pavel Matejka, et al.
Abstract
In this paper, we present an analysis of a DNN-based autoencoder for speech enhancement, dereverberation and denoising. The target application is a robust speaker recognition system. We started with augmenting the Fisher database with artificially noised and reverberated data and we trained the autoencoder to map noisy and reverberated speech to its clean version. We use the autoencoder as a preprocessing step for a state-of-the-art text-independent speaker recognition system. We compare results achieved with pure autoencoder enhancement, multi-condition PLDA training and their simultaneous use. We present a detailed analysis with various conditions of NIST SRE 2010, PRISM and artificially corrupted NIST SRE 2010 telephone condition. We conclude that the proposed preprocessing significantly outperforms the baseline and that this technique can be used to build a robust speaker recognition system for reverberated and noisy data.
Authors
(none)
Tags
Stats
Related papers
- Analysis Of DNN Speech Signal Enhancement For Robust Speaker Recognition (2018)11.39
- End-to-end Recurrent Denoising Autoencoder Embeddings For Speaker Identification (2020)6.34
- Speaker Reinforcement Using Target Source Extraction For Robust Automatic Speech Recognition (2022)7.50
- Deep Learning Based Dereverberation Of Temporal Envelopesfor Robust Speech Recognition (2020)5.84
- Speech Denoising By Parametric Resynthesis (2019)7.16
- Neural Network-augmented Kalman Filtering For Robust Online Speech Dereverberation In Noisy Reverberant Environments (2022)0.00
- Run-time Adaptation Of Neural Beamforming For Robust Speech Dereverberation And Denoising (2024)0.00
- Exploring The Robustness Of Features And Enhancement On Speech Recognition Systems In Highly-reverberant Real Environments (2018)0.00