An Improved DNN-based Spectral Feature Mapping That Removes Noise And Reverberation For Robust Automatic Speech Recognition
2018 · Juan Pablo Escudero, José Novoa, Rodrigo Mahu, et al.
Abstract
Reverberation and additive noise have detrimental effects on the performance of automatic speech recognition systems. In this paper we explore the ability of a DNN-based spectral feature mapping to remove the effects of reverberation and additive noise. Experiments with the CHiME-2 database show that this DNN can achieve an average reduction in WER of 4.5%, when compared to the baseline system, at SNRs equal to -6 dB, -3 dB, 0 dB and 3 dB, and just 0.8% at greater SNRs of 6 dB and 9 dB. These results suggest that this DNN is more effective in removing additive noise than reverberation. To improve the DNN performance, we combine it with the weighted prediction error (WPE) method that shows a complementary behavior. While this combination provided a reduction in WER of approximately 11% when compared with the baseline, the observed improvement is not as great as that obtained using WPE alone. However, modifications to the DNN training process were applied and an average reduction in WER
Related papers
- An Exploration Of Mimic Architectures For Residual Network Based Spectral Mapping (2018)
- Exploring The Robustness Of Features And Enhancement On Speech Recognition Systems In Highly-reverberant Real Environments (2018)
- Integrated Speech Enhancement Method Based On Weighted Prediction Error And DNN For Dereverberation And Denoising (2017)
- On Combining Features For Single-channel Robust Speech Recognition In Reverberant Environments (2019)
- Neural Network-augmented Kalman Filtering For Robust Online Speech Dereverberation In Noisy Reverberant Environments (2022)
- Speaker Reinforcement Using Target Source Extraction For Robust Automatic Speech Recognition (2022)
- Run-time Adaptation Of Neural Beamforming For Robust Speech Dereverberation And Denoising (2024)
- On The Use Of DNN Autoencoder For Robust Speaker Recognition (2018)