Speech Dereverberation With Context-aware Recurrent Neural Networks
2017 Β· Joao Felipe Santos, Tiago H. Falk
Abstract
In this paper, we propose a model to perform speech dereverberation by estimating its spectral magnitude from the reverberant counterpart. Our models are capable of extracting features that take into account both short and long-term dependencies in the signal through a convolutional encoder (which extracts features from a short, bounded context of frames) and a recurrent neural network for extracting long-term information. Our model outperforms a recently proposed model that uses different context information depending on the reverberation time, without requiring any sort of additional input, yielding improvements of up to 0.4 on PESQ, 0.3 on STOI, and 1.0 on POLQA relative to reverberant speech. We also show our model is able to generalize to real room impulse responses even when only trained with simulated room impulse responses, different speakers, and high reverberation times. Lastly, listening tests show the proposed method outperforming benchmark models in reduction of perceived
Authors
(none)
Tags
Stats
Related papers
- Convolutive Prediction For Monaural Speech Dereverberation And Noisy-reverberant Speaker Separation (2021)11.39
- Tecanet: Temporal-contextual Attention Network For Environment-aware Speech Dereverberation (2021)7.50
- Speech Dereverberation Using Nonnegative Convolutive Transfer Function And Spectro Temporal Modeling (2017)10.48
- Deep Learning Based Dereverberation Of Temporal Envelopesfor Robust Speech Recognition (2020)5.84
- Speech Dereverberation Using Fully Convolutional Networks (2018)13.34
- RVAE-EM: Generative Speech Dereverberation Based On Recurrent Variational Auto-encoder And Convolutive Transfer Function (2023)7.50
- Speech Enhancement With Wide Residual Networks In Reverberant Environments (2019)0.00
- Neural Network-augmented Kalman Filtering For Robust Online Speech Dereverberation In Noisy Reverberant Environments (2022)0.00