Speech Dereverberation Using Nonnegative Convolutive Transfer Function And Spectro Temporal Modeling
2017 Β· Nasser Mohammadiha, Simon Doclo
Abstract
This paper presents two single channel speech dereverberation methods to enhance the quality of speech signals that have been recorded in an enclosed space. For both methods, the room acoustics are modeled using a nonnegative approximation of the convolutive transfer function (NCTF), and to additionally exploit the spectral properties of the speech signal, such as the low rank nature of the speech spectrogram, the speech spectrogram is modeled using nonnegative matrix factorization (NMF). Two methods are described to combine the NCTF and NMF models. In the first method, referred to as the integrated method, a cost function is constructed by directly integrating the speech NMF model into the NCTF model, while in the second method, referred to as the weighted method, the NCTF and NMF based cost functions are weighted and summed. Efficient update rules are derived to solve both optimization problems. In addition, an extension of the integrated method is presented, which exploits the tempo
Authors
(none)
Tags
Stats
Related papers
- Speech Dereverberation With Context-aware Recurrent Neural Networks (2017)10.35
- Speech Dereverberation Using Fully Convolutional Networks (2018)13.34
- Utterance Weighted Multi-dilation Temporal Convolutional Networks For Monaural Speech Dereverberation (2022)7.50
- Convolutive Prediction For Monaural Speech Dereverberation And Noisy-reverberant Speaker Separation (2021)11.39
- Receptive Field Analysis Of Temporal Convolutional Networks For Monaural Speech Dereverberation (2022)6.34
- Deep Learning Based Dereverberation Of Temporal Envelopesfor Robust Speech Recognition (2020)5.84
- Audio-visual Speech Separation And Dereverberation With A Two-stage Multimodal Network (2019)12.47
- Deep Convolutional Neural Network-based Inverse Filtering Approach For Speech De-reverberation (2020)7.16