Convolutional Recurrent Neural Network Based Progressive Learning For Monaural Speech Enhancement
2019 Β· Andong Li, Minmin Yuan, Chengshi Zheng, et al.
Abstract
Recently, progressive learning has shown its capacity to improve speech quality and speech intelligibility when it is combined with deep neural network (DNN) and long short-term memory (LSTM) based monaural speech enhancement algorithms, especially in low signal-to-noise ratio (SNR) conditions. Nevertheless, due to a large number of parameters and high computational complexity, it is hard to implement in current resource-limited micro-controllers and thus, it is essential to significantly reduce both the number of parameters and the computational load for practical applications. For this purpose, we propose a novel progressive learning framework with causal convolutional recurrent neural networks called PL-CRNN, which takes advantage of both convolutional neural networks and recurrent neural networks to drastically reduce the number of parameters and simultaneously improve speech quality and speech intelligibility. Numerous experiments verify the effectiveness of the proposed PL-CRNN m
Authors
(none)
Tags
Stats
Related papers
- DCCRN: Deep Complex Convolution Recurrent Network For Phase-aware Speech Enhancement (2020)20.78
- Monaural Speech Enhancement Using A Multi-branch Temporal Convolutional Network (2019)3.58
- Real-time Monaural Speech Enhancement With Short-time Discrete Cosine Transform (2021)0.00
- Rethinking Complex-valued Deep Neural Networks For Monaural Speech Enhancement (2023)6.77
- Constrained Convolutional-recurrent Networks To Improve Speech Quality With Low Impact On Recognition Accuracy (2018)5.24
- Progressive Speech Enhancement With Residual Connections (2019)5.24
- Snr-progressive Model With Harmonic Compensation For Low-snr Speech Enhancement (2024)4.52
- DPCRN: Dual-path Convolution Recurrent Network For Single Channel Speech Enhancement (2021)14.35