Skipconvgan: Monaural Speech Dereverberation Using Generative Adversarial Networks Via Complex Time-frequency Masking
2022 Β· Vinay Kothapally, J. H. L. Hansen
Abstract
With the advancements in deep learning approaches, the performance of speech enhancing systems in the presence of background noise have shown significant improvements. However, improving the system's robustness against reverberation is still a work in progress, as reverberation tends to cause loss of formant structure due to smearing effects in time and frequency. A wide range of deep learning-based systems either enhance the magnitude response and reuse the distorted phase or enhance complex spectrogram using a complex time-frequency mask. Though these approaches have demonstrated satisfactory performance, they do not directly address the lost formant structure caused by reverberation. We believe that retrieving the formant structure can help improve the efficiency of existing systems. In this study, we propose SkipConvGAN - an extension of our prior work SkipConvNet. The proposed system's generator network tries to estimate an efficient complex time-frequency mask, while the discrimi
Authors
(none)
Tags
Stats
Related papers
- DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network For Speech Enhancement (2020)0.00
- Investigating Generative Adversarial Networks Based Speech Dereverberation For Robust Speech Recognition (2018)10.74
- Skipconvnet: Skip Convolutional Neural Network For Speech Dereverberation Using Optimally Smoothed Spectral Mapping (2020)10.21
- Towards Generalized Speech Enhancement With Generative Adversarial Networks (2019)10.35
- Exploring Speech Enhancement With Generative Adversarial Networks For Robust Speech Recognition (2017)16.14
- CMGAN: Conformer-based Metric-gan For Monaural Speech Enhancement (2022)14.80
- CMGAN: Conformer-based Metric GAN For Speech Enhancement (2022)15.13
- SEGAN: Speech Enhancement Generative Adversarial Network (2017)21.85