Tdcgan: Temporal Dilated Convolutional Generative Adversarial Network For End-to-end Speech Enhancement
2020 Β· Shuaishuai Ye, Xinhui Hu, Xinkang Xu
Abstract
In this paper, in order to further deal with the performance degradation caused by ignoring the phase information in conventional speech enhancement systems, we proposed a temporal dilated convolutional generative adversarial network (TDCGAN) in the end-to-end based speech enhancement architecture. For the first time, we introduced the temporal dilated convolutional network with depthwise separable convolutions into the GAN structure so that the receptive field can be greatly increased without increasing the number of parameters. We also first explored the effect of signal-to-noise ratio (SNR) penalty item as regularization of the loss function of generator on improving the SNR of enhanced speech. The experimental results demonstrated that our proposed method outperformed the state-of-the-art end-to-end GAN-based speech enhancement. Moreover, compared with previous GAN-based methods, the proposed TDCGAN could greatly decreased the number of parameters. As expected, the work also demons
Authors
(none)
Tags
Stats
Related papers
- DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network For Speech Enhancement (2020)0.00
- Dynamic Attention Based Generative Adversarial Network With Phase Post-processing For Speech Enhancement (2020)0.00
- SEGAN: Speech Enhancement Generative Adversarial Network (2017)21.85
- Towards Generalized Speech Enhancement With Generative Adversarial Networks (2019)10.35
- CMGAN: Conformer-based Metric-gan For Monaural Speech Enhancement (2022)14.80
- Gan-based Speech Enhancement For Low SNR Using Latent Feature Conditioning (2024)5.24
- CMGAN: Conformer-based Metric GAN For Speech Enhancement (2022)15.13
- Multi-metric Optimization Using Generative Adversarial Networks For Near-end Speech Intelligibility Enhancement (2021)8.60