Investigating Generative Adversarial Networks Based Speech Dereverberation For Robust Speech Recognition
2018 · Ke Wang, Junbo Zhang, Sining Sun, et al.
Abstract
We investigate the use of generative adversarial networks (GANs) in speech dereverberation for robust speech recognition. GANs have recently been studied for speech enhancement to remove additive noise, but their ability in speech dereverberation has not yet been examined, and the advantages of using GANs have not been fully established. In this paper, we provide an in-depth investigation of a GAN-based dereverberation front-end for automatic speech recognition (ASR). First, we study the effectiveness of different dereverberation networks (the generator in the GAN) and find that an LSTM leads to a significant improvement over feed-forward DNNs and CNNs on our dataset. Second, adding residual connections to the deep LSTMs further improves performance. Finally, we find that, for the GAN to succeed, it is important to update the generator and the discriminator using the same mini-batch of data during training. Moreover, using the reverberant spectrogram as a condition to the discriminator, as suggested
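The same-mini-batch finding in the abstract can be illustrated with a minimal scheduling sketch. This is not the authors' training code: the names `same_batch_schedule`, `alternating_batch_schedule`, `train_gan`, `update_d`, and `update_g` are hypothetical, and the update callbacks stand in for whatever discriminator and generator optimization steps a real system would use. The alternating schedule is included only as a contrast and is an assumption about what "not the same mini-batch" could look like.

```python
from typing import Callable, Iterable, Iterator, Tuple, TypeVar

Batch = TypeVar("Batch")
Schedule = Callable[[Iterable[Batch]], Iterator[Tuple[Batch, Batch]]]


def same_batch_schedule(batches: Iterable[Batch]) -> Iterator[Tuple[Batch, Batch]]:
    """Pair each mini-batch with itself: D and G are updated on the same data."""
    for batch in batches:
        yield batch, batch


def alternating_batch_schedule(batches: Iterable[Batch]) -> Iterator[Tuple[Batch, Batch]]:
    """Contrast case (assumed): D and G are updated on consecutive, different batches."""
    missing = object()  # sentinel, so a batch that is None is still handled
    it = iter(batches)
    for d_batch in it:
        g_batch = next(it, missing)
        if g_batch is missing:
            break  # odd number of batches: drop the unpaired last one
        yield d_batch, g_batch


def train_gan(
    batches: Iterable[Batch],
    update_d: Callable[[Batch], None],
    update_g: Callable[[Batch], None],
    schedule: Schedule,
) -> None:
    """Run one pass over the data, updating D first and then G for each pair."""
    for d_batch, g_batch in schedule(batches):
        update_d(d_batch)  # hypothetical discriminator step
        update_g(g_batch)  # hypothetical generator step
```

Passing `same_batch_schedule` to `train_gan` gives the setting the abstract reports as important; swapping in `alternating_batch_schedule` changes only which data each network sees, so the two settings can be compared with everything else held fixed.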
Related papers
- Exploring Speech Enhancement With Generative Adversarial Networks For Robust Speech Recognition (2017) 16.14
- Robust Speech Recognition Using Generative Adversarial Networks (2017) 11.29
- Dynamic Attention Based Generative Adversarial Network With Phase Post-processing For Speech Enhancement (2020) 0.00
- Single-channel Speech Dereverberation Via Generative Adversarial Training (2018) 8.09
- DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network For Speech Enhancement (2020) 0.00
- Towards Generalized Speech Enhancement With Generative Adversarial Networks (2019) 10.35
- Channel-aware Domain-adaptive Generative Adversarial Network For Robust Speech Recognition (2024) 4.52
- Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks (2017) 16.21