Joint Magnitude Estimation And Phase Recovery Using Cycle-in-cycle GAN For Non-parallel Speech Enhancement
2021 Β· Guochen Yu, Andong Li, Yutian Wang, et al.
Abstract
For the lack of adequate paired noisy-clean speech corpus in many real scenarios, non-parallel training is a promising task for DNN-based speech enhancement methods. However, because of the severe mismatch between input and target speeches, many previous studies only focus on the magnitude spectrum estimation and remain the phase unaltered, resulting in the degraded speech quality under low signal-to-noise ratio conditions. To tackle this problem, we decouple the difficult target w.r.t. original spectrum optimization into spectral magnitude and phase, and a novel Cycle-in-Cycle generative adversarial network (dubbed CinCGAN) is proposed to jointly estimate the spectral magnitude and phase information stage by stage under unpaired data. In the first stage, we pretrain a magnitude CycleGAN to coarsely estimate the spectral magnitude of clean speech. In the second stage, we incorporate the pretrained CycleGAN with a complex-valued CycleGAN as a cycle-in-cycle structure to simultaneously r
Authors
(none)
Tags
Stats
Related papers
- Speech Enhancement Based On Cyclegan With Noise-informed Training (2021)5.84
- DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network For Speech Enhancement (2020)0.00
- Conditional Generative Adversarial Networks For Speech Enhancement And Noise-robust Speaker Verification (2017)16.03
- Dynamic Attention Based Generative Adversarial Network With Phase Post-processing For Speech Enhancement (2020)0.00
- A Multi-discriminator Cyclegan For Unsupervised Non-parallel Speech Domain Adaptation (2018)9.76
- High-quality Nonparallel Voice Conversion Based On Cycle-consistent Adversarial Network (2018)0.00
- Tdcgan: Temporal Dilated Convolutional Generative Adversarial Network For End-to-end Speech Enhancement (2020)0.00
- Towards Generalized Speech Enhancement With Generative Adversarial Networks (2019)10.35