Deep Griffin-Lim Iteration
2019 · Yoshiki Masuyama, Kohei Yatabe, Yuma Koizumi, et al.
Abstract
This paper presents a novel method for reconstructing phase from a given amplitude spectrogram alone, combining a signal-processing-based approach with a deep neural network (DNN). Retrieving a time-domain signal from its amplitude spectrogram requires the corresponding phase. One popular phase reconstruction method is the Griffin-Lim algorithm (GLA), which exploits the redundancy of the short-time Fourier transform. However, GLA often requires many iterations and produces low-quality signals because it uses no prior knowledge of the target signal. To address these issues, we propose an architecture that stacks sub-blocks, each consisting of two GLA-inspired fixed layers and a DNN. The number of stacked sub-blocks is adjustable, so performance can be traded off against computational load according to the requirements of the application. The effectiveness of the proposed method is investigated by reconstructing phases from amplitude spectrograms of speech.
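The structure described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation. The two fixed layers are taken to be the usual GLA projections (onto the given amplitude, then onto spectrograms that are the STFT of some real signal). The abstract does not say how the DNN output enters the sub-block, so the sketch assumes it is subtracted as a residual correction. `zero_residual` is a hypothetical stand-in for a trained network, which makes each sub-block reduce to one plain GLA iteration. All function names, the Hann window, and the frame and hop sizes are illustrative choices.

```python
import numpy as np

def stft(x, win, hop):
    """Windowed frames -> complex spectrogram of shape (frames, bins)."""
    n = len(win)
    starts = range(0, len(x) - n + 1, hop)
    return np.fft.fft(np.array([x[s:s + n] * win for s in starts]), axis=1)

def istft(X, win, hop, length):
    """Least-squares inverse STFT: windowed overlap-add, then normalize."""
    n = len(win)
    frames = np.fft.ifft(X, axis=1).real * win
    x, norm = np.zeros(length), np.zeros(length)
    for k, frame in enumerate(frames):
        x[k * hop:k * hop + n] += frame
        norm[k * hop:k * hop + n] += win ** 2
    return x / np.maximum(norm, 1e-8)

def project_amplitude(X, A):
    """Fixed layer 1: keep the phase of X, impose the given amplitude A."""
    return A * np.exp(1j * np.angle(X))

def project_consistent(X, win, hop, length):
    """Fixed layer 2: nearest spectrogram that is the STFT of a real signal."""
    return stft(istft(X, win, hop, length), win, hop)

def zero_residual(X, Y, Z):
    """Hypothetical placeholder for the trained DNN: no correction at all."""
    return np.zeros_like(Z)

def sub_block(X, A, win, hop, length, dnn=zero_residual):
    """One sub-block: two fixed layers followed by a DNN correction."""
    Y = project_amplitude(X, A)
    Z = project_consistent(Y, win, hop, length)
    return Z - dnn(X, Y, Z)  # ASSUMPTION: DNN output is a subtracted residual

def reconstruct(A, win, hop, length, n_blocks=50, seed=0):
    """Stack n_blocks sub-blocks, starting from a random phase."""
    rng = np.random.default_rng(seed)
    X = A * np.exp(2j * np.pi * rng.random(A.shape))
    for _ in range(n_blocks):
        X = sub_block(X, A, win, hop, length)
    return X

def amplitude_error(X, A):
    """Distance between the amplitude of X and the target amplitude."""
    return np.linalg.norm(np.abs(X) - A)

# Toy usage: recover a signal from the amplitude of a two-tone test signal.
win = np.hanning(65)[:-1]            # periodic Hann window, 64 samples
hop, length = 16, 528                # 30 frames cover the signal exactly
t = np.arange(length)
signal = np.sin(0.2 * t) + 0.5 * np.sin(0.05 * t)
A = np.abs(stft(signal, win, hop))   # the phase is discarded here
X = reconstruct(A, win, hop, length, n_blocks=50)
estimate = istft(X, win, hop, length)
```

Raising or lowering `n_blocks` is the performance versus computation trade-off mentioned in the abstract. Replacing `zero_residual` with a trained model is where prior knowledge of the target signal would enter.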
Related papers
- Phase Reconstruction From Amplitude Spectrograms Based On von-Mises-distribution Deep Neural Network (2018) 11.85
- Phase Reconstruction Based On Recurrent Phase Unwrapping With Deep Neural Networks (2020) 9.59
- Generative Adversarial Network-based Approach To Signal Reconstruction From Magnitude Spectrograms (2018) 10.97
- Deep Learning Based Phase Reconstruction For Speaker Separation: A Trigonometric Perspective (2018) 13.34
- PHASEN: A Phase-and-harmonics-aware Speech Enhancement Network (2019) 18.20
- Neural Speech Phase Prediction Based On Parallel Estimation Architecture And Anti-wrapping Losses (2022) 11.39
- End-to-end Speech Separation With Unfolded Iterative Phase Reconstruction (2018) 15.00
- DCCRN: Deep Complex Convolution Recurrent Network For Phase-aware Speech Enhancement (2020) 20.78