Explicit Estimation Of Magnitude And Phase Spectra In Parallel For High-quality Speech Enhancement
2023 Β· Ye-Xin Lu, Yang Ai, Zhen-Hua Ling
Abstract
Phase information has a significant impact on speech perceptual quality and intelligibility. However, existing speech enhancement methods encounter limitations in explicit phase estimation due to the non-structural nature and wrapping characteristics of the phase, leading to a bottleneck in enhanced speech quality. To overcome the above issue, in this paper, we proposed MP-SENet, a novel Speech Enhancement Network that explicitly enhances Magnitude and Phase spectra in parallel. The proposed MP-SENet comprises a Transformer-embedded encoder-decoder architecture. The encoder aims to encode the input distorted magnitude and phase spectra into time-frequency representations, which are further fed into time-frequency Transformers for alternatively capturing time and frequency dependencies. The decoder comprises a magnitude mask decoder and a phase decoder, directly enhancing magnitude and wrapped phase spectra by incorporating a magnitude masking architecture and a phase parallel estimatio
Authors
(none)
Tags
Stats
Related papers
- Mp-senet: A Speech Enhancement Model With Parallel Denoising Of Magnitude And Phase Spectra (2023)15.51
- Magnitude-and-phase-aware Speech Enhancement With Parallel Sequence Modeling (2023)3.58
- Magnitude-phase Dual-path Speech Enhancement Network Based On Self-supervised Embedding And Perceptual Contrast Stretch Boosting (2025)3.21
- PHASEN: A Phase-and-harmonics-aware Speech Enhancement Network (2019)18.20
- Percepnet+: A Phase And SNR Aware Percepnet For Real-time Speech Enhancement (2022)9.23
- Dbt-net: Dual-branch Federative Magnitude And Phase Estimation With Attention-in-attention Transformer For Monaural Speech Enhancement (2022)12.47
- Phase-aware Speech Enhancement With Deep Complex U-net (2019)0.00
- Phase Aware Speech Enhancement Using Realisation Of Complex-valued LSTM (2020)0.00