Deep Interaction Between Masking And Mapping Targets For Single-channel Speech Enhancement
2021 Β· Lu Zhang, Mingjiang Wang, Zehua Zhang, et al.
Abstract
The most recent deep neural network (DNN) models exhibit impressive denoising performance in the time-frequency (T-F) magnitude domain. However, the phase is also a critical component of the speech signal that is easily overlooked. In this paper, we propose a multi-branch dilated convolutional network (DCN) to simultaneously enhance the magnitude and phase of noisy speech. A causal and robust monaural speech enhancement system is achieved based on the multi-objective learning framework of the complex spectrum and the ideal ratio mask (IRM) targets. In the process of joint learning, the intermediate estimation of IRM targets is used as a way of generating feature attention factors to realize the information interaction between the two targets. Moreover, the proposed multi-scale dilated convolution enables the DCN model to have a more efficient temporal modeling capability. Experimental results show that compared with other state-of-the-art models, this model achieves better speech quali
Authors
(none)
Tags
Stats
Related papers
- Incorporating Multi-target In Multi-stage Speech Enhancement Model For Better Generalization (2021)0.00
- Consistency-aware Multi-channel Speech Enhancement Using Deep Neural Networks (2020)0.00
- Distortionless Multi-channel Target Speech Enhancement For Overlapped Speech Recognition (2020)0.00
- Concatenated Identical DNN (CI-DNN) To Reduce Noise-type Dependence In Dnn-based Speech Enhancement (2018)5.24
- PHASEN: A Phase-and-harmonics-aware Speech Enhancement Network (2019)18.20
- Spatial-dccrn: Dccrn Equipped With Frame-level Angle Feature And Hybrid Filtering For Multi-channel Speech Enhancement (2022)5.84
- Multi-modal Hybrid Deep Neural Network For Speech Enhancement (2016)0.00
- D2former: A Fully Complex Dual-path Dual-decoder Conformer Network Using Joint Complex Masking And Complex Spectral Mapping For Monaural Speech Enhancement (2023)0.00