Concatenated Identical DNN (CI-DNN) To Reduce Noise-type Dependence In Dnn-based Speech Enhancement
2018 Β· Ziyi Xu, Maximilian Strake, Tim Fingscheidt
Abstract
Estimating time-frequency domain masks for speech enhancement using deep learning approaches has recently become a popular field of research. In this paper, we propose a mask-based speech enhancement framework by using concatenated identical deep neural networks (CI-DNNs). The idea is that a single DNN is trained under multiple input and output signal-to-noise power ratio (SNR) conditions, using targets that provide a moderate SNR gain with respect to the input and therefore achieve a balance between speech component quality and noise suppression. We concatenate this single DNN several times without any retraining to provide enough noise attenuation. Simulation results show that our proposed CI-DNN outperforms enhancement methods using classical spectral weighting rules w.r.t. total speech quality and speech intelligibility. Moreover, our approach shows similar or even a little bit better performance with much fewer trainable parameters compared with a noisy-target single DNN approach
Authors
(none)
Tags
Stats
Related papers
- Deep Interaction Between Masking And Mapping Targets For Single-channel Speech Enhancement (2021)0.00
- Multi-modal Hybrid Deep Neural Network For Speech Enhancement (2016)0.00
- Incorporating Multi-target In Multi-stage Speech Enhancement Model For Better Generalization (2021)0.00
- A Dual-staged Context Aggregation Method Towards Efficient End-to-end Speech Enhancement (2019)0.00
- Consistency-aware Multi-channel Speech Enhancement Using Deep Neural Networks (2020)0.00
- DCCRN+: Channel-wise Subband DCCRN With SNR Estimation For Speech Enhancement (2021)0.00
- Real-time Monaural Speech Enhancement With Short-time Discrete Cosine Transform (2021)0.00
- Df-conformer: Integrated Architecture Of Conv-tasnet And Conformer Using Linear Complexity Self-attention For Speech Enhancement (2021)11.29