Dbnet: A Dual-branch Network Architecture Processing On Spectrum And Waveform For Single-channel Speech Enhancement
2021 Β· Kanghao Zhang, Shulin He, Hao Li, et al.
Abstract
In real acoustic environment, speech enhancement is an arduous task to improve the quality and intelligibility of speech interfered by background noise and reverberation. Over the past years, deep learning has shown great potential on speech enhancement. In this paper, we propose a novel real-time framework called DBNet which is a dual-branch structure with alternate interconnection. Each branch incorporates an encoder-decoder architecture with skip connections. The two branches are responsible for spectrum and waveform modeling, respectively. A bridge layer is adopted to exchange information between the two branches. Systematic evaluation and comparison show that the proposed system substantially outperforms related algorithms under very challenging environments. And in INTERSPEECH 2021 Deep Noise Suppression (DNS) challenge, the proposed system ranks the top 8 in real-time track 1 in terms of the Mean Opinion Score (MOS) of the ITU-T P.835 framework.
Authors
(none)
Tags
Stats
Related papers
- Dmf-net: A Decoupling-style Multi-band Fusion Model For Full-band Speech Enhancement (2022)7.16
- Dbt-net: Dual-branch Federative Magnitude And Phase Estimation With Attention-in-attention Transformer For Monaural Speech Enhancement (2022)12.47
- Desnet: A Multi-channel Network For Simultaneous Speech Dereverberation, Enhancement And Separation (2020)9.59
- Dpt-fsnet: Dual-path Transformer Based Full-band And Sub-band Fusion Network For Speech Enhancement (2021)0.00
- Run-time Adaptation Of Neural Beamforming For Robust Speech Dereverberation And Denoising (2024)0.00
- Thlnet: Two-stage Heterogeneous Lightweight Network For Monaural Speech Enhancement (2023)0.00
- DBNET: Doa-driven Beamforming Network For End-to-end Farfield Sound Source Separation (2020)0.00
- Ednet: A Versatile Speech Enhancement Framework With Gating Mamba Mechanism And Phase Shift-invariant Training (2025)0.00