Schr\"odinger Bridge Mamba For One-step Speech Enhancement
2025 Β· Jing Yang, Sirui Wang, Chao Wu, et al.
Abstract
We present Schr\"odinger Bridge Mamba (SBM), a novel model for efficient speech enhancement by integrating the Schr\"odinger Bridge (SB) training paradigm and the Mamba architecture. Experiments of joint denoising and dereverberation tasks demonstrate SBM outperforms strong generative and discriminative methods on multiple metrics with only one step of inference while achieving a competitive real-time factor for streaming feasibility. Ablation studies reveal that the SB paradigm consistently yields improved performance across diverse architectures over conventional mapping. Furthermore, Mamba exhibits a stronger performance under the SB paradigm compared to Multi-Head Self-Attention (MHSA) and Long Short-Term Memory (LSTM) backbones. These findings highlight the synergy between the Mamba architecture and the SB trajectory-based training, providing a high-quality solution for real-world speech enhancement. Demo page: https://sbmse.github.io
Authors
(none)
Tags
Stats
Related papers
- An Investigation Of Incorporating Mamba For Speech Enhancement (2024)13.70
- Mamba-seunet: Mamba Unet For Monaural Speech Enhancement (2024)7.16
- Mamba-based Decoder-only Approach With Bidirectional Speech Modeling For Speech Recognition (2024)0.00
- An Exploration Of Mamba For Speech Self-supervised Models (2025)1.20
- Leveraging Joint Spectral And Spatial Learning With MAMBA For Multichannel Speech Enhancement (2024)0.00
- Diffusion-based Speech Enhancement With Schr\"odinger Bridge And Symmetric Noise Schedule (2024)0.00
- Dual-path Mamba: Short And Long-term Bidirectional Selective Structured State Space Models For Speech Separation (2024)4.12
- Improving Speech Enhancement By Cross- And Sub-band Processing With State Space Model (2025)3.58