Multichannel Blind Speech Source Separation With A Disjoint Constraint Source Model
2024 · Jianyu Wang, Shanzheng Guan
Abstract
Multichannel convolutive blind speech source separation refers to the problem of separating different speech sources from the observed multichannel mixtures without much a priori information about the mixing system. Multichannel nonnegative matrix factorization (MNMF) has been proven to be one of the most powerful separation frameworks and the representative algorithms such as MNMF and the independent low-rank matrix analysis (ILRMA) have demonstrated great performance. However, the sparseness properties of speech source signals are not fully taken into account in such a framework. It is well known that speech signals are sparse in nature, which is considered in this work to improve the separation performance. Specifically, we utilize the Bingham and Laplace distributions to formulate a disjoint constraint regularizer, which is subsequently incorporated into both MNMF and ILRMA. We then derive majorization-minimization rules for updating parameters related to the source model, resultin
Authors
(none)
Tags
Stats
Related papers
- Determined Blind Source Separation Via Modeling Adjacent Frequency Band Correlations In Speech Signals (2025)0.00
- Determined Multichannel Blind Source Separation With Clustered Source Model (2024)0.00
- Data-driven Source Separation Based On Simplex Analysis (2018)0.00
- Multichannel Audio Source Separation With Independent Deeply Learned Matrix Analysis Using Product Of Source Models (2021)0.00
- Joint Sound Source Separation And Speaker Recognition (2016)4.52
- Generalized Multichannel Variational Autoencoder For Underdetermined Source Separation (2018)7.81
- Accelerated Convolutive Transfer Function-based Multichannel NMF Using Iterative Source Steering (2025)0.00
- Multichannel Singing Voice Separation By Deep Neural Network Informed DOA Constrained CNMF (2020)5.84