Multichannel Audio Source Separation With Independent Deeply Learned Matrix Analysis Using Product Of Source Models
2021 · Takuya Hasumi, Tomohiko Nakamura, Norihiro Takamune, et al.
Abstract
Independent deeply learned matrix analysis (IDLMA) is a state-of-the-art multichannel audio source separation method that estimates source power with deep neural networks (DNNs). The DNN-based power estimation works well for sounds whose timbres are similar to those in the DNN training data. However, the sounds to which IDLMA is applied do not always have such timbres, and this timbral mismatch degrades the performance of IDLMA. To tackle this problem, we focus on the blind source separation counterpart of IDLMA, independent low-rank matrix analysis. It uses nonnegative matrix factorization (NMF) as the source model, which can capture source spectral components that appear only in the target mixture by using the low-rank structure of the source spectrogram as a clue. We thus extend the DNN-based source model to encompass the NMF-based source model on the basis of the product-of-experts concept, and we call the result the product of source models (PoSM). For the proposed PoSM-based I
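The product-of-experts idea named in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that both source models are zero-mean Gaussians with a variance per time-frequency bin; the abstract does not state the distributions, so this is not necessarily the paper's formulation. Under that assumption, multiplying the two densities and renormalizing gives another zero-mean Gaussian whose precision is the sum of the two precisions. The function names, array sizes, and the random stand-in for the DNN output are all hypothetical.

```python
import numpy as np

def nmf_variance(T, V):
    """Low-rank variance model: (freq x bases) times (bases x frames)."""
    return T @ V

def product_of_gaussian_variances(r_dnn, r_nmf, eps=1e-12):
    """Variance of the renormalized product of two zero-mean Gaussians.

    Precisions (inverse variances) add, so the result is
    1 / (1/r_dnn + 1/r_nmf). eps guards against division by zero.
    """
    return 1.0 / (1.0 / (r_dnn + eps) + 1.0 / (r_nmf + eps))

rng = np.random.default_rng(0)
n_freq, n_frames, n_bases = 4, 5, 2  # toy sizes, chosen arbitrarily

# Stand-in for a DNN-estimated source power spectrogram (assumption:
# random positive values; no trained network is involved here).
r_dnn = rng.uniform(0.5, 2.0, size=(n_freq, n_frames))

# Nonnegative NMF factors: spectral bases T and activations V.
T = rng.uniform(0.1, 1.0, size=(n_freq, n_bases))
V = rng.uniform(0.1, 1.0, size=(n_bases, n_frames))
r_nmf = nmf_variance(T, V)

# Combined variance under the Gaussian product-of-experts assumption.
r_posm = product_of_gaussian_variances(r_dnn, r_nmf)
```

In this sketch the combined variance in each bin is never larger than the smaller of the two inputs, so a bin is treated as energetic only when both models agree that it is.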
Related papers
- Determined Multichannel Blind Source Separation With Clustered Source Model (2024) · 0.00
- Generalized Multichannel Variational Autoencoder For Underdetermined Source Separation (2018) · 7.81
- Determined Blind Source Separation Via Modeling Adjacent Frequency Band Correlations In Speech Signals (2025) · 0.00
- Multichannel Blind Speech Source Separation With A Disjoint Constraint Source Model (2024) · 0.00
- Interleaved Multitask Learning For Audio Source Separation With Independent Databases (2019) · 0.00
- Multichannel Singing Voice Separation By Deep Neural Network Informed DOA Constrained CNMF (2020) · 5.84
- Data-driven Source Separation Based On Simplex Analysis (2018) · 0.00
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models (2022) · 10.97