Complex NMF Under Phase Constraints Based On Signal Modeling: Application To Audio Source Separation
2016 Β· Paul Magron, Roland Badeau, Bertrand David
Abstract
Nonnegative Matrix Factorization (NMF) is a powerful tool for decomposing mixtures of audio signals in the Time-Frequency (TF) domain. In the source separation framework, the phase recovery for each extracted component is necessary for synthesizing time-domain signals. The Complex NMF (CNMF) model aims to jointly estimate the spectrogram and the phase of the sources, but requires to constrain the phase in order to produce satisfactory sounding results. We propose to incorporate phase constraints based on signal models within the CNMF framework: a \textit\{phase unwrapping\} constraint that enforces a form of temporal coherence, and a constraint based on the \textit\{repetition\} of audio events, which models the phases of the sources within onset frames. We also provide an algorithm for estimating the model parameters. The experimental results highlight the interest of including such constraints in the CNMF framework for separating overlapping components in complex audio mixtures.
Authors
(none)
Tags
Stats
Related papers
- End-to-end Non-negative Autoencoders For Sound Source Separation (2019)2.26
- Joint Sound Source Separation And Speaker Recognition (2016)4.52
- Neural Network Alternatives To Convolutive Audio Models For Source Separation (2017)0.00
- Determined Multichannel Blind Source Separation With Clustered Source Model (2024)0.00
- Accelerated Convolutive Transfer Function-based Multichannel NMF Using Iterative Source Steering (2025)0.00
- Generalized Multichannel Variational Autoencoder For Underdetermined Source Separation (2018)7.81
- Spectrogram Inversion For Audio Source Separation Via Consistency, Mixing, And Magnitude Constraints (2023)0.00
- On Ambisonic Source Separation With Spatially Informed Non-negative Tensor Factorization (2025)4.52