A Comparison And Combination Of Unsupervised Blind Source Separation Techniques
2021 · Christoph Boeddeker, Frederik Rautenberg, Reinhold Haeb-Umbach
Abstract
Unsupervised blind source separation methods do not require a training phase and thus cannot suffer from a train-test mismatch, which is a common concern in neural-network-based source separation. The unsupervised techniques can be categorized into two classes: those building upon the sparsity of speech in the short-time Fourier transform (STFT) domain and those exploiting non-Gaussianity or non-stationarity of the source signals. In this contribution, spatial mixture models, which fall into the first category, and independent vector analysis (IVA), as a representative of the second category, are compared with respect to their separation performance and the performance of a downstream speech recognizer on a reverberant dataset of reasonable size. Furthermore, we introduce a serial concatenation of the two, where the result of the mixture model serves as initialization of IVA, which achieves significantly better word error rate (WER) performance than each algorithm individually and even approaches the performance of a much more
Related papers
- Independence-based Joint Dereverberation And Separation With Neural Source Model (2021) · 4.52
- End-to-end Networks For Supervised Single-channel Speech Separation (2018) · 0.00
- Surrogate Source Model Learning For Determined Source Separation (2020) · 9.59
- Determined Blind Source Separation Via Modeling Adjacent Frequency Band Correlations In Speech Signals (2025) · 0.00
- Single-channel Blind Source Separation For Singing Voice Detection: A Comparative Study (2018) · 0.00
- A Style Transfer Approach To Source Separation (2019) · 3.58
- Target Speech Extraction Based On Blind Source Separation And X-vector-based Speaker Selection Trained With Data Augmentation (2020) · 0.00
- Deep Bayesian Unsupervised Source Separation Based On A Complex Gaussian Mixture Model (2019) · 6.34