Spatial Loss For Unsupervised Multi-channel Source Separation
2022 Β· Kohei Saijo, Robin Scheibler
Abstract
We propose a spatial loss for unsupervised multi-channel source separation. The proposed loss exploits the duality of direction of arrival (DOA) and beamforming: the steering and beamforming vectors should be aligned for the target source, but orthogonal for interfering ones. The spatial loss encourages consistency between the mixing and demixing systems from a classic DOA estimator and a neural separator, respectively. With the proposed loss, we train the neural separators based on minimum variance distortionless response (MVDR) beamforming and independent vector analysis (IVA). We also investigate the effectiveness of combining our spatial loss and a signal loss, which uses the outputs of blind source separation as the reference. We evaluate our proposed method on synthetic and recorded (LibriCSS) mixtures. We find that the spatial loss is most effective to train IVA-based separators. For the neural MVDR beamformer, it performs best when combined with a signal loss. On synthetic mixt
Authors
(none)
Tags
Stats
Related papers
- Multichannel Loss Function For Supervised Speech Source Separation By Mask-based Beamforming (2019)7.50
- Locate And Beamform: Two-dimensional Locating All-neural Beamformer For Multi-channel Speech Separation (2023)3.58
- Independence-based Joint Dereverberation And Separation With Neural Source Model (2021)4.52
- 3D Neural Beamforming For Multi-channel Speech Separation Against Location Uncertainty (2023)0.00
- Deep Bayesian Unsupervised Source Separation Based On A Complex Gaussian Mixture Model (2019)6.34
- Multi-channel Speech Separation Using Spatially Selective Deep Non-linear Filters (2023)10.35
- Unsupervised Training For Deep Speech Source Separation With Kullback-leibler Divergence Based Probabilistic Loss Function (2019)9.92
- Convolutive Transfer Function Invariant SDR Training Criteria For Multi-channel Reverberant Speech Separation (2020)0.00