Independence-based Joint Dereverberation And Separation With Neural Source Model
2021 Β· Kohei Saijo, Robin Scheibler
Abstract
We propose an independence-based joint dereverberation and separation method with a neural source model. We introduce a neural network in the framework of time-decorrelation iterative source steering, which is an extension of independent vector analysis to joint dereverberation and separation. The network is trained in an end-to-end manner with a permutation invariant loss on the time-domain separation output signals. Our proposed method can be applied in any situation with at least as many microphones as sources, regardless of their number. In experiments, we demonstrate that our method results in high performance in terms of both speech quality metrics and word error rate (WER), even for mixtures with a different number of speakers than training. Furthermore, the model, trained on synthetic mixtures, without any modifications, greatly reduces the WER on the recorded dataset LibriCSS.
Authors
(none)
Tags
Stats
Related papers
- Interleaved Multitask Learning For Audio Source Separation With Independent Databases (2019)0.00
- Single-channel Speech Separation With Auxiliary Speaker Embeddings (2019)0.00
- Audio-visual Speech Separation And Dereverberation With A Two-stage Multimodal Network (2019)12.47
- End-to-end Networks For Supervised Single-channel Speech Separation (2018)0.00
- A Comparison And Combination Of Unsupervised Blind Source Separation Techniques (2021)0.00
- Neural Blind Source Separation And Diarization For Distant Speech Recognition (2024)0.00
- Monaural Source Separation: From Anechoic To Reverberant Environments (2021)10.61
- Spatial Loss For Unsupervised Multi-channel Source Separation (2022)7.16