Recursive Speech Separation For Unknown Number Of Speakers
2019 Β· Naoya Takahashi, Sudarsanam Parthasaarathy, Nabarun Goswami, et al.
Abstract
In this paper we propose a method of single-channel speaker-independent multi-speaker speech separation for an unknown number of speakers. As opposed to previous works, in which the number of speakers is assumed to be known in advance and speech separation models are specific for the number of speakers, our proposed method can be applied to cases with different numbers of speakers using a single model by recursively separating a speaker. To make the separation model recursively applicable, we propose one-and-rest permutation invariant training (OR-PIT). Evaluation on WSJ0-2mix and WSJ0-3mix datasets show that our proposed method achieves state-of-the-art results for two- and three-speaker mixtures with a single model. Moreover, the same model can separate four-speaker mixture, which was never seen during the training. We further propose the detection of the number of speakers in a mixture during recursive separation and show that this approach can more accurately estimate the number of
Authors
(none)
Tags
Stats
Related papers
- Coarse-to-fine Recursive Speech Separation For Unknown Number Of Speakers (2022)0.00
- Boosting Unknown-number Speaker Separation With Transformer Decoder-based Attractor (2024)0.00
- Multiple Choice Learning For Efficient Speech Separation With Many Speakers (2024)2.26
- Permutation Invariant Training Of Deep Models For Speaker-independent Multi-talker Speech Separation (2016)0.00
- Sepit: Approaching A Single Channel Speech Separation Bound (2022)10.35
- EEND-SS: Joint End-to-end Neural Speaker Diarization And Speech Separation For Flexible Number Of Speakers (2022)10.35
- Monaural Multi-speaker Speech Separation Using Efficient Transformer Model (2023)0.00
- Mask-dependent Phase Estimation For Monaural Speaker Separation (2019)6.34