Stabilizing Label Assignment For Speech Separation By Self-supervised Pre-training
2020 Β· Sung-Feng Huang, Shun-Po Chuang, da-Rong Liu, et al.
Abstract
Speech separation has been well developed, with the very successful permutation invariant training (PIT) approach, although the frequent label assignment switching happening during PIT training remains to be a problem when better convergence speed and achievable performance are desired. In this paper, we propose to perform self-supervised pre-training to stabilize the label assignment in training the speech separation model. Experiments over several types of self-supervised approaches, several typical speech separation models and two different datasets showed that very good improvements are achievable if a proper self-supervised approach is chosen.
Authors
(none)
Tags
Stats
Related papers
- Progressive Learning For Stabilizing Label Selection In Speech Separation With Mapping-based Method (2021)0.00
- Improving Label Assignments Learning By Dynamic Sample Dropout Combined With Layer-wise Optimization In Speech Separation (2023)0.00
- Interrupted And Cascaded Permutation Invariant Training For Speech Separation (2019)4.52
- Single-channel Speech Separation Using Soft-minimum Permutation Invariant Training (2021)2.26
- Probabilistic Permutation Invariant Training For Speech Separation (2019)7.81
- Permutation Invariant Training Of Deep Models For Speaker-independent Multi-talker Speech Separation (2016)0.00
- Separating Long-form Speech With Group-wise Permutation Invariant Training (2021)4.52
- An Adapter Based Multi-label Pre-training For Speech Separation And Enhancement (2022)7.50