Self-remixing: Unsupervised Speech Separation Via Separation And Remixing
2022 Β· Kohei Saijo, Tetsuji Ogawa
Abstract
We present Self-Remixing, a novel self-supervised speech separation method, which refines a pre-trained separation model in an unsupervised manner. The proposed method consists of a shuffler module and a solver module, and they grow together through separation and remixing processes. Specifically, the shuffler first separates observed mixtures and makes pseudo-mixtures by shuffling and remixing the separated signals. The solver then separates the pseudo-mixtures and remixes the separated signals back to the observed mixtures. The solver is trained using the observed mixtures as supervision, while the shuffler's weights are updated by taking the moving average with the solver's, generating the pseudo-mixtures with fewer distortions. Our experiments demonstrate that Self-Remixing gives better performance over existing remixing-based self-supervised methods with the same or less training costs under unsupervised setup. Self-Remixing also outperforms baselines in semi-supervised domain ada
Authors
(none)
Tags
Stats
Related papers
- Continual Self-training With Bootstrapped Remixing For Speech Enhancement (2021)7.81
- Remixit: Continual Self-training Of Speech Enhancement Models Via Bootstrapped Remixing (2022)12.47
- Remix-cycle-consistent Learning On Adversarially Learned Separator For Accurate And Stable Unsupervised Speech Separation (2022)3.58
- Mixcycle: Unsupervised Speech Separation Via Cyclic Mixture Permutation Invariant Training (2022)6.34
- Teacher-student Mixit For Unsupervised And Semi-supervised Speech Separation (2021)9.03
- Improved Singing Voice Separation With Chromagram-based Pitch-aware Remixing (2022)7.50
- Exploring The Integration Of Speech Separation And Recognition With Self-supervised Learning Representation (2023)6.34
- Remixed2remixed: Domain Adaptation For Speech Enhancement By Noise2noise Learning With Remixing (2023)5.24