RobustSVC: HuBERT-based Melody Extractor And Adversarial Learning For Robust Singing Voice Conversion
2024 · Wei Chen, Xintao Zhao, Jun Chen, et al.
Abstract
Singing voice conversion (SVC) is sensitive to noise because the methods used to extract pitch and energy during inference are not robust. Since SVC depends on clean source audio, music source separation is a viable preprocessing step for noisy input, such as singing with background music (BGM). However, current separation methods either fail to fully remove noise or excessively suppress signal components, which degrades the naturalness and similarity of the processed audio. To tackle this, our study introduces RobustSVC, a novel any-to-one SVC framework that converts noisy vocals into clean vocals sung by the target singer. We replace the non-robust features with a HuBERT-based melody extractor and use adversarial training with three discriminators to reduce information leakage in the self-supervised representations. Experimental results show that RobustSVC is noise-robust and achieves higher similarity and naturalness than baseline methods in
Related papers
- Robust One-shot Singing Voice Conversion (2022)
- Singing Voice Conversion With Accompaniment Using Self-supervised Representation-based Melody Features (2025)
- PPG-based Singing Voice Conversion With Adversarial Representation Learning (2020)
- Leveraging Diverse Semantic-based Audio Pretrained Models For Singing Voice Conversion (2023)
- Neural Concatenative Singing Voice Conversion: Rethinking Concatenation-based Approach For One-shot Singing Voice Conversion (2023)
- Toward Expressive Singing Voice Correction: On Perceptual Validity Of Evaluation Metrics For Vocal Melody Extraction (2020)
- kNN-SVC: Robust Zero-shot Singing Voice Conversion With Additive Synthesis And Concatenation Smoothness Optimization (2025)
- LHQ-SVC: Lightweight And High Quality Singing Voice Conversion Modeling (2024)