LHQ-SVC: Lightweight And High Quality Singing Voice Conversion Modeling
2024 Β· Yubo Huang, Xin Lai, Muyang Ye, et al.
Abstract
Singing Voice Conversion (SVC) has emerged as a significant subfield of Voice Conversion (VC), enabling the transformation of one singer's voice into another while preserving musical elements such as melody, rhythm, and timbre. Traditional SVC methods have limitations in terms of audio quality, data requirements, and computational complexity. In this paper, we propose LHQ-SVC, a lightweight, CPU-compatible model based on the SVC framework and diffusion model, designed to reduce model size and computational demand without sacrificing performance. We incorporate features to improve inference quality, and optimize for CPU execution by using performance tuning tools and parallel computing frameworks. Our experiments demonstrate that LHQ-SVC maintains competitive performance, with significant improvements in processing speed and efficiency across different devices. The results suggest that LHQ-SVC can meet
Authors
(none)
Tags
Stats
Related papers
- LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion With Inference Acceleration Via Latent Consistency Distillation (2024)3.58
- Fastsvc: Fast Cross-domain Singing Voice Conversion With Feature-wise Linear Modulation (2020)11.08
- LDM-SVC: Latent Diffusion Model Based Zero-shot Any-to-any Singing Voice Conversion With Singer Guidance (2024)5.84
- Leveraging Diverse Semantic-based Audio Pretrained Models For Singing Voice Conversion (2023)0.00
- Neural Concatenative Singing Voice Conversion: Rethinking Concatenation-based Approach For One-shot Singing Voice Conversion (2023)7.50
- Lifter Training And Sub-band Modeling For Computationally Efficient And High-quality Voice Conversion Using Spectral Differentials (2020)0.00
- Robustsvc: Hubert-based Melody Extractor And Adversarial Learning For Robust Singing Voice Conversion (2024)3.58
- Zero-shot Sing Voice Conversion: Built Upon Clustering-based Phoneme Representations (2024)0.00