Multi-singer: Fast Multi-singer Singing Voice Vocoder With A Large-scale Corpus
2021 Β· Rongjie Huang, Feiyang Chen, Yi Ren, et al.
Abstract
High-fidelity multi-singer singing voice synthesis is challenging for neural vocoder due to the singing voice data shortage, limited singer generalization, and large computational cost. Existing open corpora could not meet requirements for high-fidelity singing voice synthesis because of the scale and quality weaknesses. Previous vocoders have difficulty in multi-singer modeling, and a distinct degradation emerges when conducting unseen singer singing voice generation. To accelerate singing voice researches in the community, we release a large-scale, multi-singer Chinese singing voice dataset OpenSinger. To tackle the difficulty in unseen singer modeling, we propose Multi-Singer, a fast multi-singer vocoder with generative adversarial networks. Specifically, 1) Multi-Singer uses a multi-band generator to speed up both training and inference procedure. 2) to capture and rebuild singer identity from the acoustic feature (i.e., mel-spectrogram), Multi-Singer adopts a singer conditional di
Authors
(none)
Tags
Stats
Related papers
- Adversarially Trained Multi-singer Sequence-to-sequence Singing Synthesizer (2020)7.81
- Singgan: Generative Adversarial Network For High-fidelity Singing Voice Generation (2021)10.61
- Xiaoicesing 2: A High-fidelity Singing Voice Synthesizer Based On Generative Adversarial Network (2022)0.00
- Wgansing: A Multi-voice Singing Voice Synthesizer Based On The Wasserstein-gan (2019)11.08
- Singing Voice Data Scaling-up: An Introduction To Ace-opencpop And Ace-kising (2024)15.48
- Opencpop: A High-quality Open Source Chinese Popular Song Corpus For Singing Voice Synthesis (2022)13.34
- Bisinger: Bilingual Singing Voice Synthesis (2023)2.26
- Mandarin Singing Voice Synthesis With Denoising Diffusion Probabilistic Wasserstein GAN (2022)6.34