Singing Voice Synthesis Based On Convolutional Neural Networks
2019 Β· Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, et al.
Abstract
The present paper describes a singing voice synthesis based on convolutional neural networks (CNNs). Singing voice synthesis systems based on deep neural networks (DNNs) are currently being proposed and are improving the naturalness of synthesized singing voices. In these systems, the relationship between musical score feature sequences and acoustic feature sequences extracted from singing voices is modeled by DNNs. Then, an acoustic feature sequence of an arbitrary musical score is output in units of frames by the trained DNNs, and a natural trajectory of a singing voice is obtained by using a parameter generation algorithm. As singing voices contain rich expression, a powerful technique to model them accurately is required. In the proposed technique, long-term dependencies of singing voices are modeled by CNNs. An acoustic feature sequence is generated in units of segments that consist of long-term frames, and a natural trajectory is obtained without the parameter generation algorith
Authors
(none)
Tags
Stats
Related papers
- Fast And High-quality Singing Voice Synthesis System Based On Convolutional Neural Networks (2019)8.82
- Singing Voice Synthesis Using Deep Autoregressive Neural Networks For Acoustic Modeling (2019)9.92
- Unsupervised Singing Voice Conversion (2019)11.19
- Singgan: Generative Adversarial Network For High-fidelity Singing Voice Generation (2021)10.61
- NNSVS: A Neural Network-based Singing Voice Synthesis Toolkit (2022)13.83
- Wgansing: A Multi-voice Singing Voice Synthesizer Based On The Wasserstein-gan (2019)11.08
- Leveraging Symmetrical Convolutional Transformer Networks For Speech To Singing Voice Style Transfer (2022)5.84
- Generative Moment Matching Network-based Random Modulation Post-filter For Dnn-based Singing Voice Synthesis And Neural Double-tracking (2019)4.52