Quasi-periodic Wavenet Vocoder: A Pitch Dependent Dilated Convolution Model For Parametric Speech Generation
2019 Β· Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, et al.
Abstract
In this paper, we propose a quasi-periodic neural network (QPNet) vocoder with a novel network architecture named pitch-dependent dilated convolution (PDCNN) to improve the pitch controllability of WaveNet (WN) vocoder. The effectiveness of the WN vocoder to generate high-fidelity speech samples from given acoustic features has been proved recently. However, because of the fixed dilated convolution and generic network architecture, the WN vocoder hardly generates speech with given F0 values which are outside the range observed in training data. Consequently, the WN vocoder lacks the pitch controllability which is one of the essential capabilities of conventional vocoders. To address this limitation, we propose the PDCNN component which has the time-variant adaptive dilation size related to the given F0 values and a cascade network structure of the QPNet vocoder to generate quasi-periodic signals such as speech. Both objective and subjective tests are conducted, and the experimental res
Authors
(none)
Tags
Stats
Related papers
- Quasi-periodic Parallel Wavegan Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model For Parametric Speech Generation (2020)3.58
- Statistical Voice Conversion With Quasi-periodic Wavenet Vocoder (2019)3.58
- Periodgrad: Towards Pitch-controllable Neural Vocoder Based On A Diffusion Probabilistic Model (2024)0.00
- Non-parallel Voice Conversion System With Wavenet Vocoder And Collapsed Speech Suppression (2020)3.58
- Puffin: Pitch-synchronous Neural Waveform Generation For Fullband Speech On Modest Devices (2022)3.58
- Lp-wavenet: Linear Prediction-based Wavenet Speech Synthesis (2018)0.00
- Wavefit: An Iterative And Non-autoregressive Neural Vocoder Based On Fixed-point Iteration (2022)9.41
- Excitnet Vocoder: A Neural Excitation Model For Parametric Speech Synthesis Systems (2018)9.76