The Deterministic Plus Stochastic Model Of The Residual Signal And Its Applications
2019 Β· Thomas Drugman, Thierry Dutoit
Abstract
The modeling of speech production often relies on a source-filter approach. Although methods parameterizing the filter have nowadays reached a certain maturity, there is still a lot to be gained for several speech processing applications in finding an appropriate excitation model. This manuscript presents a Deterministic plus Stochastic Model (DSM) of the residual signal. The DSM consists of two contributions acting in two distinct spectral bands delimited by a maximum voiced frequency. Both components are extracted from an analysis performed on a speaker-dependent dataset of pitch-synchronous residual frames. The deterministic part models the low-frequency contents and arises from an orthonormal decomposition of these frames. As for the stochastic component, it is a high-frequency noise modulated both in time and frequency. Some interesting phonetic and computational properties of the DSM are also highlighted. The applicability of the DSM in two fields of speech processing is then stu
Authors
(none)
Tags
Stats
Related papers
- Neuraldps: Neural Deterministic Plus Stochastic Model With Multiband Excitation For Noise-controllable Waveform Generation (2022)0.00
- Storm: A Diffusion-based Stochastic Regeneration Model For Speech Enhancement And Dereverberation (2022)15.43
- Disentangling Speech And Non-speech Components For Building Robust Acoustic Models From Found Data (2019)0.00
- Speech Decomposition Based On A Hybrid Speech Model And Optimal Segmentation (2021)0.00
- Using A Pitch-synchronous Residual Codebook For Hybrid Hmm/frame Selection Speech Synthesis (2019)9.41
- Beyond Oversmoothing: Evaluating DDPM And MSE For Scalable Speech Synthesis In ASR (2024)0.00
- Rnn-based Speech Synthesis Using A Continuous Sinusoidal Model (2019)3.58
- Resgrad: Residual Denoising Diffusion Probabilistic Models For Text To Speech (2022)0.00