GCI Detection From Raw Speech Using A Fully-convolutional Network
2019 Β· Luc Ardaillon, Axel Roebel
Abstract
Glottal Closure Instants (GCI) detection consists in automatically detecting temporal locations of most significant excitation of the vocal tract from the speech signal. It is used in many speech analysis and processing applications, and various algorithms have been proposed for this purpose. Recently, new approaches using convolutional neural networks have emerged, with encouraging results. Following this trend, we propose a simple approach that performs a mapping from the speech waveform to a target signal from which the GCIs are obtained by peak-picking. However, the ground truth GCIs used for training and evaluation are usually extracted from EGG signals, which are not perfectly reliable and often not available. To overcome this problem, we propose to train our network on high-quality synthetic speech with perfect ground truth. The performances of the proposed algorithm are compared with three other state-of-the-art approaches using publicly available datasets, and the impact of us
Authors
(none)
Tags
Stats
Related papers
- Detection Of Glottal Closure Instants From Raw Speech Using Convolutional Neural Networks (2018)0.00
- Detection Of Glottal Closure Instants From Speech Signals: A Quantitative Review (2019)16.88
- Furcanet: An End-to-end Deep Gated Convolutional, Long Short-term Memory, Deep Neural Networks For Single Channel Speech Separation (2019)0.00
- Glottal Source Estimation Robustness: A Comparison Of Sensitivity Of Voice Source Estimation Techniques (2020)0.00
- Light Convolutional Neural Network With Feature Genuinization For Detection Of Synthetic Speech Attacks (2020)13.97
- Speaker-independent Raw Waveform Model For Glottal Excitation (2018)9.76
- Letter-based Speech Recognition With Gated Convnets (2017)0.00
- Reconstructing Speech From Real-time Articulatory MRI Using Neural Vocoders (2021)0.00