Significance Of Data Augmentation For Improving Cleft Lip And Palate Speech Recognition
2021 Β· Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, et al.
Abstract
The automatic recognition of pathological speech, particularly from children with any articulatory impairment, is a challenging task due to various reasons. The lack of available domain specific data is one such obstacle that hinders its usage for different speech-based applications targeting pathological speakers. In line with the challenge, in this work, we investigate a few data augmentation techniques to simulate training data for improving the children speech recognition considering the case of cleft lip and palate (CLP) speech. The augmentation techniques explored in this study, include vocal tract length perturbation (VTLP), reverberation, speaking rate, pitch modification, and speech feature modification using cycle consistent adversarial networks (CycleGAN). Our study finds that the data augmentation methods significantly improve the CLP speech recognition performance, which is more evident when we used feature modification using CycleGAN, VTLP and reverberation based methods.
Authors
(none)
Tags
Stats
Related papers
- Personalized Adversarial Data Augmentation For Dysarthric And Elderly Speech Recognition (2022)11.49
- Spectral Modification Based Data Augmentation For Improving End-to-end ASR For Children's Speech (2022)8.35
- Data Augmentation Methods For End-to-end Speech Recognition On Distant-talk Scenarios (2021)6.34
- Adversarial Data Augmentation Using VAE-GAN For Disordered Speech Recognition (2022)0.00
- You Do Not Need More Data: Improving End-to-end Speech Recognition By Text-to-speech Data Augmentation (2020)11.49
- LPC Augment: An Lpc-based ASR Data Augmentation Algorithm For Low And Zero-resource Children's Dialects (2022)7.81
- Improving Multimodal Speech Recognition By Data Augmentation And Speech Representations (2022)9.03
- Data Augmenting Contrastive Learning Of Speech Representations In The Time Domain (2020)12.81