Spectral Modification Based Data Augmentation For Improving End-to-end ASR For Children's Speech
2022 Β· Vishwanath Pratap Singh, Hardik Sailor, Supratik Bhattacharya, et al.
Abstract
Training a robust Automatic Speech Recognition (ASR) system for children's speech recognition is a challenging task due to inherent differences in acoustic attributes of adult and child speech and scarcity of publicly available children's speech dataset. In this paper, a novel segmental spectrum warping and perturbations in formant energy are introduced, to generate a children-like speech spectrum from that of an adult's speech spectrum. Then, this modified adult spectrum is used as augmented data to improve end-to-end ASR systems for children's speech recognition. The proposed data augmentation methods give 6.5% and 6.1% relative reduction in WER on children dev and test sets respectively, compared to the vocal tract length perturbation (VTLP) baseline system trained on Librispeech 100 hours adult speech dataset. When children's speech data is added in training with Librispeech set, it gives a 3.7 % and 5.1% relative reduction in WER, compared to the VTLP baseline system.
Authors
(none)
Tags
Stats
Related papers
- LPC Augment: An Lpc-based ASR Data Augmentation Algorithm For Low And Zero-resource Children's Dialects (2022)7.81
- Fundamental Frequency Feature Normalization And Data Augmentation For Child Speech Recognition (2021)8.09
- Improving Child Speech Recognition With Augmented Child-like Speech (2024)5.24
- Significance Of Data Augmentation For Improving Cleft Lip And Palate Speech Recognition (2021)0.00
- Using Data Augmentations And VTLN To Reduce Bias In Dutch End-to-end Speech Recognition Systems (2023)0.00
- Phaseperturbation: Speech Data Augmentation Via Phase Perturbation For Automatic Speech Recognition (2023)0.00
- Transfer Learning For Robust Low-resource Children's Speech ASR With Transformers And Source-filter Warping (2022)6.77
- Personalized Adversarial Data Augmentation For Dysarthric And Elderly Speech Recognition (2022)11.49