Boosting Active Learning For Speech Recognition With Noisy Pseudo-labeled Samples
2020 Β· Jihwan Bang, Heesu Kim, Youngjoon Yoo, et al.
Abstract
The cost of annotating transcriptions for large speech corpora becomes a bottleneck to maximally enjoy the potential capacity of deep neural network-based automatic speech recognition models. In this paper, we present a new training pipeline boosting the conventional active learning approach targeting label-efficient learning to resolve the mentioned problem. Existing active learning methods only focus on selecting a set of informative samples under a labeling budget. One step further, we suggest that the training efficiency can be further improved by utilizing the unlabeled samples, exceeding the labeling budget, by introducing sophisticatedly configured unsupervised loss complementing supervised loss effectively. We propose new unsupervised loss based on consistency regularization, and we configure appropriate augmentation techniques for utterances to adopt consistency regularization in the automatic speech recognition task. From the qualitative and quantitative experiments on the re
Authors
(none)
Tags
Stats
Related papers
- Unsupervised Active Learning: Optimizing Labeling Cost-effectiveness For Automatic Speech Recognition (2023)0.00
- Loss Prediction: End-to-end Active Learning Approach For Speech Recognition (2021)7.16
- Active Learning Of Non-semantic Speech Tasks With Pretrained Models (2022)2.26
- Joint Speech Transcription And Translation: Pseudo-labeling With Out-of-distribution Data (2022)0.00
- Alternative Pseudo-labeling For Semi-supervised Automatic Speech Recognition (2023)10.48
- Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask (2021)5.84
- Personalized Speech Enhancement Through Self-supervised Data Augmentation And Purification (2021)9.92
- Boosting Noise Robustness Of Acoustic Model Via Deep Adversarial Training (2018)9.23