Using Heterogeneity In Semi-supervised Transcription Hypotheses To Improve Code-switched Speech Recognition
2021 Β· Andrew Slottje, Shannon Wotherspoon, William Hartmann, et al.
Abstract
Modeling code-switched speech is an important problem in automatic speech recognition (ASR). Labeled code-switched data are rare, so monolingual data are often used to model code-switched speech. These monolingual data may be more closely matched to one of the languages in the code-switch pair. We show that such asymmetry can bias prediction toward the better-matched language and degrade overall model performance. To address this issue, we propose a semi-supervised approach for code-switched ASR. We consider the case of English-Mandarin code-switching, and the problem of using monolingual data to build bilingual "transcription models'' for annotation of unlabeled code-switched data. We first build multiple transcription models so that their individual predictions are variously biased toward either English or Mandarin. We then combine these biased transcriptions using confidence-based selection. This strategy generates a superior transcript for semi-supervised training, and obtains a 19
Authors
(none)
Tags
Stats
Related papers
- Language Modeling For Code-switching: Evaluation, Integration Of Monolingual Data, And Discriminative Training (2018)5.24
- Code-switching Speech Recognition Under The Lens: Model- And Data-centric Perspectives (2025)0.00
- Constrained Output Embeddings For End-to-end Code-switching Speech Recognition With Only Monolingual Data (2019)7.16
- Data Augmentation For End-to-end Code-switching Speech Recognition (2020)9.92
- Generative Error Correction For Code-switching Speech Recognition Using Large Language Models (2023)0.00
- Towards End-to-end Code-switching Speech Recognition (2018)0.00
- Semi-supervised Development Of ASR Systems For Multilingual Code-switched Speech In Under-resourced Languages (2020)0.00
- Integrating Knowledge In End-to-end Automatic Speech Recognition For Mandarin-english Code-switching (2021)5.24