Confidence Score Based Speaker Adaptation Of Conformer Speech Recognition Systems
2023 Β· Jiajun Deng, Xurong Xie, Tianzi Wang, et al.
Abstract
Speaker adaptation techniques provide a powerful solution to customise automatic speech recognition (ASR) systems for individual users. Practical application of unsupervised model-based speaker adaptation techniques to data intensive end-to-end ASR systems is hindered by the scarcity of speaker-level data and performance sensitivity to transcription errors. To address these issues, a set of compact and data efficient speaker-dependent (SD) parameter representations are used to facilitate both speaker adaptive training and test-time unsupervised speaker adaptation of state-of-the-art Conformer ASR systems. The sensitivity to supervision quality is reduced using a confidence score-based selection of the less erroneous subset of speaker-level adaptation data. Two lightweight confidence score estimation modules are proposed to produce more reliable confidence scores. The data sparsity issue, which is exacerbated by data selection, is addressed by modelling the SD parameter uncertainty usin
Authors
(none)
Tags
Stats
Related papers
- Confidence Score Based Conformer Speaker Adaptation For Speech Recognition (2022)8.09
- Factorised Speaker-environment Adaptive Training Of Conformer Speech Recognition Systems (2023)0.00
- Unsupervised Model-based Speaker Adaptation Of End-to-end Lattice-free MMI Model For Speech Recognition (2022)2.26
- Bayesian Learning For Deep Neural Network Adaptation (2020)9.76
- On-the-fly Feature Based Rapid Speaker Adaptation For Dysarthric And Elderly Speech Recognition (2022)6.34
- A Unified Speaker Adaptation Method For Speech Synthesis Using Transcribed And Untranscribed Speech With Backpropagation (2019)0.00
- Adversarial Speaker Adaptation (2019)10.21
- Structured Speaker-deficiency Adaptation Of Foundation Models For Dysarthric And Elderly Speech Recognition (2024)0.00