Hyper-parameter Adaptation Of Conformer ASR Systems For Elderly And Dysarthric Speech Recognition
2023 Β· Tianzi Wang, Shoukang Hu, Jiajun Deng, et al.
Abstract
Automatic recognition of disordered and elderly speech remains highly challenging tasks to date due to data scarcity. Parameter fine-tuning is often used to exploit the large quantities of non-aged and healthy speech pre-trained models, while neural architecture hyper-parameters are set using expert knowledge and remain unchanged. This paper investigates hyper-parameter adaptation for Conformer ASR systems that are pre-trained on the Librispeech corpus before being domain adapted to the DementiaBank elderly and UASpeech dysarthric speech datasets. Experimental results suggest that hyper-parameter adaptation produced word error rate (WER) reductions of 0.45% and 0.67% over parameter-only fine-tuning on DBank and UASpeech tasks respectively. An intuitive correlation is found between the performance improvements by hyper-parameter domain adaptation and the relative utterance length ratio between the source and target domain data.
Authors
(none)
Tags
Stats
Related papers
- On-the-fly Feature Based Rapid Speaker Adaptation For Dysarthric And Elderly Speech Recognition (2022)6.34
- Speaker Adaptation Using Spectro-temporal Deep Features For Dysarthric And Elderly Speech Recognition (2022)12.02
- Residual Adapters For Parameter-efficient ASR Adaptation To Atypical And Accented Speech (2021)10.74
- Homogeneous Speaker Features For On-the-fly Dysarthric And Elderly Speaker Adaptation (2024)0.00
- Structured Speaker-deficiency Adaptation Of Foundation Models For Dysarthric And Elderly Speech Recognition (2024)0.00
- Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion And Householder Transformation (2023)0.00
- Enhancing Dysarthric Speech Recognition For Unseen Speakers Via Prototype-based Adaptation (2024)9.45
- Factorised Speaker-environment Adaptive Training Of Conformer Speech Recognition Systems (2023)0.00