Evaluating Standard And Dialectal Frisian ASR: Multilingual Fine-tuning And Language Identification For Improved Low-resource Performance
2025 · Reihaneh Amooie, Wietse de Vries, Yun Hao, et al.
Abstract
Automatic Speech Recognition (ASR) performance for low-resource languages is still far behind that of higher-resource languages such as English, due to a lack of sufficient labeled data. State-of-the-art methods deploy self-supervised transfer learning where a model pre-trained on large amounts of data is fine-tuned using little labeled data in a target low-resource language. In this paper, we present and examine a method for fine-tuning an SSL-based model in order to improve the performance for Frisian and its regional dialects (Clay Frisian, Wood Frisian, and South Frisian). We show that Frisian ASR performance can be improved by using multilingual (Frisian, Dutch, English and German) fine-tuning data and an auxiliary language identification task. In addition, our findings show that performance on dialectal speech suffers substantially, and, importantly, that this effect is moderated by the elicitation approach used to collect the dialectal data. Our findings also particularly sugges
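The abstract describes fine-tuning on four languages with an auxiliary language identification (LID) task, but it does not say how the two objectives are combined. A minimal sketch, assuming a simple weighted sum of the recognition loss and a cross-entropy LID loss, might look as follows. The function names, the `lid_weight` value, and the weighted-sum form are illustrative assumptions, not the authors' implementation.

```python
import math

# The four fine-tuning languages named in the abstract.
LANGUAGES = ["frisian", "dutch", "english", "german"]


def lid_cross_entropy(logits, target_index):
    """Cross-entropy of softmax(logits) against the true language index."""
    # Subtract the max logit before exponentiating for numerical stability.
    max_logit = max(logits)
    log_sum_exp = max_logit + math.log(sum(math.exp(x - max_logit) for x in logits))
    return log_sum_exp - logits[target_index]


def multitask_loss(asr_loss, lid_logits, language, lid_weight=0.1):
    """Hypothetical combined objective: ASR loss plus a weighted LID loss.

    asr_loss   -- recognition loss for the utterance (e.g. a CTC loss; assumption)
    lid_logits -- one unnormalised score per entry in LANGUAGES
    language   -- true language label of the utterance
    lid_weight -- assumed trade-off weight; not taken from the paper
    """
    target = LANGUAGES.index(language)
    return asr_loss + lid_weight * lid_cross_entropy(lid_logits, target)


if __name__ == "__main__":
    # An utterance labelled Frisian whose LID scores favour Frisian.
    print(multitask_loss(asr_loss=2.0, lid_logits=[3.0, 1.0, 0.2, 0.1], language="frisian"))
```

In this sketch the LID term only adds a penalty when the model's language scores disagree with the utterance label, so setting `lid_weight` to zero recovers plain ASR fine-tuning.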
Related papers
- Acoustic And Textual Data Augmentation For Improved ASR Of Code-switching Speech (2018)
- Exploring The Impact Of Data Quantity On ASR In Extremely Low-resource Languages (2024)
- Semi-supervised Acoustic Model Training For Speech With Code-switching (2018)
- Fine-tuning Strategies For Faster Inference Using Speech Self-supervised Models: A Comparative Study (2023)
- How To Learn A New Language? An Efficient Solution For Self-supervised Learning Models Unseen Languages Adaption In Low-resource Scenario (2024)
- Performance Analysis Of Speech Encoders For Low-resource SLU And ASR In Tunisian Dialect (2024)
- Parameter-efficient Adaptation Of Multilingual Multimodal Models For Low-resource ASR (2024)
- Code-switching Detection With Data-augmented Acoustic And Language Models (2018)