Leveraging Data Collection And Unsupervised Learning For Code-switched Tunisian Arabic Automatic Speech Recognition
2023 Β· Ahmed Amine Ben Abdallah, Ata Kabboudi, Amir Kanoun, et al.
Abstract
Crafting an effective Automatic Speech Recognition (ASR) solution for dialects demands innovative approaches that not only address the data scarcity issue but also navigate the intricacies of linguistic diversity. In this paper, we address the aforementioned ASR challenge, focusing on the Tunisian dialect. First, textual and audio data is collected and in some cases annotated. Second, we explore self-supervision, semi-supervision and few-shot code-switching approaches to push the state-of-the-art on different Tunisian test sets; covering different acoustic, linguistic and prosodic conditions. Finally, and given the absence of conventional spelling, we produce a human evaluation of our transcripts to avoid the noise coming from spelling inadequacies in our testing references. Our models, allowing to transcribe audio samples in a linguistic mix involving Tunisian Arabic, English and French, and all the data used during training and testing are released for public use and further improvem
Authors
(none)
Tags
Stats
Related papers
- Linto Audio And Textual Datasets To Train And Evaluate Automatic Speech Recognition In Tunisian Arabic Dialect (2025)0.00
- Dialectal Coverage And Generalization In Arabic Speech Recognition (2024)4.52
- Textual Data Augmentation For Arabic-english Code-switching Speech Recognition (2022)6.77
- Towards One Model To Rule All: Multilingual Strategy For Dialectal Code-switching Arabic ASR (2021)9.03
- Performance Analysis Of Speech Encoders For Low-resource SLU And ASR In Tunisian Dialect (2024)4.52
- Acoustic And Textual Data Augmentation For Improved ASR Of Code-switching Speech (2018)9.92
- Semi-supervised Development Of ASR Systems For Multilingual Code-switched Speech In Under-resourced Languages (2020)0.00
- Code-switching Speech Recognition Under The Lens: Model- And Data-centric Perspectives (2025)0.00