Adversarial Synthesis Based Data-augmentation For Code-switched Spoken Language Identification
2022 Β· Parth Shastri, Chirag Patil, Poorval Wanere, et al.
Abstract
Spoken Language Identification (LID) is an important sub-task of Automatic Speech Recognition(ASR) that is used to classify the language(s) in an audio segment. Automatic LID plays an useful role in multilingual countries. In various countries, identifying a language becomes hard, due to the multilingual scenario where two or more than two languages are mixed together during conversation. Such phenomenon of speech is called as code-mixing or code-switching. This nature is followed not only in India but also in many Asian countries. Such code-mixed data is hard to find, which further reduces the capabilities of the spoken LID. Hence, this work primarily addresses this problem using data augmentation as a solution on the on the data scarcity of the code-switched class. This study focuses on Indic language code-mixed with English. Spoken LID is performed on Hindi, code-mixed with English. This research proposes Generative Adversarial Network (GAN) based data augmentation technique perform
Authors
(none)
Tags
Stats
Related papers
- Code-switching Sentence Generation By Generative Adversarial Networks And Its Application To Data Augmentation (2018)0.00
- Data Augmentation For End-to-end Code-switching Speech Recognition (2020)9.92
- Joint Language Identification Of Code-switching Speech Using Attention Based E2E Network (2019)5.24
- Personalized Adversarial Data Augmentation For Dysarthric And Elderly Speech Recognition (2022)11.49
- Investigating Lexical Replacements For Arabic-english Code-switched Data Augmentation (2022)5.84
- Data Augmentation For Spoken Language Understanding Via Joint Variational Generation (2018)10.61
- Unified Model For Code-switching Speech Recognition And Language Identification Based On A Concatenated Tokenizer (2023)8.09
- Textual Data Augmentation For Arabic-english Code-switching Speech Recognition (2022)6.77