Joint Language Identification Of Code-switching Speech Using Attention Based E2E Network
2019 Β· Sreeram Ganji, Kunal Dhawan, Kumar Priyadarshi, et al.
Abstract
Language identification (LID) has relevance in many speech processing applications. For the automatic recognition of code-switching speech, the conventional approaches often employ an LID system for detecting the languages present within an utterance. In the existing works, the LID on code-switching speech involves modelling of the underlying languages separately. In this work, we propose a joint modelling based LID system for code-switching speech. To achieve the same, an attention-based end-to-end (E2E) network has been explored. For the development and evaluation of the proposed approach, a recently created Hindi-English code-switching corpus has been used. For the contrast purpose, an LID system employing the connectionist temporal classification-based E2E network is also developed. On comparing both the LID systems, the attention based approach is noted to result in better LID accuracy. The effective location of code-switching boundaries within the utterance by the proposed approa
Authors
(none)
Tags
Stats
Related papers
- Investigating Target Set Reduction For End-to-end Speech Recognition Of Hindi-english Code-switching Data (2019)5.84
- Is Attention Always Needed? A Case Study On Language Identification From Speech (2021)2.26
- Adversarial Synthesis Based Data-augmentation For Code-switched Spoken Language Identification (2022)0.00
- Streaming End-to-end Bilingual ASR Systems With Joint Language Identification (2020)0.00
- Spoken Language Identification System For English-mandarin Code-switching Child-directed Speech (2023)4.52
- End-to-end Language Identification Using Multi-head Self-attention And 1D Convolutional Neural Networks (2021)0.00
- Leveraging Language ID To Calculate Intermediate CTC Loss For Enhanced Code-switching Speech Recognition (2023)0.00
- End-to-end ASR For Code-switched Hindi-english Speech (2019)0.00