An Automatic Speech Recognition System For Bengali Language Based On Wav2vec2 And Transfer Learning
2022 · Tushar Talukder Showrav
Abstract
Automatic speech recognition (ASR) is an independent, automated method of decoding and transcribing spoken language. A typical ASR system extracts features from audio recordings or streams and runs one or more algorithms to map those features to the corresponding text. A great deal of research has been done in the field of speech signal processing in recent years. When given adequate resources, both conventional ASR and emerging end-to-end (E2E) speech recognition have produced promising results. However, for low-resource languages like Bengali, the current state of ASR lags behind, even though this low-resource status does not reflect the fact that the language is spoken by over 500 million people worldwide. Despite its popularity, few diverse open-source datasets are available, which makes it difficult to conduct research on Bengali speech recognition systems. This paper is part of the competition named 'BUET CSE Fest DL Sprint'. The purpose of this paper is to improv