Dyn-asr: Compact, Multilingual Speech Recognition Via Spoken Language And Accent Identification
2021 Β· Sangeeta Ghangam, Daniel Whitenack, Joshua Nemecek
Abstract
Running automatic speech recognition (ASR) on edge devices is non-trivial due to resource constraints, especially in scenarios that require supporting multiple languages. We propose a new approach to enable multilingual speech recognition on edge devices. This approach uses both language identification and accent identification to select one of multiple monolingual ASR models on-the-fly, each fine-tuned for a particular accent. Initial results for both recognition performance and resource usage are promising with our approach using less than 1/12th of the memory consumed by other solutions.
Authors
(none)
Tags
Stats
Related papers
- Tiny-align: Bridging Automatic Speech Recognition And Large Language Model On The Edge (2024)0.00
- Adaptive Activation Network For Low Resource Multilingual Speech Recognition (2022)0.00
- Multilingual Approach To Joint Speech And Accent Recognition With DNN-HMM Framework (2020)0.00
- Dynamic Sparsity Neural Networks For Automatic Speech Recognition (2020)0.00
- Pi-whisper: Designing An Adaptive And Incremental Automatic Speech Recognition System For Edge Devices (2024)0.00
- Effectiveasr: A Single-step Non-autoregressive Mandarin Speech Recognition Architecture With High Accuracy And Inference Speed (2024)3.58
- Streaming Language Identification Using Combination Of Acoustic Representations And ASR Hypotheses (2020)0.00
- Master-asr: Achieving Multilingual Scalability And Low-resource Adaptation In ASR With Modular Learning (2023)0.00