Multilingual Approach To Joint Speech And Accent Recognition With DNN-HMM Framework
2020 Β· Yizhou Peng, Jicheng Zhang, Haobo Zhang, et al.
Abstract
Human can recognize speech, as well as the peculiar accent of the speech simultaneously. However, present state-of-the-art ASR system can rarely do that. In this paper, we propose a multilingual approach to recognizing English speech, and related accent that speaker conveys using DNN-HMM framework. Specifically, we assume different accents of English as different languages. We then merge them together and train a multilingual ASR system. During decoding, we conduct two experiments. One is a monolingual ASR-based decoding, with the accent information embedded at phone level, realizing word-based accent recognition (AR), and the other is a multilingual ASR-based decoding, realizing an approximated utterance-based AR. Experimental results on an 8-accent English speech recognition show both methods can yield WERs close to the conventional ASR systems that completely ignore the accent, as well as desired AR accuracy. Besides, we conduct extensive analysis for the proposed method, such as tr
Authors
(none)
Tags
Stats
Related papers
- E2e-based Multi-task Learning Approach To Joint Speech And Accent Recognition (2021)0.00
- Decoupling And Interacting Multi-task Learning Network For Joint Speech And Accent Recognition (2023)9.03
- Accent And Speaker Disentanglement In Many-to-many Voice Conversion (2020)10.35
- Investigation Of Deep Neural Network Acoustic Modelling Approaches For Low Resource Accented Mandarin Speech Recognition (2022)0.00
- Dyn-asr: Compact, Multilingual Speech Recognition Via Spoken Language And Accent Identification (2021)5.24
- Analysis Of Multilingual Sequence-to-sequence Speech Recognition Systems (2018)0.00
- A Highly Adaptive Acoustic Model For Accurate Multi-dialect Speech Recognition (2022)10.85
- Investigating The Impact Of Cross-lingual Acoustic-phonetic Similarities On Multilingual Speech Recognition (2022)3.58