Fast Contextual Adaptation With Neural Associative Memory For On-device Personalized Speech Recognition
2021 · Tsendsuren Munkhdalai, Khe Chai Sim, Angad Chandorkar, et al.
Abstract
Fast contextual adaptation has been shown to be effective in improving Automatic Speech Recognition (ASR) of rare words, and when combined with on-device personalized training, it can yield even better recognition results. However, traditional re-scoring approaches based on an external language model are prone to diverging during personalized training. In this work, we introduce a model-based end-to-end contextual adaptation approach that is decoder-agnostic and amenable to on-device personalization. Our on-device simulation experiments demonstrate that the proposed approach outperforms the traditional re-scoring technique by 12% relative WER and 15.7% entity-mention-specific F1-score in a continuous personalization scenario.
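The "12% relative WER" figure is easier to read with the arithmetic spelled out. The sketch below assumes the conventional definitions: word error rate as word-level edit distance divided by reference length, and relative improvement as (baseline - new) / baseline. The abstract does not state how the authors computed their figure, and the function names, sentences, and WER values here are hypothetical illustrations, not data from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the processed reference
    # prefix and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)


def relative_wer_reduction(baseline_wer: float, new_wer: float) -> float:
    """Fraction of the baseline WER removed by the new system."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return (baseline_wer - new_wer) / baseline_wer


# Hypothetical example: one rare name misrecognized out of three words.
wer = word_error_rate("call tsendsuren now", "call sender now")

# Hypothetical WER values chosen only to illustrate a 12% relative gain.
gain = relative_wer_reduction(baseline_wer=0.10, new_wer=0.088)
```

Under these assumptions, a drop from 10.0% to 8.8% WER is a 12% relative improvement, even though the absolute difference is only 1.2 points.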
Related papers
- Contextual Adapters For Personalized Speech Recognition In Neural Transducers (2022)
- Towards Personalization Of CTC Speech Recognition Models With Contextual Adapters And Adaptive Boosting (2022)
- PROCTER: Pronunciation-aware Contextual Adapter For Personalized Speech Recognition In Neural Transducers (2023)
- Using External Off-policy Speech-to-text Mappings In Contextual End-to-end Automated Speech Recognition (2023)
- The Universal Personalizer: Few-shot Dysarthric Speech Recognition Via Meta-learning (2025)
- Instant One-shot Word-learning For Context-specific Neural Sequence-to-sequence Speech Recognition (2021)
- Attention-based Contextual Language Model Adaptation For Speech Recognition (2021)
- Multilingual Contextual Adapters To Improve Custom Word Recognition In Low-resource Languages (2023)