ED-CEC: Improving Rare Word Recognition Using ASR Postprocessing Based On Error Detection And Context-aware Error Correction
2023 · Jiajun He, Zekun Yang, Tomoki Toda
Abstract
Automatic speech recognition (ASR) systems often encounter difficulties in accurately recognizing rare words, leading to errors that can have a negative impact on downstream tasks such as keyword spotting, intent detection, and text summarization. To address this challenge, we present a novel ASR postprocessing method that focuses on improving the recognition of rare words through error detection and context-aware error correction. Our method optimizes the decoding process by targeting only the predicted error positions, minimizing unnecessary computations. Moreover, we leverage a rare word list to provide additional contextual knowledge, enabling the model to better correct rare words. Experimental results across five datasets demonstrate that our proposed method achieves significantly lower word error rates (WERs) than previous approaches while maintaining a reasonable inference speed. Furthermore, our approach exhibits promising robustness across different ASR systems.
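The abstract gives no implementation details, so the following is only a toy sketch of the two ideas it names: correct only the positions flagged as errors, and use a rare word list as extra knowledge. It is not the paper's model. The out-of-vocabulary detector, the string-similarity corrector, and every name in it (`detect_error_positions`, `correct_flagged`, `COMMON_VOCAB`, `RARE_WORDS`) are assumptions made for illustration. The `wer` function is the standard word-level edit distance divided by the reference length.

```python
from difflib import get_close_matches

# Hypothetical resources, invented for this illustration.
COMMON_VOCAB = {"please", "call", "doctor", "tomorrow"}
RARE_WORDS = ["ramanujan", "toda"]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the processed reference prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # deletion
                curr[j - 1] + 1,         # insertion
                prev[j - 1] + (r != h),  # substitution (or match, cost 0)
            ))
        prev = curr
    return prev[-1] / len(ref)


def detect_error_positions(words: list[str]) -> list[int]:
    """Toy detector (assumption): flag words found in neither word list."""
    known = COMMON_VOCAB | set(RARE_WORDS)
    return [i for i, w in enumerate(words) if w not in known]


def correct_flagged(words: list[str], positions: list[int]) -> list[str]:
    """Toy corrector (assumption): rewrite only the flagged positions,
    using the closest rare word by string similarity."""
    fixed = list(words)
    for i in positions:
        match = get_close_matches(words[i], RARE_WORDS, n=1, cutoff=0.6)
        if match:
            fixed[i] = match[0]
    return fixed


reference = "please call doctor ramanujan tomorrow"
hypothesis = "please call doctor ramanujam tomorrow"

hyp_words = hypothesis.split()
positions = detect_error_positions(hyp_words)
corrected = " ".join(correct_flagged(hyp_words, positions))

print(positions)                   # indices of flagged words
print(wer(reference, hypothesis))  # WER before correction
print(wer(reference, corrected))   # WER after correction
```

Unflagged words are never touched, so the work done grows with the number of flagged positions rather than the sentence length. This mirrors the motivation stated in the abstract without reproducing how the paper achieves it.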
Related papers
- Cross-modal ASR Post-processing System For Error Correction And Utterance Rejection (2022) · 0.00
- Improving Contextual Recognition Of Rare Words With An Alternate Spelling Prediction Model (2022) · 7.81
- GEC-RAG: Improving Generative Error Correction Via Retrieval-augmented Generation For Automatic Speech Recognition Systems (2025) · 0.00
- Multi-task Language Modeling For Improving Speech Recognition Of Rare Words (2020) · 8.35
- ASR Error Management For Improving Spoken Language Understanding (2017) · 9.92
- Improving Synthetic Data Training For Contextual Biasing Models With A Keyword-aware Cost Function (2025) · 0.00
- Improving Neural Biasing For Contextual Speech Recognition By Early Context Injection And Text Perturbation (2024) · 8.09
- Beyond Levenshtein: Leveraging Multiple Algorithms For Robust Word Error Rate Computations And Granular Error Classifications (2024) · 2.26