Exploring Methods For The Automatic Detection Of Errors In Manual Transcription
2019 · Xiaofei Wang, Jinyi Yang, Ruizhi Li, et al.
Abstract
Data quality plays an important role in most deep learning tasks. In the speech community, transcription of speech recordings is indispensable. Because transcriptions are usually produced manually, automatically finding errors in them not only saves time and labor but also benefits the performance of tasks that require training. Inspired by the success of hybrid automatic speech recognition, which uses both a language model and an acoustic model, this work explores two approaches to automatic error detection in transcriptions. We review a previous biased language model approach, which relies on a strong transcription-dependent language model. We then propose a novel acoustic model based approach that focuses on the phonetic sequence of speech. Both methods are evaluated on an entirely real dataset that was originally transcribed with errors and strictly corrected manually afterwards.
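The abstract does not say how disagreements between a recognizer and the manual transcription are located, so the following is only a generic sketch and not the authors' method. It assumes a recognizer hypothesis is already available as text and flags the word spans where it disagrees with the manual transcription, using a standard sequence alignment from Python's `difflib`. The function name `flag_suspect_regions` and the example sentences are hypothetical.

```python
from difflib import SequenceMatcher


def flag_suspect_regions(manual, hypothesis):
    """Return (tag, manual_words, hypothesis_words) for each disagreeing span.

    Assumption: both inputs are plain text, and whitespace tokenization
    with lowercasing is good enough for illustration.
    """
    ref = manual.lower().split()
    hyp = hypothesis.lower().split()
    # autojunk=False keeps frequent words from being treated as junk
    matcher = SequenceMatcher(a=ref, b=hyp, autojunk=False)
    flagged = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        # 'equal' spans agree; 'replace', 'delete', 'insert' are candidates
        if tag != "equal":
            flagged.append((tag, ref[i1:i2], hyp[j1:j2]))
    return flagged


if __name__ == "__main__":
    # Hypothetical example pair, not taken from the paper's dataset
    manual = "the quick brown fox jumps over the lazy dog"
    hypothesis = "the quick brown fox jumped over a lazy dog"
    for tag, ref_words, hyp_words in flag_suspect_regions(manual, hypothesis):
        print(tag, ref_words, hyp_words)
```

A flagged span only marks a disagreement. Whether the manual transcription or the recognizer is wrong still has to be decided by some other criterion, such as a confidence score or human review.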
Related papers
- Towards Better Decoding And Language Model Integration In Sequence To Sequence Models (2016)
- Confidence Estimation And Deletion Prediction Using Bidirectional Recurrent Neural Networks (2018)
- Improving Speech Recognition Error Prediction For Modern And Off-the-shelf Speech Recognizers (2024)
- Generating Human Readable Transcript For Automatic Speech Recognition With Pre-trained Language Model (2021)
- Spoken Term Detection Methods For Sparse Transcription In Very Low-resource Settings (2021)
- Unsupervised Domain Adaptation For Speech Recognition With Unsupervised Error Correction (2022)
- Beyond Voice Activity Detection: Hybrid Audio Segmentation For Direct Speech Translation (2021)
- The Impact Of Automatic Speech Transcription On Speaker Attribution (2025)