Improving Punctuation Restoration For Speech Transcripts Via External Data
2021 Β· Xue-Yong Fu, Cheng Chen, Md Tahmid Rahman Laskar, et al.
Abstract
Automatic Speech Recognition (ASR) systems generally do not produce punctuated transcripts. To make transcripts more readable and follow the expected input format for downstream language models, it is necessary to add punctuation marks. In this paper, we tackle the punctuation restoration problem specifically for the noisy text (e.g., phone conversation scenarios). To leverage the available written text datasets, we introduce a data sampling technique based on an n-gram language model to sample more training data that are similar to our in-domain data. Moreover, we propose a two-stage fine-tuning approach that utilizes the sampled external data as well as our in-domain dataset for models based on BERT. Extensive experiments show that the proposed approach outperforms the baseline with an improvement of 1:12% F1 score.
Authors
(none)
Tags
Stats
Related papers
- Punctuation Restoration In Spanish Customer Support Transcripts Using Transfer Learning (2022)2.26
- End To End ASR System With Automatic Punctuation Insertion (2020)0.00
- Efficient Ensemble For Multimodal Punctuation Restoration Using Time-delay Neural Network (2023)1.91
- Unified Multimodal Punctuation Restoration Framework For Mixed-modality Corpus (2022)7.16
- Streaming Punctuation: A Novel Punctuation Technique Leveraging Bidirectional Context For Continuous Speech Recognition (2023)4.52
- Generating Human Readable Transcript For Automatic Speech Recognition With Pre-trained Language Model (2021)0.00
- Improved Training For End-to-end Streaming Automatic Speech Recognition Model With Punctuation (2023)0.00
- Multimodal Semi-supervised Learning Framework For Punctuation Prediction In Conversational Speech (2020)9.59