Order-preserving Abstractive Summarization For Spoken Content Based On Connectionist Temporal Classification
2017 Β· Bo-Ru Lu, Frank Shyu, Yun-Nung Chen, et al.
Abstract
Connectionist temporal classification (CTC) is a powerful approach for sequence-to-sequence learning, and has been popularly used in speech recognition. The central ideas of CTC include adding a label "blank" during training. With this mechanism, CTC eliminates the need of segment alignment, and hence has been applied to various sequence-to-sequence learning problems. In this work, we applied CTC to abstractive summarization for spoken content. The "blank" in this case implies the corresponding input data are less important or noisy; thus it can be ignored. This approach was shown to outperform the existing methods in term of ROUGE scores over Chinese Gigaword and MATBN corpora. This approach also has the nice property that the ordering of words or characters in the input documents can be better preserved in the generated summaries.
Authors
(none)
Tags
Stats
Related papers
- Adding Connectionist Temporal Summarization Into Conformer To Improve Its Decoder Efficiency For Speech Recognition (2022)0.00
- Variational Connectionist Temporal Classification For Order-preserving Sequence Modeling (2023)5.24
- Knn-ctc: Enhancing ASR Via Retrieval Of CTC Pseudo Labels (2023)11.36
- Self-attention Networks For Connectionist Temporal Classification In Speech Recognition (2019)14.55
- A Study Of All-convolutional Encoders For Connectionist Temporal Classification (2017)5.84
- Speech Summarization Using Restricted Self-attention (2021)0.00
- Investigating The Reordering Capability In Ctc-based Non-autoregressive End-to-end Speech Translation (2021)0.00
- Leverage Unlabeled Data For Abstractive Speech Summarization With Self-supervised Learning And Back-summarization (2020)2.26