Unsupervised Broadcast News Summarization; A Comparative Study On Maximal Marginal Relevance (MMR) And Latent Semantic Analysis (LSA)
2023 Β· Majid Ramezani, Mohammad-Salar Shahryari, Amir-Reza Feizi-Derakhshi, et al.
Abstract
The methods of automatic speech summarization are classified into two groups: supervised and unsupervised methods. Supervised methods are based on a set of features, while unsupervised methods perform summarization based on a set of rules. Latent Semantic Analysis (LSA) and Maximal Marginal Relevance (MMR) are considered the most important and well-known unsupervised methods in automatic speech summarization. This study set out to investigate the performance of two aforementioned unsupervised methods in transcriptions of Persian broadcast news summarization. The results show that in generic summarization, LSA outperforms MMR, and in query-based summarization, MMR outperforms LSA in broadcast news summarization.
Authors
(none)
Tags
Stats
Related papers
- Team MTS @ Automin 2021: An Overview Of Existing Summarization Approaches And Comparison To Unsupervised Summarization Techniques (2024)0.00
- Speech Vs. Transcript: Does It Matter For Human Annotators In Speech Summarization? (2024)4.98
- Realizing Video Summarization From The Path Of Language-based Semantic Understanding (2024)0.00
- Sentence-wise Speech Summarization: Task, Datasets, And End-to-end Modeling With LM Knowledge Distillation (2024)5.84
- A Survey On Multi-modal Summarization (2021)12.93
- Prompting Large Language Models With Audio For General-purpose Speech Summarization (2024)6.34
- Augsumm: Towards Generalizable Speech Summarization Using Synthetic Labels From Large Language Model (2024)4.53
- Leverage Unlabeled Data For Abstractive Speech Summarization With Self-supervised Learning And Back-summarization (2020)2.26