The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric And Baselines
2022 Β· Gaofeng Cheng, Yifan Chen, Runyan Yang, et al.
Abstract
The conversation scenario is one of the most important and most challenging scenarios for speech processing technologies because people in conversation respond to each other in a casual style. Detecting the speech activities of each person in a conversation is vital to downstream tasks, like natural language processing, machine translation, etc. People refer to the detection technology of "who speak when" as speaker diarization (SD). Traditionally, diarization error rate (DER) has been used as the standard evaluation metric of SD systems for a long time. However, DER fails to give enough importance to short conversational phrases, which are short but important on the semantic level. Also, a carefully and accurately manually-annotated testing dataset suitable for evaluating the conversational SD technologies is still unavailable in the speech community. In this paper, we design and describe the Conversational Short-phrases Speaker Diarization (CSSD) task, which consists of training and
Authors
(none)
Tags
Stats
Related papers
- TSUP Speaker Diarization System For Conversational Short-phrase Speaker Diarization Challenge (2022)5.24
- Aligning Speakers: Evaluating And Visualizing Text-based Diarization Using Efficient Multiple Sequence Alignment (extended Version) (2023)0.00
- DISPLACE Challenge: Diarization Of Speaker And Language In Conversational Environments (2023)0.00
- Summary Of The DISPLACE Challenge 2023 - Diarization Of Speaker And Language In Conversational Environments (2023)0.00
- Sd-eval: A Benchmark Dataset For Spoken Dialogue Understanding Beyond Words (2024)11.32
- An Experimental Review Of Speaker Diarization Methods With Application To Two-speaker Conversational Telephone Speech Recordings (2023)8.35
- Exploring Speaker-related Information In Spoken Language Understanding For Better Speaker Diarization (2023)0.00
- The Second DIHARD Diarization Challenge: Dataset, Task, And Baselines (2019)15.00