PP-MeT: A Real-world Personalized Prompt Based Meeting Transcription System
2023 · Xiang Lyu, Yuhang Cao, Qing Wang, et al.
Abstract
Speaker-attributed automatic speech recognition (SA-ASR) improves the accuracy and applicability of multi-speaker ASR systems in real-world scenarios by assigning speaker labels to transcribed text. However, SA-ASR poses unique challenges due to factors such as speaker overlap, speaker variability, background noise, and reverberation. In this study, we propose the PP-MeT system, a real-world personalized prompt based meeting transcription system, which consists of a clustering system, target-speaker voice activity detection (TS-VAD), and target-speaker ASR (TS-ASR). Specifically, we use the target-speaker embedding as a prompt in the TS-VAD and TS-ASR modules of the proposed system. In contrast with previous systems, we fully leverage pre-trained models for system initialization, which gives our approach better generalizability and precision. Experiments on the M2MeT2.0 Challenge dataset show that our system achieves a cp-CER of 11.27% on the test set, ranking first in both fixed and open training cond
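The abstract reports results as cp-CER without defining it. The following is a minimal sketch of the metric as it is commonly described (concatenated minimum-permutation character error rate): each speaker's utterances are concatenated, every assignment of hypothesis speakers to reference speakers is tried, and the assignment with the fewest character edits is scored. The function names `edit_distance` and `cp_cer` are hypothetical, text normalization is omitted, and this is not the challenge's official scoring script.

```python
from itertools import permutations


def edit_distance(ref: str, hyp: str) -> int:
    """Character-level Levenshtein distance between ref and hyp."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (r != h)))  # substitution or match
        prev = curr
    return prev[-1]


def cp_cer(ref_by_speaker, hyp_by_speaker) -> float:
    """Assumed definition: minimum over speaker permutations of the total
    edit distance, divided by the total number of reference characters.

    Each argument is a list with one entry per speaker; each entry is that
    speaker's list of utterance strings in time order.
    """
    refs = ["".join(utts) for utts in ref_by_speaker]
    hyps = ["".join(utts) for utts in hyp_by_speaker]

    # Pad the shorter side with empty speakers so every permutation is a
    # one-to-one assignment, even when the speaker counts differ.
    n = max(len(refs), len(hyps))
    refs += [""] * (n - len(refs))
    hyps += [""] * (n - len(hyps))

    total_ref_chars = sum(len(r) for r in refs)
    if total_ref_chars == 0:
        raise ValueError("reference contains no characters")

    # Brute force over permutations: fine for a handful of speakers,
    # factorial cost beyond that.
    best = min(
        sum(edit_distance(r, h) for r, h in zip(refs, perm))
        for perm in permutations(hyps)
    )
    return best / total_ref_chars


if __name__ == "__main__":
    reference = [["ab", "cd"], ["xyz"]]
    hypothesis = [["xyz"], ["ab", "ce"]]  # speaker order swapped, one error
    print(f"cp-CER: {cp_cer(reference, hypothesis):.4f}")
```

Because the minimum is taken over speaker permutations, the score does not depend on the order in which the system labels speakers, but text attributed to the wrong speaker still counts as errors.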
Related papers
- The Second Multi-channel Multi-party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark For Speaker-attributed ASR (2023)
- The USTC-Ximalaya System For The ICASSP 2022 Multi-channel Multi-party Meeting Transcription (M2MeT) Challenge (2022)
- Summary On The ICASSP 2022 Multi-channel Multi-party Meeting Transcription Grand Challenge (2022)
- Improving Speaker Assignment In Speaker-attributed ASR For Real Meeting Applications (2024)
- The Volcspeech System For The ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (2022)
- META-CAT: Speaker-informed Speech Embeddings Via Meta Information Concatenation For Multi-talker ASR (2024)
- The Xmuspeech System For Multi-channel Multi-party Meeting Transcription Challenge (2022)
- A Comparative Study On Speaker-attributed Automatic Speech Recognition In Multi-party Meetings (2022)