PB-LRDWWS System For The SLT 2024 Low-resource Dysarthria Wake-up Word Spotting Challenge
2024 Β· Shiyao Wang, Jiaming Zhou, Shiwan Zhao, et al.
Abstract
For the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting (LRDWWS) Challenge, we introduce the PB-LRDWWS system. This system combines a dysarthric speech content feature extractor for prototype construction with a prototype-based classification method. The feature extractor is a fine-tuned HuBERT model obtained through a three-stage fine-tuning process using cross-entropy loss. This fine-tuned HuBERT extracts features from the target dysarthric speaker's enrollment speech to build prototypes. Classification is achieved by calculating the cosine similarity between the HuBERT features of the target dysarthric speaker's evaluation speech and prototypes. Despite its simplicity, our method demonstrates effectiveness through experimental results. Our system achieves second place in the final Test-B of the LRDWWS Challenge.
Authors
(none)
Tags
Stats
Related papers
- Optimizing Dysarthria Wake-up Word Spotting: An End-to-end Approach For SLT 2024 LRDWWS Challenge (2024)2.26
- Enhancing Dysarthric Speech Recognition For Unseen Speakers Via Prototype-based Adaptation (2024)9.45
- Taltech-irit-lis Speaker And Language Diarization Systems For DISPLACE 2024 (2024)4.52
- Learning To Detect Dysarthria From Raw Speech (2018)11.85
- Using Speech Technology For Quantifying Behavioral Characteristics In Peer-led Team Learning Sessions (2017)7.50
- Bbs-kws:the Mandarin Keyword Spotting System Won The Video Keyword Wakeup Challenge (2021)0.00
- BERT-LID: Leveraging BERT To Improve Spoken Language Identification (2022)8.09
- The Zero Resource Speech Challenge 2020: Discovering Discrete Subword And Word Units (2020)11.58