Detecting Spoof Voices In Asian Non-native Speech: An Indonesian And Thai Case Study
2024 Β· Aulia Adila, Candy Olivia Mawalim, Masashi Unoki
Abstract
This study focuses on building effective spoofing countermeasures (CMs) for non-native speech, specifically targeting Indonesian and Thai speakers. We constructed a dataset comprising both native and non-native speech to facilitate our research. Three key features (MFCC, LFCC, and CQCC) were extracted from the speech data, and three classic machine learning-based classifiers (CatBoost, XGBoost, and GMM) were employed to develop robust spoofing detection systems using the native and combined (native and non-native) speech data. This resulted in two types of CMs: Native and Combined. The performance of these CMs was evaluated on both native and non-native speech datasets. Our findings reveal significant challenges faced by Native CM in handling non-native speech, highlighting the necessity for domain-specific solutions. The proposed method shows improved detection capabilities, demonstrating the importance of incorporating non-native speech data into the training process. This work lays
Authors
(none)
Tags
Stats
Related papers
- An Empirical Study On Channel Effects For Synthetic Voice Spoofing Countermeasure Systems (2021)9.92
- A Comparative Study On Recent Neural Spoofing Countermeasures For Synthetic Speech Detection (2021)0.00
- Spoofed Training Data For Speech Spoofing Countermeasure Can Be Efficiently Created Using Neural Vocoders (2022)11.93
- Audio-replay Attack Detection Countermeasures (2017)6.34
- One-class Learning Towards Synthetic Voice Spoofing Detection (2020)17.31
- The Partialspoof Database And Countermeasures For The Detection Of Short Fake Speech Segments Embedded In An Utterance (2022)14.06
- Toward Improving Synthetic Audio Spoofing Detection Robustness Via Meta-learning And Disentangled Training With Adversarial Examples (2024)6.77
- Spoof Detection Using Time-delay Shallow Neural Network And Feature Switching (2019)8.35