Spoof Detection Using Time-delay Shallow Neural Network And Feature Switching
2019 Β· Mari Ganesh Kumar, Suvidha Rupesh Kumar, Saranya M, et al.
Abstract
Detecting spoofed utterances is a fundamental problem in voice-based biometrics. Spoofing can be performed either by logical accesses like speech synthesis, voice conversion or by physical accesses such as replaying the pre-recorded utterance. Inspired by the state-of-the-art *x*-vector based speaker verification approach, this paper proposes a time-delay shallow neural network (TD-SNN) for spoof detection for both logical and physical access. The novelty of the proposed TD-SNN system vis-a-vis conventional DNN systems is that it can handle variable length utterances during testing. Performance of the proposed TD-SNN systems and the baseline Gaussian mixture models (GMMs) is analyzed on the ASV-spoof-2019 dataset. The performance of the systems is measured in terms of the minimum normalized tandem detection cost function (min-t-DCF). When studied with individual features, the TD-SNN system consistently outperforms the GMM system for physical access. For logical access, GMM surpasses TD
Authors
(none)
Tags
Stats
Related papers
- Deep Residual Neural Networks For Audio Spoofing Detection (2019)0.00
- A Comparative Study On Recent Neural Spoofing Countermeasures For Synthetic Speech Detection (2021)0.00
- Spoofing Speaker Verification Systems With Deep Multi-speaker Text-to-speech Synthesis (2019)0.00
- Securing Voice Biometrics: One-shot Learning Approach For Audio Deepfake Detection (2023)9.03
- Spoofed Training Data For Speech Spoofing Countermeasure Can Be Efficiently Created Using Neural Vocoders (2022)11.93
- Spoofing-robust Speaker Verification Using Parallel Embedding Fusion: BTU Speech Group's Approach For Asvspoof5 Challenge (2024)5.24
- Experimental Study: Enhancing Voice Spoofing Detection Models With Wav2vec 2.0 (2024)0.00
- Audio-replay Attack Detection Countermeasures (2017)6.34