Transforming Acoustic Characteristics To Deceive Playback Spoofing Countermeasures Of Speaker Verification Systems
2018 · Fuming Fang, Junichi Yamagishi, Isao Echizen, et al.
Abstract
Automatic speaker verification (ASV) systems use a playback detector to filter out playback attacks and ensure verification reliability. Since current playback detection models are almost always trained using genuine and played-back speech, it may be possible to degrade their performance by transforming the acoustic characteristics of the played-back speech close to that of the genuine speech. One way to do this is to enhance speech "stolen" from the target speaker before playback. We tested the effectiveness of a playback attack using this method by using the speech enhancement generative adversarial network to transform acoustic characteristics. Experimental results showed that use of this "enhanced stolen speech" method significantly increases the equal error rates for the baseline used in the ASVspoof 2017 challenge and for a light convolutional neural network-based method. The results also showed that its use degrades the performance of a Gaussian mixture model-universal backgroun
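The abstract reports its results as equal error rates (EER), the operating point at which a detector's false acceptance rate equals its false rejection rate. As a minimal sketch of how that metric behaves, the snippet below approximates the EER by sweeping every observed score as a threshold. The function name `equal_error_rate` and all scores are hypothetical toy values chosen for illustration, not data or code from the paper, and the sketch assumes higher scores mean "more likely genuine".

```python
def equal_error_rate(genuine_scores, spoof_scores):
    """Approximate the EER by trying every observed score as a threshold.

    Assumption: a higher score means the detector believes the
    utterance is genuine (not played back).
    """
    thresholds = sorted(set(genuine_scores) | set(spoof_scores))
    best_gap, eer = None, None
    for t in thresholds:
        # False acceptance: spoofed speech scored at or above the threshold.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        # False rejection: genuine speech scored below the threshold.
        frr = sum(g < t for g in genuine_scores) / len(genuine_scores)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Toy detector scores (hypothetical values, for illustration only).
genuine = [0.9, 0.8, 0.7, 0.6]
plain_playback = [0.1, 0.2, 0.3, 0.65]
enhanced_playback = [0.5, 0.65, 0.75, 0.85]  # scores pushed toward the genuine range

eer_plain = equal_error_rate(genuine, plain_playback)
eer_enhanced = equal_error_rate(genuine, enhanced_playback)
print(eer_plain, eer_enhanced)
```

In this toy setup the attack scores that sit closer to the genuine range yield the higher EER, which is the kind of degradation the abstract describes for enhanced stolen speech.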
Related papers
- Audio-replay Attack Detection Countermeasures (2017)
- Toward Improving Synthetic Audio Spoofing Detection Robustness Via Meta-learning And Disentangled Training With Adversarial Examples (2024)
- Anti-spoofing Methods For Automatic Speaker Verification System (2017)
- Replay Spoofing Countermeasure Using Autoencoder And Siamese Network On Asvspoof 2019 Challenge (2019)
- One-class Learning Towards Synthetic Voice Spoofing Detection (2020)
- Deep Generative Variational Autoencoding For Replay Spoof Detection In Automatic Speaker Verification (2020)
- Generalizing Speaker Verification For Spoof Awareness In The Embedding Space (2024)
- Automatic Speaker Verification Spoofing And Deepfake Detection Using Wav2vec 2.0 And Data Augmentation (2022)