A Study On Convolutional Neural Network Based End-to-end Replay Anti-spoofing
2018 · Bhusan Chettri, Saumitra Mishra, Bob L. Sturm, et al.
Abstract
The second Automatic Speaker Verification Spoofing and Countermeasures challenge (ASVspoof 2017) focused on "replay attack" detection. The best deep-learning systems competing in ASVspoof 2017 used Convolutional Neural Networks (CNNs) as feature extractors. In this paper, we study their performance in an end-to-end setting. We find that these architectures generalize poorly to the evaluation dataset, but we identify a compact architecture that generalizes well on the development data. We demonstrate that for this dataset it is not easy to obtain a similar level of generalization on both the development and evaluation data. This raises several open questions: what the differences in the data are; why they are more evident in an end-to-end setting; and how these issues can be overcome by increasing the training data.
Related papers
- Replay Spoofing Countermeasure Using Autoencoder And Siamese Network On ASVspoof 2019 Challenge (2019)
- Audio-replay Attack Detection Countermeasures (2017)
- Deep Residual Neural Networks For Audio Spoofing Detection (2019)
- A Comparative Study On Recent Neural Spoofing Countermeasures For Synthetic Speech Detection (2021)
- Replay Attack Detection With Complementary High-resolution Information Using End-to-end DNN For The ASVspoof 2019 Challenge (2019)
- Deep Generative Variational Autoencoding For Replay Spoof Detection In Automatic Speaker Verification (2020)
- Automatic Speaker Verification Spoofing And Deepfake Detection Using Wav2vec 2.0 And Data Augmentation (2022)
- The DKU Replay Detection System For The ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, And Fusion (2019)