Zero-day Audio Deepfake Detection Via Retrieval Augmentation And Profile Matching
2025 Β· Xuechen Liu, Xin Wang, Junichi Yamagishi
Abstract
Modern audio deepfake detectors built on foundation models and large training datasets achieve promising detection performance. However, they struggle with zero-day attacks, where the audio samples are generated by novel synthesis methods that models have not seen from reigning training data. Conventional approaches fine-tune the detector, which can be problematic when prompt response is needed. This paper proposes a training-free retrieval-augmented framework for zero-day audio deepfake detection that leverages knowledge representations and voice profile matching. Within this framework, we propose simple yet effective retrieval and ensemble methods that reach performance comparable to supervised baselines and their fine-tuned counterparts on the DeepFake-Eval-2024 benchmark, without any additional model training. We also conduct ablation on voice profile attributes, and demonstrate the cross-database generalizability of the framework with introducing simple and training-free fusion st
Authors
(none)
Tags
Stats
Related papers
- Training-free Deepfake Voice Recognition By Leveraging Large-scale Pre-trained Models (2024)9.23
- Towards Robust Audio Deepfake Detection: A Evolving Benchmark For Continual Learning (2024)0.00
- Adversarial Attacks On Audio Deepfake Detection: A Benchmark And Comparative Study (2025)0.00
- Anomaly Detection And Localization For Speech Deepfakes Via Feature Pyramid Matching (2025)4.52
- Securing Voice Biometrics: One-shot Learning Approach For Audio Deepfake Detection (2023)9.03
- Pitch Imperfect: Detecting Audio Deepfakes Through Acoustic Prosodic Analysis (2025)0.00
- Adaptive Re-calibration Of Channel-wise Features For Adversarial Audio Classification (2022)0.00
- ERF-BA-TFD+: A Multimodal Model For Audio-visual Deepfake Detection (2025)2.26