The Codecfake Dataset And Countermeasures For The Universally Detection Of Deepfake Audio
2024 · Yuankun Xie, Yi Lu, Ruibo Fu, et al.
Abstract
With the proliferation of Audio Language Model (ALM) based deepfake audio, there is an urgent need for generalized detection methods. ALM-based deepfake audio is currently widespread, highly deceptive, and versatile in type, posing a significant challenge to current audio deepfake detection (ADD) models trained solely on vocoded data. To effectively detect ALM-based deepfake audio, we focus on the mechanism of the ALM-based audio generation method, the conversion from neural codec to waveform. We first constructed the Codecfake dataset, an open-source, large-scale collection comprising over 1 million audio samples in both English and Chinese, focused on ALM-based audio detection. As a countermeasure, to achieve universal detection of deepfake audio and tackle the domain ascent bias issue of the original sharpness aware minimization (SAM), we propose the CSAM strategy to learn a domain-balanced and generalized minimum. In our experiments, we first demonstrate that ADD model training with the
Related papers
- Adversarial Attacks On Audio Deepfake Detection: A Benchmark And Comparative Study (2025)
- What To Remember: Self-adaptive Continual Learning For Audio Deepfake Detection (2023)
- AUDETER: A Large-scale Dataset For Deepfake Audio Detection In Open Worlds (2025)
- MFAAN: Unveiling Audio Deepfakes With A Multi-feature Authenticity Network (2023)
- MLAAD: The Multi-language Audio Anti-spoofing Dataset (2024)
- SLIM: Style-linguistics Mismatch Model For Generalized Audio Deepfake Detection (2024)
- Benchmarking Audio Deepfake Detection Robustness In Real-world Communication Scenarios (2025)
- Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm With Real Emphasis And Fake Dispersion Strategy (2024)