Safespeech: Robust And Universal Voice Protection Against Malicious Speech Synthesis
2025 Β· Zhisheng Zhang, Derui Wang, Qianyi Yang, et al.
Abstract
Speech synthesis technology has brought great convenience, while the widespread usage of realistic deepfake audio has triggered hazards. Malicious adversaries may unauthorizedly collect victims' speeches and clone a similar voice for illegal exploitation (\textit\{e.g.\}, telecom fraud). However, the existing defense methods cannot effectively prevent deepfake exploitation and are vulnerable to robust training techniques. Therefore, a more effective and robust data protection method is urgently needed. In response, we propose a defensive framework, \textit\{\textbf\{SafeSpeech\}\}, which protects the users' audio before uploading by embedding imperceptible perturbations on original speeches to prevent high-quality synthetic speech. In SafeSpeech, we devise a robust and universal proactive protection technique, \textbf\{S\}peech \textbf\{PE\}rturbative \textbf\{C\}oncealment (\textbf\{SPEC\}), that leverages a surrogate model to generate universally applicable perturbation for generativ
Authors
(none)
Tags
Stats
Related papers
- Vsmask: Defending Against Voice Synthesis Attack Via Real-time Predictive Perturbation (2023)7.81
- Adversarial Speech For Voice Privacy Protection From Personalized Speech Generation (2024)8.09
- Combining Automatic Speaker Verification And Prosody Analysis For Synthetic Speech Detection (2022)10.48
- Defense Against Synthetic Speech: Real-time Detection Of RVC Voice Conversion Attacks (2025)0.00
- Securing Voice-driven Interfaces Against Fake (cloned) Audio Attacks (2019)9.92
- Securing Voice Biometrics: One-shot Learning Approach For Audio Deepfake Detection (2023)9.03
- One-class Learning Towards Synthetic Voice Spoofing Detection (2020)17.31
- Collaborative Watermarking For Adversarial Speech Synthesis (2023)0.00