Collaborative Watermarking For Adversarial Speech Synthesis
2023 · Lauri Juvela, Xin Wang
Abstract
Advances in neural speech synthesis have brought us technology that is not only close to human naturalness, but is also capable of instant voice cloning with little data, and is highly accessible with pre-trained models available. Naturally, the potential flood of generated content raises the need for synthetic speech detection and watermarking. Recently, considerable research effort in synthetic speech detection has been related to the Automatic Speaker Verification and Spoofing Countermeasure Challenge (ASVspoof), which focuses on passive countermeasures. This paper takes a complementary view to generated speech detection: a synthesis system should make an active effort to watermark the generated speech in a way that aids detection by another machine, but remains transparent to a human listener. We propose a collaborative training scheme for synthetic speech watermarking and show that a HiFi-GAN neural vocoder collaborating with the ASVspoof 2021 baseline countermeasure models consis
Related papers
- Audio Codec Augmentation For Robust Collaborative Watermarking Of Speech Synthesis (2024)
- Toward Improving Synthetic Audio Spoofing Detection Robustness Via Meta-learning And Disentangled Training With Adversarial Examples (2024)
- Deep Residual Neural Networks For Audio Spoofing Detection (2019)
- Wmcodec: End-to-end Neural Speech Codec With Deep Watermarking For Authenticity Verification (2024)
- A Comparative Study On Recent Neural Spoofing Countermeasures For Synthetic Speech Detection (2021)
- GROOT: Generating Robust Watermark For Diffusion-model-based Audio Synthesis (2024)
- Spoofed Training Data For Speech Spoofing Countermeasure Can Be Efficiently Created Using Neural Vocoders (2022)
- SOLIDO: A Robust Watermarking Method For Speech Synthesis Via Low-rank Adaptation (2025)