Self-supervised Speech Representations Preserve Speech Characteristics While Anonymizing Voices
2022 · Abner Hernandez, Paula Andrea Pérez-Toro, Juan Camilo Vásquez-Correa, et al.
Abstract
Collecting speech data is an important step in training speech recognition systems and other speech-based machine learning models. However, the issue of privacy protection is an increasing concern that must be addressed. The current study investigates the use of voice conversion as a method for anonymizing voices. In particular, we train several voice conversion models using self-supervised speech representations including Wav2Vec2.0, Hubert and UniSpeech. Converted voices retain a low word error rate within 1% of the original voice. Equal error rate increases from 1.52% to 46.24% on the LibriSpeech test set and from 3.75% to 45.84% on speakers from the VCTK corpus which signifies degraded performance on speaker verification. Lastly, we conduct experiments on dysarthric speech data to show that speech features relevant to articulation, prosody, phonation and phonology can be extracted from anonymized voices for discriminating between healthy and pathological speech.
Authors
(none)
Tags
Stats
Related papers
- Improving Voice Quality In Speech Anonymization With Just Perception-informed Losses (2024)0.00
- Anonymising Elderly And Pathological Speech: Voice Conversion Using DDSP And Query-by-example (2024)4.52
- Reprogramming Self-supervised Learning-based Speech Representations For Speaker Anonymization (2023)2.26
- A Speech Representation Anonymization Framework Via Selective Noise Perturbation (2022)6.34
- Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding (2024)7.16
- Preserving Spoken Content In Voice Anonymisation With Character-level Vocoder Conditioning (2024)3.58
- Adversarial Speaker Disentanglement Using Unannotated External Data For Self-supervised Representation Based Voice Conversion (2023)6.34
- Language-independent Speaker Anonymization Approach Using Self-supervised Pre-trained Models (2022)9.92