Phase Perturbation Improves Channel Robustness For Speech Spoofing Countermeasures
2023 Β· Yongyi Zang, You Zhang, Zhiyao Duan
Abstract
In this paper, we aim to address the problem of channel robustness in speech countermeasure (CM) systems, which are used to distinguish synthetic speech from human natural speech. On the basis of two hypotheses, we suggest an approach for perturbing phase information during the training of time-domain CM systems. Communication networks often employ lossy compression codec that encodes only magnitude information, therefore heavily altering phase information. Also, state-of-the-art CM systems rely on phase information to identify spoofed speech. Thus, we believe the information loss in the phase domain induced by lossy compression codec degrades the performance of the unseen channel. We first establish the dependence of time-domain CM systems on phase information by perturbing phase in evaluation, showing strong degradation. Then, we demonstrated that perturbing phase during training leads to a significant performance improvement, whereas perturbing magnitude leads to further degradation
Authors
(none)
Tags
Stats
Related papers
- An Empirical Study On Channel Effects For Synthetic Voice Spoofing Countermeasure Systems (2021)9.92
- Phaseperturbation: Speech Data Augmentation Via Phase Perturbation For Automatic Speech Recognition (2023)0.00
- The Partialspoof Database And Countermeasures For The Detection Of Short Fake Speech Segments Embedded In An Utterance (2022)14.06
- Universal Adversarial Perturbations For Speech Recognition Systems (2019)14.11
- Speech Enhancement In Adverse Environments Based On Non-stationary Noise-driven Spectral Subtraction And Snr-dependent Phase Compensation (2018)0.00
- Phase Continuity: Learning Derivatives Of Phase Spectrum For Speech Enhancement (2022)6.77
- Phase-aware Single-channel Speech Enhancement With Modulation-domain Kalman Filtering (2017)0.00
- A Spoofing Benchmark For The 2018 Voice Conversion Challenge: Leveraging From Spoofing Countermeasures For Speech Artifact Assessment (2018)8.09