Betray Oneself: A Novel Audio Deepfake Detection Model Via Mono-to-stereo Conversion
2023 · Rui Liu, Jinhua Zhang, Guanglai Gao, et al.
Abstract
Audio Deepfake Detection (ADD) is an emerging topic that aims to detect fake audio generated by text-to-speech (TTS), voice conversion (VC), replay, etc. Traditionally, ADD models take the mono signal as input and focus on robust feature extraction and effective classifier design. However, the dual-channel stereo information in the audio signal also includes important cues for deepfake detection, which have not been studied in prior work. In this paper, we propose a novel ADD model, termed M2S-ADD, that attempts to discover audio authenticity cues during the mono-to-stereo conversion process. We first project the mono signal to a stereo signal using a pretrained stereo synthesizer, then employ a dual-branch neural architecture to process the left and right channel signals, respectively. In this way, we effectively reveal the artifacts in the fake audio, thus improving ADD performance. The experiments on the ASVspoof2019 database show that M2S-ADD outperforms all baselines that take mono input. We
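The data flow described in the abstract (mono input, stereo synthesis, one branch per channel, fused decision) can be illustrated with a minimal sketch. This is not the authors' implementation: `toy_stereo_synthesizer`, `branch_features`, and `m2s_score` are hypothetical names, the delay-and-gain "synthesizer" merely stands in for the pretrained stereo synthesizer, and the hand-crafted features stand in for the neural branches. The weights are untrained placeholders.

```python
import math
from typing import Callable, List, Optional, Tuple

Signal = List[float]


def toy_stereo_synthesizer(mono: Signal, delay: int = 2,
                           gain: float = 0.8) -> Tuple[Signal, Signal]:
    """Hypothetical stand-in for a pretrained mono-to-stereo synthesizer.

    Left is the mono signal unchanged; right is a delayed, attenuated copy,
    truncated to the original length.
    """
    left = list(mono)
    right = ([0.0] * delay + [gain * x for x in mono])[: len(mono)]
    return left, right


def branch_features(channel: Signal) -> List[float]:
    """Placeholder for one branch: mean energy and zero-crossing rate."""
    n = max(len(channel), 1)
    energy = sum(x * x for x in channel) / n
    crossings = sum(1 for a, b in zip(channel, channel[1:])
                    if (a < 0) != (b < 0))
    return [energy, crossings / n]


def m2s_score(mono: Signal,
              synthesizer: Callable[[Signal], Tuple[Signal, Signal]] = toy_stereo_synthesizer,
              weights: Optional[List[float]] = None,
              bias: float = 0.0) -> float:
    """Return a score in (0, 1) from the fused left/right branch features."""
    left, right = synthesizer(mono)
    # Each channel is processed by its own branch, then the results are fused.
    fused = branch_features(left) + branch_features(right)
    if weights is None:
        weights = [0.0] * len(fused)  # untrained placeholder classifier
    logit = bias + sum(w * f for w, f in zip(weights, fused))
    return 1.0 / (1.0 + math.exp(-logit))


if __name__ == "__main__":
    mono = [math.sin(0.1 * i) for i in range(100)]
    print(m2s_score(mono, weights=[1.0, 0.0, -1.0, 0.0]))
```

In a real system, the synthesizer and both branches would be learned models and the fused features would feed a trained classifier; the sketch only fixes the order of operations.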
Related papers
- Heterogeneity Over Homogeneity: Investigating Multilingual Speech Pre-trained Models For Detecting Audio Deepfake (2024)
- TranssionADD: A Multi-frame Reinforcement Based Sequence Tagging Model For Audio Deepfake Detection (2023)
- Self-attention And Hybrid Features For Replay And Deep-fake Audio Detection (2024)
- The Vicomtech Audio Deepfake Detection System Based On Wav2vec2 For The 2022 ADD Challenge (2022)
- MFAAN: Unveiling Audio Deepfakes With A Multi-feature Authenticity Network (2023)
- SLIM: Style-linguistics Mismatch Model For Generalized Audio Deepfake Detection (2024)
- HM-Conformer: A Conformer-based Audio Deepfake Detection System With Hierarchical Pooling And Multi-level Classification Token Aggregation Methods (2023)
- Adversarial Attacks On Audio Deepfake Detection: A Benchmark And Comparative Study (2025)