SONICS: Synthetic Or Not -- Identifying Counterfeit Songs
2024 Β· Md Awsafur Rahman, Zaber Ibn Abdul Hakim, Najibul Haque Sarker, et al.
Abstract
The recent surge in AI-generated songs presents exciting possibilities and challenges. These innovations necessitate the ability to distinguish between human-composed and synthetic songs to safeguard artistic integrity and protect human musical artistry. Existing research and datasets in fake song detection only focus on singing voice deepfake detection (SVDD), where the vocals are AI-generated but the instrumental music is sourced from real songs. However, these approaches are inadequate for detecting contemporary end-to-end artificial songs where all components (vocals, music, lyrics, and style) could be AI-generated. Additionally, existing datasets lack music-lyrics diversity, long-duration songs, and open-access fake songs. To address these gaps, we introduce SONICS, a novel dataset for end-to-end Synthetic Song Detection (SSD), comprising over 97k songs (4,751 hours) with over 49k synthetic songs from popular platforms like Suno and Udio. Furthermore, we highlight the importance o
Authors
(none)
Tags
Stats
Related papers
- Singfake: Singing Voice Deepfake Detection (2023)11.93
- Detection Of Ai-synthesized Speech Using Cepstral & Bispectral Statistics (2020)0.00
- AUDETER: A Large-scale Dataset For Deepfake Audio Detection In Open Worlds (2025)0.00
- Syn-att: Synthetic Speech Attribution Via Semi-supervised Unknown Multi-class Ensemble Of Cnns (2023)0.00
- The Sound Of Silence: Efficiency Of First Digit Features In Synthetic Audio Detection (2022)7.50
- Ctrsvdd: A Benchmark Dataset And Baseline Analysis For Controlled Singing Voice Deepfake Detection (2024)0.00
- Fakemusiccaps: A Dataset For Detection And Attribution Of Synthetic Music Generated Via Text-to-music Models (2024)0.00
- Adversarial Attacks On Audio Deepfake Detection: A Benchmark And Comparative Study (2025)0.00