Fakemusiccaps: A Dataset For Detection And Attribution Of Synthetic Music Generated Via Text-to-music Models
2024 Β· Luca Comanducci, Paolo Bestagini, Stefano Tubaro
Abstract
Text-To-Music (TTM) models have recently revolutionized the automatic music generation research field. Specifically, by reaching superior performances to all previous state-of-the-art models and by lowering the technical proficiency needed to use them. Due to these reasons, they have readily started to be adopted for commercial uses and music production practices. This widespread diffusion of TTMs poses several concerns regarding copyright violation and rightful attribution, posing the need of serious consideration of them by the audio forensics community. In this paper, we tackle the problem of detection and attribution of TTM-generated data. We propose a dataset, FakeMusicCaps that contains several versions of the music-caption pairs dataset MusicCaps re-generated via several state-of-the-art TTM techniques. We evaluate the proposed dataset by performing initial experiments regarding the detection and attribution of TTM-generated audio.
Authors
(none)
Tags
Stats
Related papers
- Muscaps: Generating Captions For Music Audio (2021)9.59
- Quality-aware Masked Diffusion Transformer For Enhanced Music Generation (2024)5.60
- Musictm-dataset For Joint Representation Learning Among Sheet Music, Lyrics, And Musical Audio (2020)3.58
- SONICS: Synthetic Or Not -- Identifying Counterfeit Songs (2024)0.00
- Annotation-free Automatic Music Transcription With Scalable Synthetic Data And Adversarial Domain Confusion (2023)4.52
- Mustango: Toward Controllable Text-to-music Generation (2023)11.67
- Exploring Compressibility Of Transformer Based Text-to-music (TTM) Models (2024)0.00
- Lightweight Model Attribution And Detection Of Synthetic Speech Via Audio Residual Fingerprints (2024)0.00