Music4All A+A: A Multimodal Dataset For Music Information Retrieval Tasks
2025 · Jonas Geiger, Marta Moscati, Shah Nawaz, et al.
Abstract
Music is characterized by aspects related to different modalities, such as the audio signal, the lyrics, or the music video clips. This has motivated the development of multimodal datasets and methods for Music Information Retrieval (MIR) tasks such as genre classification or autotagging. Music can be described at different levels of granularity, for instance defining genres at the level of artists or music albums. However, most datasets for multimodal MIR neglect this aspect and provide data at the level of individual music tracks. We aim to fill this gap by providing Music4All Artist and Album (Music4All A+A), a dataset for multimodal MIR tasks based on music artists and albums. Music4All A+A is built on top of the Music4All-Onion dataset, an existing track-level dataset for MIR tasks. Music4All A+A provides metadata, genre labels, image representations, and textual descriptors for 6,741 artists and 19,511 albums. Furthermore, since Music4All A+A is built on top of Music4All-Onion, i
Related papers
- MusicTM-Dataset For Joint Representation Learning Among Sheet Music, Lyrics, And Musical Audio (2020) · 3.58
- WikiMuTe: A Web-sourced Dataset Of Semantic Descriptions For Music Audio (2023) · 5.24
- Cross-modal Music Retrieval And Applications: An Overview Of Key Methodologies (2019) · 12.68
- Incompebench: A Permissively Licensed, Fine-grained Benchmark For Music Information Retrieval (2026) · 0.00
- Artistmus: A Globally Diverse, Artist-centric Benchmark For Retrieval-augmented Music Question Answering (2025) · 0.00
- Exploring Modality-agnostic Representations For Music Classification (2021) · 0.00
- Contrastive Learning For Cross-modal Artist Retrieval (2023) · 0.00
- Multimodal Metric Learning For Tag-based Music Retrieval (2020) · 9.76