Towards Robust And Truly Large-scale Audio-sheet Music Retrieval
2023 Β· Luis Carvalho, Gerhard Widmer
Abstract
A range of applications of multi-modal music information retrieval is centred around the problem of connecting large collections of sheet music (images) to corresponding audio recordings, that is, identifying pairs of audio and score excerpts that refer to the same musical content. One of the typical and most recent approaches to this task employs cross-modal deep learning architectures to learn joint embedding spaces that link the two distinct modalities - audio and sheet music images. While there has been steady improvement on this front over the past years, a number of open problems still prevent large-scale employment of this methodology. In this article we attempt to provide an insightful examination of the current developments on audio-sheet music retrieval via deep learning methods. We first identify a set of main challenges on the road towards robust and large-scale cross-modal music retrieval in real scenarios. We then highlight the steps we have taken so far to address some o
Authors
(none)
Tags
Stats
Related papers
- Self-supervised Contrastive Learning For Robust Audio-sheet Music Retrieval Systems (2023)5.24
- Cross-modal Music Retrieval And Applications: An Overview Of Key Methodologies (2019)12.68
- Passage Summarization With Recurrent Models For Audio-sheet Music Retrieval (2023)0.00
- Towards End-to-end Audio-sheet-music Retrieval (2016)0.00
- Learning Soft-attention Models For Tempo-invariant Audio-sheet Music Retrieval (2019)0.00
- Musictm-dataset For Joint Representation Learning Among Sheet Music, Lyrics, And Musical Audio (2020)3.58
- Exploring Modality-agnostic Representations For Music Classification (2021)0.00
- Contrastive Learning For Cross-modal Artist Retrieval (2023)0.00