Awesome Papers

Papers

Reading or Guessing? Visual Grounding Failures of Vision-Language Models for OCR in Ancient Greek Editions (2026)
Antonia Karamolegkou et al.
5.06
ASMD: an automatic framework for compiling multimodal datasets with audio and scores (2021)
Federico Simonetta et al.
—
The HaMSE Ontology: Using Semantic Technologies to support Music Representation Interoperability and Musicological Analysis (2023)
Andrea Poltronieri and Aldo Gangemi
—
Partitura: A Python Package for Symbolic Music Processing (2022)
Carlos Cancino-Chac\'on et al.
—
Analysis and Detection of Singing Techniques in Repertoires of J-POP Solo Singers (2022)
Yuya Yamamoto et al.
—
Large-Scale Automatic Audiobook Creation (2023)
Brendan Walsh et al.
—
Predicting performance difficulty from piano sheet music images (2023)
Pedro Ramoneda et al.
—
Encoding Performance Data in MEI with the Automatic Music Performance Analysis and Comparison Toolkit (AMPACT) (2023)
Johanna Devaney et al.
—
Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for Tempo Prediction and Search (2024)
Matthew C. McCallum et al.
—
A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification (2024)
R\'emi Uro et al.
—
InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation (2024)
David Doukhan and Christine Maertens and William Le Personnic and Ludovic Speroni and Reda Dehak
—
GraphMuse: A Library for Symbolic Music Graph Processing (2024)
Emmanouil Karystinaios and Gerhard Widmer
—
HALvest-Contrastive: Retrieval-Like Authorship Attribution with Patch-Level Late Interaction (2026)
Francis Kulumba et al.
—
Optical Music Recognition in Manuscripts from the Ricordi Archive (2024)
Federico Simonetta et al.
—
A Survey on Spoken Italian Datasets and Corpora (2025)
Marco Giordano et al.
—
Sanidha: A Studio Quality Multi-Modal Dataset for Carnatic Music (2025)
Venkatakrishnan Vaidyanathapuram Krishnan et al.
—
The GigaMIDI Dataset with Features for Expressive Music Performance Detection (2025)
Keon Ju Maverick Lee et al.
—
An Open Research Dataset of the 1932 Cairo Congress of Arab Music (2025)
Baris Bozkurt (College of Interdisciplinary Studies et al.
—
KuiSCIMA v2.0: Improved Baselines, Calibration, and Cross-Notation Generalization for Historical Chinese Music Notations in Jiang Kui's Baishidaoren Gequ (2025)
Tristan Repolusk et al.
—
The IRMA Dataset: A Structured Audio-MIDI Corpus for Iranian Classical Music (2025)
Sepideh Shafiei and Shapour Hakam
—
Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models (2026)
Luigi Curini et al.
—
sciwrite-lint: Verification Infrastructure for the Age of Science Vibe-Writing (2026)
Sergey V Samsonau
—
Giving Voice to the Constitution: Low-Resource Text-to-Speech for Quechua and Spanish Using a Bilingual Legal Corpus (2026)
John E. Ortega et al.
—
CitePrism: Human-in-the-Loop AI for Citation Auditing and Editorial Integrity (2026)
Gowrika Mahesh et al.
—
Human-AI Collaboration in Science at Scale: A Global Large-scale Randomized Field Experiment (2026)
Binglu Wang et al.
—
Rejoinder: The ICML 2023 Ranking Experiment: Examining Author Self-Assessment in ML/AI Peer Review (2026)
Buxin Su et al.
—
CiteCheck: Retrieval-Grounded Detection of LLM Citation Hallucinations in Scientific Text (2026)
Khashayar Khajavi et al.
—
Verified Misguidance: Measuring Structural Citation Failures in Search-Augmented LLMs (2026)
Yongsik Seo et al.
—
The Biosecurity Blind Spot: Systematic Dual-use Detection in Open Science Infrastructure (2026)
Vasudha Sharma et al.
—