cs.DL
29 papers tagged cs.DL (ordered by heat_score)
Papers
- Reading or Guessing? Visual Grounding Failures of Vision-Language Models for OCR in Ancient Greek Editions (2026)Antonia Karamolegkou et al.5.06
- ASMD: an automatic framework for compiling multimodal datasets with
audio and scores (2021)Federico Simonetta et al.β
- The HaMSE Ontology: Using Semantic Technologies to support Music
Representation Interoperability and Musicological Analysis (2023)Andrea Poltronieri and Aldo Gangemiβ
- Partitura: A Python Package for Symbolic Music Processing (2022)Carlos Cancino-Chac\'on et al.β
- Analysis and Detection of Singing Techniques in Repertoires of J-POP
Solo Singers (2022)Yuya Yamamoto et al.β
- Large-Scale Automatic Audiobook Creation (2023)Brendan Walsh et al.β
- Predicting performance difficulty from piano sheet music images (2023)Pedro Ramoneda et al.β
- Encoding Performance Data in MEI with the Automatic Music Performance
Analysis and Comparison Toolkit (AMPACT) (2023)Johanna Devaney et al.β
- Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for
Tempo Prediction and Search (2024)Matthew C. McCallum et al.β
- A Semi-Automatic Approach to Create Large Gender- and Age-Balanced
Speaker Corpora: Usefulness of Speaker Diarization & Identification (2024)R\'emi Uro et al.β
- InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech
Activity Detection and Speaker Gender Segmentation (2024)David Doukhan and Christine Maertens and William Le Personnic and Ludovic Speroni and Reda Dehakβ
- GraphMuse: A Library for Symbolic Music Graph Processing (2024)Emmanouil Karystinaios and Gerhard Widmerβ
- HALvest-Contrastive: Retrieval-Like Authorship Attribution with Patch-Level Late Interaction (2026)Francis Kulumba et al.β
- Optical Music Recognition in Manuscripts from the Ricordi Archive (2024)Federico Simonetta et al.β
- A Survey on Spoken Italian Datasets and Corpora (2025)Marco Giordano et al.β
- Sanidha: A Studio Quality Multi-Modal Dataset for Carnatic Music (2025)Venkatakrishnan Vaidyanathapuram Krishnan et al.β
- The GigaMIDI Dataset with Features for Expressive Music Performance
Detection (2025)Keon Ju Maverick Lee et al.β
- An Open Research Dataset of the 1932 Cairo Congress of Arab Music (2025)Baris Bozkurt (College of Interdisciplinary Studies et al.β
- KuiSCIMA v2.0: Improved Baselines, Calibration, and Cross-Notation Generalization for Historical Chinese Music Notations in Jiang Kui's Baishidaoren Gequ (2025)Tristan Repolusk et al.β
- The IRMA Dataset: A Structured Audio-MIDI Corpus for Iranian Classical Music (2025)Sepideh Shafiei and Shapour Hakamβ
- Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models (2026)Luigi Curini et al.β
- sciwrite-lint: Verification Infrastructure for the Age of Science Vibe-Writing (2026)Sergey V Samsonauβ
- Giving Voice to the Constitution: Low-Resource Text-to-Speech for Quechua and Spanish Using a Bilingual Legal Corpus (2026)John E. Ortega et al.β
- CitePrism: Human-in-the-Loop AI for Citation Auditing and Editorial Integrity (2026)Gowrika Mahesh et al.β
- Human-AI Collaboration in Science at Scale: A Global Large-scale Randomized Field Experiment (2026)Binglu Wang et al.β
- Rejoinder: The ICML 2023 Ranking Experiment: Examining Author Self-Assessment in ML/AI Peer Review (2026)Buxin Su et al.β
- CiteCheck: Retrieval-Grounded Detection of LLM Citation Hallucinations in Scientific Text (2026)Khashayar Khajavi et al.β
- Verified Misguidance: Measuring Structural Citation Failures in Search-Augmented LLMs (2026)Yongsik Seo et al.β
- The Biosecurity Blind Spot: Systematic Dual-use Detection in Open Science Infrastructure (2026)Vasudha Sharma et al.β