Papers
arxiv:2408.08653

GAPS: A Large and Diverse Classical Guitar Dataset and Benchmark Transcription Model

Published on Aug 16, 2024
Authors:
,
,

Abstract

A new dataset, GAPS, and a benchmark model for guitar transcription achieve state-of-the-art performance on GuitarSet, using real audio-score aligned pairs and high-resolution MIDI alignments.

AI-generated summary

We introduce GAPS (Guitar-Aligned Performance Scores), a new dataset of classical guitar performances, and a benchmark guitar transcription model that achieves state-of-the-art performance on GuitarSet in both supervised and zero-shot settings. GAPS is the largest dataset of real guitar audio, containing 14 hours of freely available audio-score aligned pairs, recorded in diverse conditions by over 200 performers, together with high-resolution note-level MIDI alignments and performance videos. These enable us to train a state-of-the-art model for automatic transcription of solo guitar recordings which can generalise well to real world audio that is unseen during training.

Community

Sign up or log in to comment

Models citing this paper 1

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2408.08653 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.