Transactions of the International Society for Music Information Retrieval (Sep 2023)

PiJAMA: Piano Jazz with Automatic MIDI Annotations

  • Drew Edwards,
  • Simon Dixon,
  • Emmanouil Benetos

DOI
https://doi.org/10.5334/tismir.162
Journal volume & issue
Vol. 6, no. 1
pp. 89–102 – 89–102

Abstract

Read online

Recent advances in automatic piano transcription have enabled large scale analysis of piano music in the symbolic domain. However, the research has largely focused on classical piano music. We present PiJAMA (Piano Jazz with Automatic MIDI Annotations): a dataset of over 200 hours of solo jazz piano performances with automatically transcribed MIDI. In total there are 2,777 unique performances by 120 different pianists across 244 recorded albums. The dataset contains a mixture of studio recordings and live performances. We use automatic audio tagging to identify applause, spoken introductions, and other non-piano audio to facilitate downstream music information retrieval tasks. We explore descriptive statistics of the MIDI data, including pitch histograms and chromaticism. We then demonstrate two experimental benchmarks on the data: performer identification and generative modeling. The dataset, including a link to the associated source code is available at https://almostimplemented.github.io/PiJAMA/.

Keywords