SunoCaps: A novel dataset of text-prompt based AI-generated music with emotion annotations

M. Civit; V. Drai-Zerbib; D. Lizcano; M.J. Escalona

Data in Brief (Aug 2024)

SunoCaps: A novel dataset of text-prompt based AI-generated music with emotion annotations

M. Civit,
V. Drai-Zerbib,
D. Lizcano,
M.J. Escalona

Affiliations

M. Civit: Department of Communication and Education, Universidad Loyola Andalucía. Av. de las Universidades s/n. 41704 Sevilla, Spain; LEAD - CNRS UMR5022 Université Bourgogne Institut Marey - I3M, 64 rue de Sully, Dijon 21000, France; Universidad de Sevilla, ETS Ingeniería Informática, Avda. Reina Mercedes s/n, Seville 41012, Spain; Corresponding author.
V. Drai-Zerbib: LEAD - CNRS UMR5022 Université Bourgogne Institut Marey - I3M, 64 rue de Sully, Dijon 21000, France
D. Lizcano: Universidad a Distancia de Madrid, Carretera de La Coruña, KM.38,500 Vía de Servicio, n° 15, Collado Villalba, Madrid 28400, Spain
M.J. Escalona: Universidad de Sevilla, ETS Ingeniería Informática, Avda. Reina Mercedes s/n, Seville 41012, Spain

Journal volume & issue: Vol. 55
p. 110743

Abstract

Read online

The SunoCaps dataset aims to provide an innovative contribution to music data. Expert description of human-made musical pieces, from the widely used MusicCaps dataset, are used as prompts for generating complete songs for this dataset. This Automatic Music Generation is done with the state-of-the-art Suno generator of audio-based music. A subset of 64 pieces from MusicCaps is currently included, with a total of 256 generated entries. This total stems from generating four different variations for each human piece; two versions based on the original caption and two versions based on the original aspect description.As an AI-generated music dataset, SunoCaps also includes expert-based information on prompt alignment, with the main differences between prompt and final generation annotated. Furthermore, annotations describing the main discrete emotions induced by the piece. This dataset can have an array of implementations, such as creating and improving music generation validation tools, training systems for multi-layered architectures and the optimization of music emotion estimation systems.

Published in Data in Brief

ISSN: 2352-3409 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Science (General)
Website: http://www.journals.elsevier.com/data-in-brief/

About the journal

Abstract

Keywords