Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals

Jan Pešán; Vojtěch Juřík; Alexandra Ružičková; Vojtěch Svoboda; Oto Janoušek; Andrea Němcová; Hana Bojanovská; Jasmína Aldabaghová; Filip Kyslík; Kateřina Vodičková; Adéla Sodomová; Patrik Bartys; Peter Chudý; Jan Černocký

doi:10.1038/s41597-024-03991-w

Scientific Data (Nov 2024)

Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals

Jan Pešán,
Vojtěch Juřík,
Alexandra Ružičková,
Vojtěch Svoboda,
Oto Janoušek,
Andrea Němcová,
Hana Bojanovská,
Jasmína Aldabaghová,
Filip Kyslík,
Kateřina Vodičková,
Adéla Sodomová,
Patrik Bartys,
Peter Chudý,
Jan Černocký

Affiliations

Jan Pešán: Speech@FIT, Faculty of Information Technology, Brno University of Technology
Vojtěch Juřík: Department of Psychology, Faculty of Arts, Masaryk University
Alexandra Ružičková: Department of Psychology, Faculty of Arts, Masaryk University
Vojtěch Svoboda: Department of Psychology, Faculty of Arts, Masaryk University
Oto Janoušek: Department of Biomedical Engineering, Faculty of Electrical Engineering and Communication, Brno University of Technology
Andrea Němcová: Department of Biomedical Engineering, Faculty of Electrical Engineering and Communication, Brno University of Technology
Hana Bojanovská: Department of Psychology, Faculty of Arts, Masaryk University
Jasmína Aldabaghová: Department of Psychology, Faculty of Arts, Masaryk University
Filip Kyslík: Department of Psychology, Faculty of Arts, Masaryk University
Kateřina Vodičková: Department of Psychology, Faculty of Arts, Masaryk University
Adéla Sodomová: Department of Psychology, Faculty of Arts, Masaryk University
Patrik Bartys: Department of Psychology, Faculty of Arts, Masaryk University
Peter Chudý: Speech@FIT, Faculty of Information Technology, Brno University of Technology
Jan Černocký: Speech@FIT, Faculty of Information Technology, Brno University of Technology

DOI: https://doi.org/10.1038/s41597-024-03991-w
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 9

Abstract

Read online

Abstract Early identification of cognitive or physical overload is critical in fields where human decision making matters when preventing threats to safety and property. Pilots, drivers, surgeons, and operators of nuclear plants are among those affected by this challenge, as acute stress can impair their cognition. In this context, the significance of paralinguistic automatic speech processing increases for early stress detection. The intensity, intonation, and cadence of an utterance are examples of paralinguistic traits that determine the meaning of a sentence and are often lost in the verbatim transcript. To address this issue, tools are being developed to recognize paralinguistic traits effectively. However, a data bottleneck still exists in the training of paralinguistic speech traits, and the lack of high-quality reference data for the training of artificial systems persists. Regarding this, we present an original empirical dataset collected using the BESST experimental protocol for capturing speech signals under induced stress. With this data, our aim is to promote the development of pre-emptive intervention systems based on stress estimation from speech.

Published in Scientific Data

ISSN: 2052-4463 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Science
Website: https://www.nature.com/sdata/

About the journal