EAV: EEG-Audio-Video Dataset for Emotion Recognition in Conversational Contexts

Min-Ho Lee; Adai Shomanov; Balgyn Begim; Zhuldyz Kabidenova; Aruna Nyssanbay; Adnan Yazici; Seong-Whan Lee

doi:10.1038/s41597-024-03838-4

Scientific Data (Sep 2024)

EAV: EEG-Audio-Video Dataset for Emotion Recognition in Conversational Contexts

Min-Ho Lee,
Adai Shomanov,
Balgyn Begim,
Zhuldyz Kabidenova,
Aruna Nyssanbay,
Adnan Yazici,
Seong-Whan Lee

Affiliations

Min-Ho Lee: Nazarbayev University, Department of Computer Science
Adai Shomanov: Nazarbayev University, Department of Computer Science
Balgyn Begim: Nazarbayev University, Department of Computer Science
Zhuldyz Kabidenova: Nazarbayev University, Department of Computer Science
Aruna Nyssanbay: Nazarbayev University, Department of Computer Science
Adnan Yazici: Nazarbayev University, Department of Computer Science
Seong-Whan Lee: Korea University, Department of Artificial Intelligence

DOI: https://doi.org/10.1038/s41597-024-03838-4
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Understanding emotional states is pivotal for the development of next-generation human-machine interfaces. Human behaviors in social interactions have resulted in psycho-physiological processes influenced by perceptual inputs. Therefore, efforts to comprehend brain functions and human behavior could potentially catalyze the development of AI models with human-like attributes. In this study, we introduce a multimodal emotion dataset comprising data from 30-channel electroencephalography (EEG), audio, and video recordings from 42 participants. Each participant engaged in a cue-based conversation scenario, eliciting five distinct emotions: neutral, anger, happiness, sadness, and calmness. Throughout the experiment, each participant contributed 200 interactions, which encompassed both listening and speaking. This resulted in a cumulative total of 8,400 interactions across all participants. We evaluated the baseline performance of emotion recognition for each modality using established deep neural network (DNN) methods. The Emotion in EEG-Audio-Visual (EAV) dataset represents the first public dataset to incorporate three primary modalities for emotion recognition within a conversational context. We anticipate that this dataset will make significant contributions to the modeling of the human emotional process, encompassing both fundamental neuroscience and machine learning viewpoints.

Published in Scientific Data

ISSN: 2052-4463 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Science
Website: https://www.nature.com/sdata/

About the journal