Applied Sciences (Oct 2024)

Dual-Branch Multimodal Fusion Network for Driver Facial Emotion Recognition

  • Le Wang,
  • Yuchen Chang,
  • Kaiping Wang

DOI: https://doi.org/10.3390/app14209430
Journal volume & issue: Vol. 14, no. 20, p. 9430

Abstract

In the transition to fully automated driving, the interaction between drivers and vehicles is crucial, as drivers’ emotions directly influence their behavior and thereby impact traffic safety. Currently, relying solely on a convolutional neural network (CNN) backbone to extract facial features from a single RGB modality makes it difficult to capture sufficient semantic information. To address this issue, this paper proposes a Dual-branch Multimodal Fusion Network (DMFNet). DMFNet extracts semantic features from visible–infrared (RGB-IR) image pairs, effectively capturing complementary information between the two modalities and achieving a more accurate understanding of the driver’s emotional state at a global level. However, the accuracy of facial emotion recognition is significantly affected by variations in the driver’s head posture and lighting conditions. Thus, we further propose a U-Shape Reconstruction Network (URNet) that focuses on enhancing and reconstructing the detailed features of the RGB modality. Additionally, we design a Detail Enhancement Block (DEB), embedded in URNet, for high-frequency filtering. Compared with the original driver emotion recognition model, our method improves accuracy by 18.77% on the DEFE++ dataset, demonstrating the superiority of the proposed method.
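To illustrate the dual-branch fusion idea described in the abstract, the following is a minimal sketch of a two-branch RGB-IR classifier in PyTorch. It is not the paper's DMFNet: the module names, layer sizes, concatenation-based fusion, and 7-class output are illustrative assumptions, and the URNet and DEB components are omitted entirely.

```python
# Hypothetical sketch of a dual-branch RGB-IR fusion classifier.
# All module names, layer sizes, and the fusion scheme are
# illustrative assumptions, not the paper's DMFNet specification.
import torch
import torch.nn as nn


def conv_block(in_ch: int, out_ch: int) -> nn.Sequential:
    """Simple conv-BN-ReLU-pool block shared by both branches."""
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
        nn.MaxPool2d(2),
    )


class DualBranchFusionNet(nn.Module):
    """Two CNN branches (RGB and IR) whose pooled features are
    concatenated before a shared emotion classification head."""

    def __init__(self, num_classes: int = 7):
        super().__init__()
        # Separate encoders so each modality learns its own features.
        self.rgb_branch = nn.Sequential(conv_block(3, 32), conv_block(32, 64))
        self.ir_branch = nn.Sequential(conv_block(3, 32), conv_block(32, 64))
        self.pool = nn.AdaptiveAvgPool2d(1)
        # Fusion by concatenating the two pooled feature vectors.
        self.classifier = nn.Linear(64 * 2, num_classes)

    def forward(self, rgb: torch.Tensor, ir: torch.Tensor) -> torch.Tensor:
        f_rgb = self.pool(self.rgb_branch(rgb)).flatten(1)
        f_ir = self.pool(self.ir_branch(ir)).flatten(1)
        fused = torch.cat([f_rgb, f_ir], dim=1)
        return self.classifier(fused)


if __name__ == "__main__":
    model = DualBranchFusionNet(num_classes=7)
    rgb = torch.randn(2, 3, 112, 112)   # batch of RGB face crops
    ir = torch.randn(2, 3, 112, 112)    # paired infrared face crops
    print(model(rgb, ir).shape)         # torch.Size([2, 7])
```

The point of the two separate encoders is that each modality keeps its own feature extractor before fusion, which is the general mechanism by which RGB-IR pairs can contribute complementary information; the paper's actual fusion strategy and backbones may differ.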

Keywords