Journal of King Saud University: Computer and Information Sciences (Jul 2023)
GCANet: Geometry cues-aware facial expression recognition based on graph convolutional networks
Abstract
Facial expression recognition (FER) in the wild is challenging due to several sources of uncertainty, such as the ambiguity of facial expressions, subjective annotations, and low-quality facial images. A novel model for in-the-wild FER datasets is proposed in this study to address these uncertainties. An overview of the proposed method is as follows. First, the facial images are grouped into high- and low-uncertainty sets by a pre-trained network. A graph convolutional network (GCN) framework is then applied to the low-uncertainty facial images to obtain geometry cues, including the relationships among action units (AUs) and the implicit connections between AUs and expressions, which help predict the probability of the latent emotion label. The emotion label distribution is produced by combining the predicted latent label probability and the given label. For the facial images with high uncertainty, k-nearest neighbor graphs are built to determine the k facial images in the low-uncertainty group with the highest similarity to the given facial image. The emotion label distribution of the given image is then replaced by fusing the neighbors' emotion label distributions, weighted by the distances between the given image and its neighbors. Finally, the constructed emotion label distributions enable straightforward training of a convolutional neural network to recognize facial expressions. Experimental results on the RAF-DB, FERPlus, AffectNet, and SFEW2.0 datasets demonstrate that the proposed method achieves superior performance compared to state-of-the-art approaches.
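
The following is a minimal sketch, not the authors' code, of the distance-weighted fusion step described above for a high-uncertainty image. It assumes features have already been extracted by a pre-trained backbone; the function name, array layouts, and the Gaussian distance weighting are illustrative assumptions.

    import numpy as np

    def fuse_label_distribution(query_feat, low_unc_feats, low_unc_dists, k=8, sigma=1.0):
        """Replace a high-uncertainty image's emotion label distribution by fusing
        the distributions of its k most similar low-uncertainty neighbors.

        query_feat:     (d,)   feature of the high-uncertainty image
        low_unc_feats:  (n, d) features of the low-uncertainty images
        low_unc_dists:  (n, c) per-image emotion label distributions
        """
        # Euclidean distances between the query and every low-uncertainty image
        dists = np.linalg.norm(low_unc_feats - query_feat, axis=1)
        # indices of the k nearest low-uncertainty neighbors
        nn_idx = np.argsort(dists)[:k]
        # distance-based weights: closer neighbors contribute more (assumed Gaussian kernel)
        weights = np.exp(-dists[nn_idx] ** 2 / (2 * sigma ** 2))
        weights /= weights.sum()
        # weighted fusion of the neighbors' label distributions, renormalized to sum to 1
        fused = (weights[:, None] * low_unc_dists[nn_idx]).sum(axis=0)
        return fused / fused.sum()

The fused distribution would then serve as the training target for the high-uncertainty image in the final convolutional network, in place of its original (unreliable) label.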