Enhancing Noisy Label Facial Expression Recognition With Split and Merge Consistency Regularization

Jihyun Kim; Junehyoung Kwon; Mihyeon Kim; Eunju Lee; Youngbin Kim

doi:10.1109/ACCESS.2023.3339763

IEEE Access (Jan 2023)

Enhancing Noisy Label Facial Expression Recognition With Split and Merge Consistency Regularization

Jihyun Kim,
Junehyoung Kwon,
Mihyeon Kim,
Eunju Lee,
Youngbin Kim

Affiliations

Jihyun Kim: Department of Artificial Intelligence, Chung-Ang University, Dongjak, South Korea
Junehyoung Kwon: Department of Artificial Intelligence, Chung-Ang University, Dongjak, South Korea
Mihyeon Kim: Department of Artificial Intelligence, Chung-Ang University, Dongjak, South Korea
Eunju Lee: ORCiD; Department of Imaging Science, Multimedia and Film, Chung-Ang University, Dongjak, South Korea
Youngbin Kim: ORCiD; Department of Artificial Intelligence, Chung-Ang University, Dongjak, South Korea

DOI: https://doi.org/10.1109/ACCESS.2023.3339763
Journal volume & issue: Vol. 11
pp. 140496 – 140505

Abstract

Read online

Facial expression recognition (FER) has been extensively studied in various applications over the past few years. However, in real facial expression datasets, labels can become noisy due to the ambiguity of expressions, the similarity between classes, and the subjectivity of annotators. These noisy labels negatively affect FER and significantly reduce classification performance. In previous methods, overfitting can occur as the noise ratio increases. To solve this problem, we propose the split and merge consistency regularization (SMEC) method that is robust to noisy labels by examining various image regions rather than just one part of facial expression images without negatively affecting the meaning. We split facial expression images into two images and input them into the backbone network to extract class activation maps (CAMs). This approach merges two CAMs and improves robustness to noisy labels by normalizing the consistency between the CAM of the original image and the merged CAM. The proposed SMEC method aims to improve FER performance and robustness against highly noisy labels by preventing the model from focusing on only a single part without losing the semantics of the facial expression images. The SMEC method demonstrates robust performance over state-of-the-art noisy label FER models on an unbalanced facial expression dataset called the real-world affective faces database (RAF-DB) regarding class-wise accuracy for clean and noisy labels, even at severe noise rates of 40% to 60%.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords