IEEE Access (Jan 2023)

Fine-Tuning Swin Transformer and Multiple Weights Optimality-Seeking for Facial Expression Recognition

  • Hongqi Feng,
  • Weikai Huang,
  • Denghui Zhang,
  • Bangze Zhang

DOI
https://doi.org/10.1109/ACCESS.2023.3237817
Journal volume & issue
Vol. 11
pp. 9995 – 10003

Abstract

Read online

Facial expression recognition plays a key role in human-computer emotional interaction. However, human faces in real environments are affected by various unfavorable factors, which will result in the reduction of expression recognition accuracy. In this paper, we proposed a novel method which combines Fine-tuning Swin Transformer and Multiple Weights Optimality-seeking (FST-MWOS) to enhanced expression recognition performance. FST-MWOS mainly consists of two crucial components: Fine-tuning Swin Transformer (FST) and Multiple Weights Optimality-seeking (MWOS). FST takes Swin Transformer Large as the backbone network to obtain multiple groups of fine-tuned model weights for the homologous data domains by hyperparameters configurations, data augmentation methods, etc. In MWOS a greedy strategy was used to mine locally optimal generalizations in the optimal epoch interval of each group of fine-tuned model weights. Then, the optimality-seeking for multiple groups of locally optimal weights was utilized to obtain the global optimal solution. Experiments results on RAF-DB, FERPlus and AffectNet datasets show that the proposed FST-MWOS method outperforms various state-of-the-art methods.

Keywords