IET Image Processing (Jul 2024)
Modelling appearance variations in expressive and neutral face image for automatic facial expression recognition
Abstract
In automatic facial expression recognition (AFER) systems, modelling spatio‐temporal feature information, coalescing it, and utilizing it effectively are challenging tasks. State‐of‐the‐art studies have examined integrating multiple features to enhance the recognition rate of AFER systems. However, the feature variations between expressive and neutral face images have not been fully explored for identifying the expression class. The proposed research presents an innovative approach to AFER that models appearance variations in both expressive and neutral face images. The prominent contributions of the work are the development of a novel hybrid feature space that integrates the discriminative feature distributions derived from expressive and neutral face images, and the preservation of the highly discriminative latent feature distribution using autoencoders. Local binary pattern (LBP) and histogram of oriented gradients (HOG) are the feature descriptors employed to derive discriminative texture and shape information, respectively. A component‐based approach is employed, wherein the features are derived from salient facial regions instead of the whole face. A three‐stage stacked deep convolutional autoencoder (SDCA) and a multi‐class support vector machine (MSVM) are employed for dimensionality reduction and classification, respectively. The efficacy of the proposed model is substantiated by empirical findings, which establish its superiority in terms of accuracy on widely recognized benchmark AFER datasets.
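To make the pipeline concrete, the sketch below illustrates one plausible reading of the abstract: per-region LBP and HOG features are extracted from aligned expressive and neutral face pairs, concatenated together with their expressive-minus-neutral difference to form the hybrid feature space, reduced in dimensionality, and classified with a multi-class SVM. The region coordinates, descriptor parameters, and the use of PCA as a stand-in for the paper's three-stage SDCA are all assumptions for illustration, not the authors' implementation.

```python
# Minimal sketch of the abstract's pipeline (assumed parameters throughout).
import numpy as np
from skimage.feature import local_binary_pattern, hog
from sklearn.decomposition import PCA
from sklearn.svm import SVC

def region_features(gray_face, regions):
    """Concatenate LBP histograms (texture) and HOG vectors (shape)
    computed per salient facial region (component-based approach)."""
    feats = []
    for (r0, r1, c0, c1) in regions:
        patch = gray_face[r0:r1, c0:c1]
        # LBP expects integer images; uniform LBP with P=8 yields codes 0..9.
        patch_u8 = (patch * 255).astype(np.uint8)
        lbp = local_binary_pattern(patch_u8, P=8, R=1, method="uniform")
        lbp_hist, _ = np.histogram(lbp, bins=10, range=(0, 10), density=True)
        hog_vec = hog(patch, orientations=9, pixels_per_cell=(8, 8),
                      cells_per_block=(2, 2))
        feats.append(np.concatenate([lbp_hist, hog_vec]))
    return np.concatenate(feats)

def hybrid_feature(expressive, neutral, regions):
    """Hybrid feature space: expressive-image features stacked with the
    expressive-minus-neutral appearance variation."""
    f_exp = region_features(expressive, regions)
    f_neu = region_features(neutral, regions)
    return np.concatenate([f_exp, f_exp - f_neu])

# Hypothetical (row0, row1, col0, col1) boxes for eye and mouth regions.
regions = [(20, 52, 10, 54), (60, 92, 20, 44)]

# Stand-in data: aligned 96x64 grayscale face pairs with expression labels.
rng = np.random.default_rng(0)
X_exp = rng.random((40, 96, 64))
X_neu = rng.random((40, 96, 64))
y = rng.integers(0, 6, size=40)

X = np.stack([hybrid_feature(e, n, regions) for e, n in zip(X_exp, X_neu)])
X = PCA(n_components=20).fit_transform(X)  # stand-in for the three-stage SDCA
clf = SVC(kernel="rbf", decision_function_shape="ovr").fit(X, y)  # MSVM
print(clf.score(X, y))
```

In the paper the dimensionality-reduction stage is a learned stacked deep convolutional autoencoder rather than PCA; the linear projection here only marks where that component sits in the pipeline.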
Keywords