Understanding How CNNs Recognize Facial Expressions: A Case Study with LIME and CEM

Guillermo del Castillo Torres; Maria Francesca Roig-Maimó; Miquel Mascaró-Oliver; Esperança Amengual-Alcover; Ramon Mas-Sansó

doi:10.3390/s23010131

Sensors (Dec 2022)

Understanding How CNNs Recognize Facial Expressions: A Case Study with LIME and CEM

Guillermo del Castillo Torres,
Maria Francesca Roig-Maimó,
Miquel Mascaró-Oliver,
Esperança Amengual-Alcover,
Ramon Mas-Sansó

Affiliations

Guillermo del Castillo Torres: Department of Mathematics and Computer Science, University of the Balearic Islands, 07122 Palma, Spain
Maria Francesca Roig-Maimó: Department of Mathematics and Computer Science, University of the Balearic Islands, 07122 Palma, Spain
Miquel Mascaró-Oliver: Department of Mathematics and Computer Science, University of the Balearic Islands, 07122 Palma, Spain
Esperança Amengual-Alcover: Department of Mathematics and Computer Science, University of the Balearic Islands, 07122 Palma, Spain
Ramon Mas-Sansó: Department of Mathematics and Computer Science, University of the Balearic Islands, 07122 Palma, Spain

DOI: https://doi.org/10.3390/s23010131
Journal volume & issue: Vol. 23, no. 1
p. 131

Abstract

Read online

Recognizing facial expressions has been a persistent goal in the scientific community. Since the rise of artificial intelligence, convolutional neural networks (CNN) have become popular to recognize facial expressions, as images can be directly used as input. Current CNN models can achieve high recognition rates, but they give no clue about their reasoning process. Explainable artificial intelligence (XAI) has been developed as a means to help to interpret the results obtained by machine learning models. When dealing with images, one of the most-used XAI techniques is LIME. LIME highlights the areas of the image that contribute to a classification. As an alternative to LIME, the CEM method appeared, providing explanations in a way that is natural for human classification: besides highlighting what is sufficient to justify a classification, it also identifies what should be absent to maintain it and to distinguish it from another classification. This study presents the results of comparing LIME and CEM applied over complex images such as facial expression images. While CEM could be used to explain the results on images described with a reduced number of features, LIME would be the method of choice when dealing with images described with a huge number of features.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords