A Transfer Learning Approach for Facial Paralysis Severity Detection

Wasif Ali; Muhammad Imran; Muhammad Usman Yaseen; Khursheed Aurangzeb; Nouman Ashraf; Sheraz Aslam

doi:10.1109/ACCESS.2023.3330242

IEEE Access (Jan 2023)

Affiliations

Wasif Ali: Department of Computer Science, COMSATS University Islamabad (CUI), Islamabad, Pakistan
Muhammad Imran: Department of Computer Science, COMSATS University Islamabad (CUI), Islamabad, Pakistan
Muhammad Usman Yaseen: Department of Computer Science, COMSATS University Islamabad (CUI), Islamabad, Pakistan
Khursheed Aurangzeb: Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
Nouman Ashraf: School of Electrical and Electronic Engineering, Technological University Dublin, Dublin, Ireland
Sheraz Aslam: Department of Electrical Engineering, Computer Engineering and Informatics, Cyprus University of Technology, Limassol, Cyprus

DOI: https://doi.org/10.1109/ACCESS.2023.3330242
Journal volume & issue: Vol. 11, pp. 127492–127508

Abstract

Facial paralysis is a debilitating condition that weakens or damages facial muscles, resulting in asymmetric or abnormal facial movements. To aid in its diagnosis and rehabilitation, researchers have developed computer-aided diagnosis systems based on machine learning and deep learning. However, machine learning models are limited by their reliance on facial landmark techniques and manual extraction of face palsy regions to obtain spatial information. Deep learning models, in turn, need large labelled datasets for training, whereas the available facial paralysis datasets are small and restricted. This presents significant challenges, including difficult data acquisition, insufficient patient numbers, and inadequate diversity within the datasets, all of which can restrict the generalizability of the models and bias their results. In this study, we propose an approach for the diagnosis and grading of facial paralysis that draws on two datasets, one prepared from MEEI (Massachusetts Eye and Ear Infirmary) patient videos and the other from the YFP (YouTube Face Palsy) dataset. The approach uses transfer learning to fine-tune the VGGFace model, which is pre-trained on facial images, on the prepared facial paralysis datasets; we refer to the resulting model as FP-VGGFace. Two further models pre-trained on facial images, ResNet50 and VGG16, are also fine-tuned for the facial paralysis task so that the performance of multiple models can be compared on the prepared dataset. The findings indicate that the models exhibit high accuracy, benefiting from pre-training on a diverse dataset that enables the capture of spatial information from facial images. The FP-VGGFace model achieves the best accuracy (99.3%) and F1-score (99.3%), surpassing all benchmark models. This study underscores the potential of pre-trained deep learning models for the diagnosis and rehabilitation of facial paralysis.
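
The abstract reports results as accuracy and F1-score. The following is a minimal sketch of how those two metrics could be computed for a multi-class severity grader. It is not the authors' evaluation code: the grade names (`normal`, `mild`, `severe`), the toy labels, and the use of macro averaging for the F1-score are all assumptions made for illustration, since the abstract does not state the grading scale or the averaging scheme.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted grade equals the true grade."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (macro averaging is an assumption)."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical grades and predictions, for illustration only.
y_true = ["normal", "normal", "mild", "mild", "severe", "severe"]
y_pred = ["normal", "normal", "mild", "severe", "severe", "severe"]

print(f"accuracy = {accuracy(y_true, y_pred):.3f}")
print(f"macro F1 = {macro_f1(y_true, y_pred):.3f}")
```

Reporting both metrics is useful when severity grades are unevenly represented, because accuracy alone can hide poor performance on a rare grade while a per-class F1 average exposes it.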

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
