Semantic Face Segmentation Using Convolutional Neural Networks With a Supervised Attention Module

Akiyoshi Hizukuri; Yuto Hirata; Ryohei Nakayama

doi:10.1109/ACCESS.2023.3326420

IEEE Access (Jan 2023)

Affiliations

Akiyoshi Hizukuri: Department of Electronic and Computer Engineering, Ritsumeikan University, Kusatsu-shi, Japan
Yuto Hirata: Department of Electronic and Computer Engineering, Ritsumeikan University, Kusatsu-shi, Japan
Ryohei Nakayama: Department of Electronic and Computer Engineering, Ritsumeikan University, Kusatsu-shi, Japan

DOI: https://doi.org/10.1109/ACCESS.2023.3326420
Journal volume & issue: Vol. 11, pp. 116892–116902

Abstract

A self-attention module is often used in image segmentation tasks such as facial part segmentation. Because the self-attention module weights the features at each position using the weighted sum of the features at all positions obtained from a middle layer of a convolutional neural network (CNN), the target regions for segmentation might not be weighted sufficiently. The purpose of this study was to develop a semantic segmentation method for facial parts using a CNN with a supervised attention module that focuses on facial part enhancement. To improve the segmentation accuracy of the facial parts, we propose a new supervised attention module that enhances the features corresponding to pixels with the same class label in the input image, and we incorporate it into the CNN. In this study, ResNet-FCN with skip connections was used as the baseline CNN model, and the CelebA Mask-HQ dataset was used to train and evaluate the network. The mean intersection over union (IoU) and Dice index for the proposed network were greater than those for ResNet-FCN, SegNet, and PSPNet without the self-attention module, non-local neural networks with the traditional self-attention module, and Segformer and U-Segformer-Hyper with a Transformer. The proposed network achieved a high mean IoU and Dice index, and hence will be useful for segmenting facial parts.
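The two evaluation metrics named in the abstract, mean IoU and the Dice index, can be illustrated with a minimal NumPy sketch. It is not the authors' evaluation code. The function names, the choice to skip classes absent from both label maps, and the plain average over classes are assumptions made here for illustration; the abstract does not specify the averaging protocol.

```python
import numpy as np


def per_class_iou_and_dice(pred, target, num_classes):
    """Compute IoU and Dice per class from two integer label maps.

    Classes absent from both maps are left as NaN (an assumption of
    this sketch) so that they do not affect the later mean.
    """
    pred = np.asarray(pred)
    target = np.asarray(target)
    iou = np.full(num_classes, np.nan)
    dice = np.full(num_classes, np.nan)
    for c in range(num_classes):
        p = pred == c                        # pixels predicted as class c
        t = target == c                      # pixels labelled as class c
        inter = np.logical_and(p, t).sum()   # |P ∩ T|
        union = np.logical_or(p, t).sum()    # |P ∪ T|
        if union > 0:
            iou[c] = inter / union                    # |P ∩ T| / |P ∪ T|
            dice[c] = 2 * inter / (p.sum() + t.sum())  # 2|P ∩ T| / (|P| + |T|)
    return iou, dice


def mean_iou_and_dice(pred, target, num_classes):
    """Average the per-class scores, ignoring classes marked NaN."""
    iou, dice = per_class_iou_and_dice(pred, target, num_classes)
    return float(np.nanmean(iou)), float(np.nanmean(dice))


# Hypothetical 2x2 label maps with two classes, for demonstration only.
example_pred = np.array([[0, 0], [1, 1]])
example_target = np.array([[0, 1], [1, 1]])
example_miou, example_mdice = mean_iou_and_dice(example_pred, example_target, 2)
```

For a single class the two scores are related by Dice = 2·IoU / (1 + IoU), so Dice is never lower than IoU; both are reported because they penalize partial overlap differently.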

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
