Boosting the Performance of Deep Ear Recognition Systems Using Generative Adversarial Networks and Mean Class Activation Maps

Rafik Bouaouina; Amir Benzaoui; Hakim Doghmane; Youcef Brik

doi:10.3390/app14104162

Applied Sciences (May 2024)

Boosting the Performance of Deep Ear Recognition Systems Using Generative Adversarial Networks and Mean Class Activation Maps

Rafik Bouaouina,
Amir Benzaoui,
Hakim Doghmane,
Youcef Brik

Affiliations

Rafik Bouaouina: Electronic and Telecommunications Department, Université du 8 Mai 1945, Guelma 24000, Algeria
Amir Benzaoui: Electrical Engineering Department, University of Skikda, BP 26, El Hadaiek, Skikda 21000, Algeria
Hakim Doghmane: Electronic and Telecommunications Department, Université du 8 Mai 1945, Guelma 24000, Algeria
Youcef Brik: Electronics Department, University of Mohamed Boudiaf, M’sila 28000, Algeria

DOI: https://doi.org/10.3390/app14104162
Journal volume & issue: Vol. 14, no. 10
p. 4162

Abstract

Read online

Ear recognition is a complex research domain within biometrics, aiming to identify individuals using their ears in uncontrolled conditions. Despite the exceptional performance of convolutional neural networks (CNNs) in various applications, the efficacy of deep ear recognition systems is nascent. This paper proposes a two-step ear recognition approach. The initial step employs deep convolutional generative adversarial networks (DCGANs) to enhance ear images. This involves the colorization of grayscale images and the enhancement of dark shades, addressing visual imperfections. Subsequently, a feature extraction and classification technique, referred to as Mean-CAM-CNN, is introduced. This technique leverages mean-class activation maps in conjunction with CNNs. The Mean-CAM approach directs the CNN to focus specifically on relevant information, extracting and assessing only significant regions within the entire image. The process involves the implementation of a mask to selectively crop the pertinent area of the image. The cropped region is then utilized to train a CNN for discriminative classification. Extensive evaluations were conducted using two ear recognition datasets: mathematical analysis of images (MAI) and annotated web ears (AWEs). The experimental results indicate that the proposed approach shows notable improvements and competitive performance: the Rank-1 recognition rates are 100.00% and 76.25% for MAI and AWE datasets, respectively.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords