Novel Exploit Feature-Map-Based Detection of Adversarial Attacks

Ali Saeed Almuflih; Dhairya Vyas; Viral V. Kapdia; Mohamed Rafik Noor Mohamed Qureshi; Karishma Mohamed Rafik Qureshi; Elaf Abdullah Makkawi

doi:10.3390/app12105161

Applied Sciences (May 2022)

Novel Exploit Feature-Map-Based Detection of Adversarial Attacks

Ali Saeed Almuflih,
Dhairya Vyas,
Viral V. Kapdia,
Mohamed Rafik Noor Mohamed Qureshi,
Karishma Mohamed Rafik Qureshi,
Elaf Abdullah Makkawi

Affiliations

Ali Saeed Almuflih: Industrial Engineering Department, King Khalid University, Abha 62529, Saudi Arabia
Dhairya Vyas: Computer Science and Engineering Department, The Maharaja Sayajirao University of Baroda, Vadodara 390002, India
Viral V. Kapdia: Computer Science and Engineering Department, The Maharaja Sayajirao University of Baroda, Vadodara 390002, India
Mohamed Rafik Noor Mohamed Qureshi: Industrial Engineering Department, King Khalid University, Abha 62529, Saudi Arabia
Karishma Mohamed Rafik Qureshi: Department of Mechanical Engineering, Parul University, Waghodia 391760, India
Elaf Abdullah Makkawi: Industrial Engineering and Management System, University of Central Florida, Orlando, FL 32816, USA

DOI: https://doi.org/10.3390/app12105161
Journal volume & issue: Vol. 12, no. 10
p. 5161

Abstract

Read online

In machine learning (ML), adversarial attack (targeted or untargeted) in the presence of noise disturbs the model prediction. This research suggests that adversarial perturbations on pictures lead to noise in the features constructed by any networks. As a result, adversarial assaults against image categorization systems may present obstacles and possibilities for studying convolutional neural networks (CNNs). According to this research, adversarial perturbations on pictures cause noise in the features created by neural networks. Motivated by adversarial perturbation on image pixel attacks observation, we developed a novel exploit feature map that describes adversarial attacks by performing individual object feature-map visual description. Specifically, a novel detection algorithm calculates each object’s class activation map weight and makes a combined activation map. When checked with different networks like VGGNet19 and ResNet50, in both white-box and black-box attack situations, the unique exploit feature-map significantly improves the state-of-the-art in adversarial resilience. Further, it will clearly exploit attacks on ImageNet under various algorithms like Fast Gradient Sign Method (FGSM), DeepFool, Projected Gradient Descent (PGD), and Backward Pass Differentiable Approximation (BPDA).

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords