Applied Sciences (May 2022)

Novel Exploit Feature-Map-Based Detection of Adversarial Attacks

  • Ali Saeed Almuflih,
  • Dhairya Vyas,
  • Viral V. Kapdia,
  • Mohamed Rafik Noor Mohamed Qureshi,
  • Karishma Mohamed Rafik Qureshi,
  • Elaf Abdullah Makkawi

DOI
https://doi.org/10.3390/app12105161
Journal volume & issue
Vol. 12, no. 10
p. 5161

Abstract

Read online

In machine learning (ML), adversarial attack (targeted or untargeted) in the presence of noise disturbs the model prediction. This research suggests that adversarial perturbations on pictures lead to noise in the features constructed by any networks. As a result, adversarial assaults against image categorization systems may present obstacles and possibilities for studying convolutional neural networks (CNNs). According to this research, adversarial perturbations on pictures cause noise in the features created by neural networks. Motivated by adversarial perturbation on image pixel attacks observation, we developed a novel exploit feature map that describes adversarial attacks by performing individual object feature-map visual description. Specifically, a novel detection algorithm calculates each object’s class activation map weight and makes a combined activation map. When checked with different networks like VGGNet19 and ResNet50, in both white-box and black-box attack situations, the unique exploit feature-map significantly improves the state-of-the-art in adversarial resilience. Further, it will clearly exploit attacks on ImageNet under various algorithms like Fast Gradient Sign Method (FGSM), DeepFool, Projected Gradient Descent (PGD), and Backward Pass Differentiable Approximation (BPDA).

Keywords