Foundations of Computing and Decision Sciences (Feb 2024)

DefenseFea: An Input Transformation Feature Searching Algorithm Based Latent Space for Adversarial Defense

  • Pan Zhang,
  • Yangjie Cao,
  • Chenxi Zhu,
  • Yan Zhuang,
  • Haobo Wang,
  • Jie Li

DOI
https://doi.org/10.2478/fcds-2024-0002
Journal volume & issue
Vol. 49, no. 1
pp. 21 – 36

Abstract

Read online

Deep neural networks based image classification systems could suffer from adversarial attack algorithms, which generate input examples by adding deliberately crafted yet imperceptible noise to original input images. These crafted examples can fool systems and further threaten their security. In this paper, we propose to use latent space protect image classification. Specifically, we train a feature searching network to make up the difference between adversarial examples and clean examples with label guided loss function. We name it DefenseFea(input transformation based defense with label guided loss function), experimental result shows that DefenseFea can improve the rate of adversarial examples that achieved a success rate of about 99% on a specific set of 5000 images from ILSVRC 2012. This study plays a positive role in the further investigation of the relationship between adversarial examples and clean examples.

Keywords