Adversarial examples detection through the sensitivity in space mappings

Xurong Li; Shouling Ji; Juntao Ji; Zhenyu Ren; Chunming Wu; Bo Li; Ting Wang

doi:10.1049/iet-cvi.2019.0378

IET Computer Vision (Aug 2020)

Affiliations

Xurong Li: Department of Computer Science and Technology, Zhejiang University, Hangzhou, People's Republic of China
Shouling Ji: Department of Computer Science and Technology, Zhejiang University, Hangzhou, People's Republic of China
Juntao Ji: Department of Computer Science and Technology, Zhejiang University, Hangzhou, People's Republic of China
Zhenyu Ren: Department of Computer Science and Technology, Zhejiang University, Hangzhou, People's Republic of China
Chunming Wu: Department of Computer Science and Technology, Zhejiang University, Hangzhou, People's Republic of China
Bo Li: Department of Computer Science, University of Illinois at Urbana‐Champaign, Urbana, USA
Ting Wang: Department of Computer Science, Lehigh University, Bethlehem, USA

DOI: https://doi.org/10.1049/iet-cvi.2019.0378
Journal volume & issue: Vol. 14, no. 5, pp. 201–213

Abstract

Adversarial examples (AEs) against deep neural networks (DNNs) raise wide concerns about the robustness of DNNs. Existing detection mechanisms are often limited to a given attack algorithm. Therefore, it is highly desirable to develop a robust detection approach that remains effective for a large group of attack algorithms. In addition, most of the existing defences only perform well for small images (e.g. MNIST and Canadian Institute for Advanced Research (CIFAR)) rather than large images (e.g. ImageNet). In this paper, the authors propose a robust and effective defence method that analyses the sensitivity of various AEs, especially in the much harder case of large images. Their method first maps the input space to a new feature space, utilising 19 different feature mapping methods. Then, a detector is trained with a machine‐learning algorithm to recognise the unique distribution of AEs. Extensive evaluations show that the proposed detector achieves: (i) a low false‐positive rate (<1%), (ii) a high true‐positive rate (higher than 98%), (iii) low overhead (<0.1 s per input), and (iv) good robustness (it works well across different learning models, attack algorithms, and parameters), which demonstrates the efficacy of the proposed detector in practice.
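
The false‐positive and true‐positive rates quoted in the abstract can be made concrete with a small sketch. The Python snippet below is illustrative only and is not the authors' code: it assumes a hypothetical detector that assigns each input a score and flags the input as adversarial when the score exceeds a threshold. The function name `detection_rates`, the scores, the labels, and the threshold are all assumptions made for this example, and the numbers are made up rather than taken from the paper.

```python
from typing import Sequence, Tuple


def detection_rates(
    scores: Sequence[float],
    is_adversarial: Sequence[bool],
    threshold: float,
) -> Tuple[float, float]:
    """Return (true_positive_rate, false_positive_rate) for a score-based detector.

    An input is flagged as adversarial when its score is strictly above `threshold`.
    """
    if len(scores) != len(is_adversarial):
        raise ValueError("scores and is_adversarial must have the same length")

    tp = fp = fn = tn = 0
    for score, adversarial in zip(scores, is_adversarial):
        flagged = score > threshold
        if adversarial and flagged:
            tp += 1  # adversarial input correctly flagged
        elif adversarial:
            fn += 1  # adversarial input missed
        elif flagged:
            fp += 1  # benign input wrongly flagged
        else:
            tn += 1  # benign input correctly passed

    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("need at least one adversarial and one benign input")

    return tp / (tp + fn), fp / (fp + tn)


# Made-up scores for illustration only (not results from the paper).
example_scores = [0.9, 0.8, 0.7, 0.2, 0.1, 0.6, 0.3, 0.05]
example_labels = [True, True, True, True, False, False, False, False]
tpr, fpr = detection_rates(example_scores, example_labels, threshold=0.5)
```

Under this definition, the true‐positive rate is the fraction of AEs that the detector flags, and the false‐positive rate is the fraction of benign inputs that it flags by mistake.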

Published in IET Computer Vision

ISSN: 1751-9632 (Print); 1751-9640 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519640
