Attacking Black-Box Image Classifiers With Particle Swarm Optimization

Quanxin Zhang; Kunqing Wang; Wenjiao Zhang; Jingjing Hu

doi:10.1109/ACCESS.2019.2948146

IEEE Access (Jan 2019)

Affiliations

Quanxin Zhang: School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
Kunqing Wang: School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
Wenjiao Zhang: Aerospace Times Feihong Technology Company, Ltd., Beijing, China
Jingjing Hu: School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China

DOI: https://doi.org/10.1109/ACCESS.2019.2948146
Journal volume & issue: Vol. 7
pp. 158051 – 158063

Abstract

Deep Neural Networks (DNNs) are susceptible to adversarial examples. Evaluating the performance of existing neural network classifiers and augmenting training sets to improve their robustness both require more effective methods for generating adversarial examples. Under black-box conditions, little is known about the parameters of the classification model, the number of queries is limited, and little feedback information is available, so generating adversarial examples against a black-box model is difficult. To further improve the efficiency of adversarial image generation, we propose two variants of the Particle Swarm Optimization algorithm (vPSO), based on traditional Particle Swarm Optimization, for targeted and non-targeted attacks under completely black-box conditions. Compared with existing state-of-the-art generation algorithms, vPSO effectively reduces the number of queries to the black-box classifier and the dependence on feedback information. The success rate of the targeted attack reaches up to 96.0%, and the average number of queries to the black-box model is greatly reduced. Furthermore, we propose an efficient target image screening method for targeted attacks, introduce the concepts of easy-to-attack and hard-to-attack images in non-targeted attacks, and describe how to distinguish them.
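
The abstract does not spell out the two vPSO variants, so the following is only a minimal sketch of a query-based, non-targeted attack driven by textbook Particle Swarm Optimization, not the authors' method. Everything in it is an assumption made for illustration: the toy linear softmax model standing in for the black-box classifier, the names `make_toy_classifier` and `pso_untargeted_attack`, the fitness (the probability the model assigns to the true class), the L-infinity bound `eps`, and the values of `w`, `c1`, and `c2`.

```python
import math
import random


def make_toy_classifier(dim, n_classes=3, seed=1):
    """Hypothetical stand-in for a black-box model: a fixed random linear softmax."""
    rng = random.Random(seed)
    weights = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_classes)]

    def query(image):
        # The attacker only ever sees this returned probability vector.
        logits = [sum(w * p for w, p in zip(row, image)) for row in weights]
        top = max(logits)
        exps = [math.exp(v - top) for v in logits]
        total = sum(exps)
        return [e / total for e in exps]

    return query


def pso_untargeted_attack(query, x, true_label, eps=0.1, n_particles=20,
                          n_iters=50, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Textbook PSO over perturbations d with max |d_j| <= eps.

    x is a flat list of pixel values in [0, 1]. Returns (adversarial image
    or None, number of queries issued to the model).
    """
    rng = random.Random(seed)
    dim = len(x)
    queries = 0

    def apply(d):
        # Add the perturbation and keep pixels in the valid range.
        return [min(1.0, max(0.0, p + q)) for p, q in zip(x, d)]

    def evaluate(d):
        nonlocal queries
        queries += 1
        probs = query(apply(d))
        return probs[true_label], probs.index(max(probs))

    pos = [[rng.uniform(-eps, eps) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [float("inf")] * n_particles
    gbest, gbest_fit = pos[0][:], float("inf")

    for it in range(n_iters + 1):  # pass 0 only scores the initial swarm
        for i in range(n_particles):
            if it > 0:
                for j in range(dim):
                    # Inertia + pull toward personal best + pull toward global best.
                    vel[i][j] = (w * vel[i][j]
                                 + c1 * rng.random() * (pbest[i][j] - pos[i][j])
                                 + c2 * rng.random() * (gbest[j] - pos[i][j]))
                    pos[i][j] = min(eps, max(-eps, pos[i][j] + vel[i][j]))
            fit, label = evaluate(pos[i])
            if label != true_label:  # prediction flipped: stop querying
                return apply(pos[i]), queries
            if fit < pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], fit
            if fit < gbest_fit:
                gbest, gbest_fit = pos[i][:], fit
    return None, queries


query = make_toy_classifier(dim=16)
x = [0.5] * 16
probs = query(x)
label = probs.index(max(probs))
adv, n_queries = pso_untargeted_attack(query, x, label, eps=0.2)
print("attack succeeded:", adv is not None, "| queries used:", n_queries)
```

The query counter is returned alongside the result because the abstract measures attacks by how many queries they make to the black-box model. A targeted version could, under the same assumptions, maximize the probability of a chosen target class instead of minimizing that of the true class.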

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
