Mape: defending against transferable adversarial attacks using multi-source adversarial perturbations elimination

Xinlei Liu; Jichao Xie; Tao Hu; Peng Yi; Yuxiang Hu; Shumin Huo; Zhen Zhang

doi:10.1007/s40747-024-01770-z

Complex & Intelligent Systems (Jan 2025)

Mape: defending against transferable adversarial attacks using multi-source adversarial perturbations elimination

Xinlei Liu,
Jichao Xie,
Tao Hu,
Peng Yi,
Yuxiang Hu,
Shumin Huo,
Zhen Zhang

Affiliations

Xinlei Liu: Information Engineering University
Jichao Xie: Information Engineering University
Tao Hu: Information Engineering University
Peng Yi: Information Engineering University
Yuxiang Hu: Information Engineering University
Shumin Huo: Information Engineering University
Zhen Zhang: Information Engineering University

DOI: https://doi.org/10.1007/s40747-024-01770-z
Journal volume & issue: Vol. 11, no. 2
pp. 1 – 17

Abstract

Read online

Abstract Neural networks are vulnerable to meticulously crafted adversarial examples, leading to high-confidence misclassifications in image classification tasks. Due to their consistency with regular input patterns and the absence of reliance on the target model and its output information, transferable adversarial attacks exhibit a notably high stealthiness and detection difficulty, making them a significant focus of defense. In this work, we propose a deep learning defense known as multi-source adversarial perturbations elimination (MAPE) to counter diverse transferable attacks. MAPE comprises the single-source adversarial perturbation elimination (SAPE) mechanism and the pre-trained models probabilistic scheduling algorithm (PPSA). SAPE utilizes a thoughtfully designed channel-attention U-Net as the defense model and employs adversarial examples generated by a pre-trained model (e.g., ResNet) for its training, thereby enabling the elimination of known adversarial perturbations. PPSA introduces model difference quantification and negative momentum to strategically schedule multiple pre-trained models, thereby maximizing the differences among adversarial examples during the defense model’s training and enhancing its robustness in eliminating adversarial perturbations. MAPE effectively eliminates adversarial perturbations in various adversarial examples, providing a robust defense against attacks from different substitute models. In a black-box attack scenario utilizing ResNet-34 as the target model, our approach achieves average defense rates of over 95.1% on CIFAR-10 and over 71.5% on Mini-ImageNet, demonstrating state-of-the-art performance.

Published in Complex & Intelligent Systems

ISSN: 2199-4536 (Print); 2198-6053 (Online)
Publisher: Springer
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://www.springer.com/journal/40747

About the journal

Abstract

Keywords