Secure machine learning against adversarial samples at test time

Jing Lin; Laurent L. Njilla; Kaiqi Xiong

doi:10.1186/s13635-021-00125-2

EURASIP Journal on Information Security (Jan 2022)

Secure machine learning against adversarial samples at test time

Jing Lin,
Laurent L. Njilla,
Kaiqi Xiong

Affiliations

Jing Lin: ICNS Lab and Cyber Florida, University of South Florida
Laurent L. Njilla: Cyber Assurance Branch, U.S. Air Force Research Laboratory
Kaiqi Xiong: ICNS Lab and Cyber Florida, University of South Florida

DOI: https://doi.org/10.1186/s13635-021-00125-2
Journal volume & issue: Vol. 2022, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Deep neural networks (DNNs) are widely used to handle many difficult tasks, such as image classification and malware detection, and achieve outstanding performance. However, recent studies on adversarial examples, which have maliciously undetectable perturbations added to their original samples that are indistinguishable by human eyes but mislead the machine learning approaches, show that machine learning models are vulnerable to security attacks. Though various adversarial retraining techniques have been developed in the past few years, none of them is scalable. In this paper, we propose a new iterative adversarial retraining approach to robustify the model and to reduce the effectiveness of adversarial inputs on DNN models. The proposed method retrains the model with both Gaussian noise augmentation and adversarial generation techniques for better generalization. Furthermore, the ensemble model is utilized during the testing phase in order to increase the robust test accuracy. The results from our extensive experiments demonstrate that the proposed approach increases the robustness of the DNN model against various adversarial attacks, specifically, fast gradient sign attack, Carlini and Wagner (C&W) attack, Projected Gradient Descent (PGD) attack, and DeepFool attack. To be precise, the robust classifier obtained by our proposed approach can maintain a performance accuracy of 99% on average on the standard test set. Moreover, we empirically evaluate the runtime of two of the most effective adversarial attacks, i.e., C&W attack and BIM attack, to find that the C&W attack can utilize GPU for faster adversarial example generation than the BIM attack can. For this reason, we further develop a parallel implementation of the proposed approach. This parallel implementation makes the proposed approach scalable for large datasets and complex models.

Published in EURASIP Journal on Information Security

ISSN: 2510-523X (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://jis.eurasipjournals.com/

About the journal

Abstract

Keywords