Developing Hessian–Free Second–Order Adversarial Examples for Adversarial Training

Qian Yaguan; Zhang Liangjian; Wang Yuqi; Ji Boyuan; Yao Tengteng; Wang Bin

doi:10.61822/amcs-2024-0030

International Journal of Applied Mathematics and Computer Science (Sep 2024)

Developing Hessian–Free Second–Order Adversarial Examples for Adversarial Training

Qian Yaguan,
Zhang Liangjian,
Wang Yuqi,
Ji Boyuan,
Yao Tengteng,
Wang Bin

Affiliations

Qian Yaguan: College of Science, Zhejiang University of Science and Technology, Hangzhou310023, China
Zhang Liangjian: College of Science, Zhejiang University of Science and Technology, Hangzhou310023, China
Wang Yuqi: College of Science, Zhejiang University of Science and Technology, Hangzhou310023, China
Ji Boyuan: College of Science, Zhejiang University of Science and Technology, Hangzhou310023, China
Yao Tengteng: College of Science, Zhejiang University of Science and Technology, Hangzhou310023, China
Wang Bin: Zhejiang Key Laboratory of Artificial Intelligence of Things (AIoT) Network and Data Security, Hangzhou310052, China

DOI: https://doi.org/10.61822/amcs-2024-0030
Journal volume & issue: Vol. 34, no. 3
pp. 425 – 438

Abstract

Read online

Recent studies show that deep neural networks (DNNs) are extremely vulnerable to elaborately designed adversarial examples. Adversarial training, which uses adversarial examples as training data, has been proven to be one of the most effective methods of defense against adversarial attacks. However, most existing adversarial training methods use adversarial examples relying on first-order gradients, which perform poorly against second-order adversarial attacks and make it difficult to further improve the robustness of the model. In contrast to first-order gradients, second-order gradients provide a more accurate approximation of the loss landscape relative to natural examples. Therefore, our work focuses on constructing second-order adversarial examples and utilizing them for training DNNs. However, second-order optimization involves computing the Hessian inverse, which typically consumes considerable time. To address this issue, we propose an approximation method that transforms the problem into optimization within the Krylov subspace. Compared with the Euclidean space, the Krylov subspace method typically does not require storing the entire matrix. It only needs to store vectors and intermediate results, avoiding explicitly calculating the complete Hessian matrix. We approximate the adversarial direction by a linear combination of Hessian-vector products in the Krylov subspace to reduce the computation cost. Because of the non-symmetrical Hessian matrix, we use the generalized minimum residual to search for an approximate polynomial solution of the matrix. Our method efficiently reduces computational complexity and accelerates the training process. Extensive experiments conducted on the MNIST, CIFAR-10, and ImageNet-100 datasets demonstrate that our adversarial learning using second-order adversarial samples outperforms other first-order methods, leading to improved model robustness against various attacks.

Published in International Journal of Applied Mathematics and Computer Science

ISSN: 2083-8492 (Online)
Publisher: Sciendo
Country of publisher: Poland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.amcs.uz.zgora.pl/

About the journal

Abstract

Keywords