On the Relationship Between Universal Adversarial Attacks and Sparse Representations

Dana Weitzner; Raja Giryes

doi:10.1109/OJSP.2023.3244486

IEEE Open Journal of Signal Processing (Jan 2023)

On the Relationship Between Universal Adversarial Attacks and Sparse Representations

Dana Weitzner,
Raja Giryes

Affiliations

Dana Weitzner: ORCiD; Electrical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv, Israel
Raja Giryes: ORCiD; Electrical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv, Israel

DOI: https://doi.org/10.1109/OJSP.2023.3244486
Journal volume & issue: Vol. 4
pp. 99 – 107

Abstract

Read online

The prominent success of neural networks, mainly in computer vision tasks, is increasingly shadowed by their sensitivity to small, barely perceivable adversarial perturbations in image input. In this article, we aim at explaining this vulnerability through the framework of sparsity. We show the connection between adversarial attacks and sparse representations, with a focus on explaining the universality and transferability of adversarial examples in neural networks. To this end, we show that sparse coding algorithms, and the neural network-based learned iterative shrinkage thresholding algorithm (LISTA) among them, suffer from this sensitivity, and that common attacks on neural networks can be expressed as attacks on the sparse representation of the input image. The phenomenon that we observe holds true also when the network is agnostic to the sparse representation and dictionary, and thus can provide a possible explanation for the universality and transferability of adversarial attacks.

Published in IEEE Open Journal of Signal Processing

ISSN: 2644-1322 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8782710

About the journal

Abstract

Keywords