Exploring the Efficacy of Learning Techniques in Model Extraction Attacks on Image Classifiers: A Comparative Study

Dong Han; Reza Babaei; Shangqing Zhao; Samuel Cheng

doi:10.3390/app14093785

Applied Sciences (Apr 2024)

Affiliations

Dong Han: School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK 73019, USA
Reza Babaei: School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK 73019, USA
Shangqing Zhao: School of Computer Science, University of Oklahoma, Norman, OK 73019, USA
Samuel Cheng: School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK 73019, USA

DOI: https://doi.org/10.3390/app14093785
Journal volume & issue: Vol. 14, no. 9, p. 3785

Abstract

In the rapidly evolving landscape of cybersecurity, model extraction attacks pose a significant challenge, undermining the integrity of machine learning models by enabling adversaries to replicate proprietary algorithms without direct access. This paper presents a comprehensive study of model extraction attacks against image classification models, focusing on the efficacy of various Deep Q-network (DQN) extensions for enhancing the performance of surrogate models. The goal is to identify the most efficient approaches for choosing images that optimize adversarial benefits. Additionally, we explore synthetic data generation techniques, including the Jacobian-based method, Linf-Projected Gradient Descent (LinfPGD), and the Fast Gradient Sign Method (FGSM), aiming to facilitate the training of adversary models with enhanced performance. Our investigation also extends to data-free model extraction attacks, examining their feasibility and performance under constrained query budgets. When all of these methods are compared under such budgets, the Prioritized Experience Replay (PER) technique emerges as the most effective, outperforming the other DQN extensions and the synthetic data generation methods. Through rigorous experimentation, including multiple trials to ensure statistical significance, this work provides valuable insights into optimizing model extraction attacks.
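
As an illustration of the FGSM step named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes a hypothetical linear softmax classifier (weights `W`, bias `b`) in place of a real image model, so that the input gradient has a closed form; the names `fgsm` and `cross_entropy`, the toy dimensions, and the value `eps=0.05` are all illustrative assumptions.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D logit vector.
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def cross_entropy(x, y, W, b):
    # Cross-entropy loss of the (assumed) linear classifier for true label y.
    return -np.log(softmax(W @ x + b)[y])

def fgsm(x, y, W, b, eps):
    # One FGSM step: move each input coordinate by eps in the direction
    # of the sign of the loss gradient, then clip to the valid pixel range.
    p = softmax(W @ x + b)
    onehot = np.zeros_like(p)
    onehot[y] = 1.0
    grad_x = W.T @ (p - onehot)  # d(cross-entropy)/dx for a linear softmax model
    x_adv = x + eps * np.sign(grad_x)
    return np.clip(x_adv, 0.0, 1.0)

# Toy setup (hypothetical): 3 classes, 8 "pixels" with values in [0.2, 0.8].
rng = np.random.default_rng(0)
W = rng.normal(size=(3, 8))
b = np.zeros(3)
x = rng.uniform(0.2, 0.8, size=8)
y = 1

x_adv = fgsm(x, y, W, b, eps=0.05)
print("max perturbation:", np.max(np.abs(x_adv - x)))
print("loss before/after:", cross_entropy(x, y, W, b), cross_entropy(x_adv, y, W, b))
```

In an extraction setting, samples perturbed this way would be sent to the victim model for labels and added to the surrogate's training set; with a deep network, the closed-form gradient above would be replaced by automatic differentiation through the surrogate.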

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
