TraceGuard: Fine-Tuning Pre-Trained Model by Using Stego Images to Trace Its User

Limengnan Zhou; Xingdong Ren; Cheng Qian; Guangling Sun

doi:10.3390/math12213333

Mathematics (Oct 2024)

TraceGuard: Fine-Tuning Pre-Trained Model by Using Stego Images to Trace Its User

Limengnan Zhou,
Xingdong Ren,
Cheng Qian,
Guangling Sun

Affiliations

Limengnan Zhou: School of Electronic and Information Engineering, University of Electronic Science and Technology of China, Zhongshan Institute, Zhongshan 528402, China
Xingdong Ren: School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China
Cheng Qian: School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China
Guangling Sun: School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China

DOI: https://doi.org/10.3390/math12213333
Journal volume & issue: Vol. 12, no. 21
p. 3333

Abstract

Read online

Currently, a significant number of pre-trained models are published online to provide services to users owing to the rapid maturation and popularization of machine learning as a service (MLaaS). Some malicious users have pre-trained models illegally to redeploy them and earn money. However, most of the current methods focus on verifying the copyright of the model rather than tracing responsibility for the suspect model. In this study, TraceGuard is proposed, the first framework based on steganography for tracing a suspect self-supervised learning (SSL) pre-trained model, to ascertain which authorized user illegally released the suspect model or if the suspect model is independent. Concretely, the framework contains an encoder and decoder pair and the SSL pre-trained model. Initially, the base pre-trained model is frozen, and the encoder and decoder are jointly learned to ensure the two modules can embed the secret key into the cover image and extract the secret key from the embedding output by the base pre-trained model. Subsequently, the base pre-trained model is fine-tuned using stego images to implement a fingerprint while the encoder and decoder are frozen. To assure the effectiveness and robustness of the fingerprint and the utility of fingerprinted pre-trained models, three alternate steps of model stealing simulations, fine-tuning for uniqueness, and fine-tuning for utility are designed. Finally, the suspect pre-trained model is traced to its user by querying stego images. Experimental results demonstrate that TraceGuard can reliably trace suspect models and is robust against common fingerprint removal attacks such as fine-tuning, pruning, and model stealing. In the future, we will further improve the robustness against model stealing attack.

Published in Mathematics

ISSN: 2227-7390 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/mathematics

About the journal

Abstract

Keywords