Data free knowledge distillation with feature synthesis and spatial consistency for image analysis

Pengchen Liang; Jianguo Chen; Yan Wu; Bin Pu; Haishan Huang; Qing Chang; Guo Ran

doi:10.1038/s41598-024-78757-w

Scientific Reports (Nov 2024)

Data free knowledge distillation with feature synthesis and spatial consistency for image analysis

Pengchen Liang,
Jianguo Chen,
Yan Wu,
Bin Pu,
Haishan Huang,
Qing Chang,
Guo Ran

Affiliations

Pengchen Liang: The Department of Anesthesiology, Eye & ENT Hospital, Fudan University
Jianguo Chen: School of Software Engineering, Sun Yat-sen University
Yan Wu: Huangdu Community Health Service Center
Bin Pu: Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology
Haishan Huang: School of Software Engineering, Sun Yat-sen University
Qing Chang: The Department Shanghai Key Laboratory of Gastric Neoplasms, Department of Surgery, Shanghai Institute of Digestive Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine
Guo Ran: The Department of Anesthesiology, Eye & ENT Hospital, Fudan University

DOI: https://doi.org/10.1038/s41598-024-78757-w
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Privacy and security concerns restrict access to original training datasets, posing significant challenges for model compression. Data-Free Knowledge Distillation (DFKD) emerges as a solution, aiming to transfer knowledge from teacher to student networks without accessing original data. Existing DFKD methods struggle to generate high-quality synthetic samples that capture the complexities of real-world data, leading to suboptimal knowledge transfer. Moreover, these approaches often fail to preserve the spatial attributes of the teacher network, resulting in shortcut learning and limited generalization.To address these issues, a novel DFKD strategy is proposed with three innovations: (1) an enhanced DCGAN generator with an attention module for synthesizing samples with improved micro-discriminative features; (2) a Multi-Scale Spatial Activation Region Consistency (MSARC) mechanism to accurately replicate the teacher’s spatial attributes; and (3) an adversarial learning framework that creates a dynamic competitive environment between the generative and distillation phases. Rigorous evaluation of the method on several benchmark datasets, including CIFAR-10, CIFAR-100, Tiny-ImageNet, and medical imaging datasets such as PathMNIST, BloodMNIST, and PneumoniaMNIST, demonstrates superior performance compared to existing DFKD methods. Specifically, on CIFAR-100, the student network attains an accuracy of 77.85%, surpassing previous methods like CMI and SpaceshipNet. On BloodMNIST, the method achieves an accuracy of 80.50%, outperforming the next best method by over 5%.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords