Data augmentation method based on diffusion model for domain generalization

Yujun TONG; Heqing WANG; Yueheng LUO; Wenxin NING; Mandan GUAN; Wenqing YU; Keyan HUANG; Jiaxun ZHANG; Zhanyu MA

智能科学与技术学报 (Sep 2023)

Data augmentation method based on diffusion model for domain generalization

Yujun TONG,
Heqing WANG,
Yueheng LUO,
Wenxin NING,
Mandan GUAN,
Wenqing YU,
Keyan HUANG,
Jiaxun ZHANG,
Zhanyu MA

Affiliations

Yujun TONG
Heqing WANG
Yueheng LUO
Wenxin NING
Mandan GUAN
Wenqing YU
Keyan HUANG
Jiaxun ZHANG
Zhanyu MA

Journal volume & issue: Vol. 5
pp. 380 – 388

Abstract

Read online

Domain generalization is an important and challenging problem in computer vision, arising from the distribution shift of real-world data.In practical applications, it is common to encounter training and testing data from different domains, and the difference in data distribution can lead to performance degradation during testing.In this paper, we propose a domain generalization method based on latent space data augmentation.Unlike traditional image-level data augmentation approaches, the method introduces a diffusion model in the latent space to achieve fine control and diversity generation of features, thereby achieving feature level data augmentation and enhancing the model's generalization ability in the target domain.Specifically, the classifier-based implicit diffusion model, trained within the latent space, can conditionally generate accurate and rich source domain features.It leverages efficient sampling techniques to expedite the generation of augmented features.Experimental results show that the method has achieved significant performance improvement in various domain generalization tasks, and has good effectiveness and robustness in real scenarios.The key innovation of this paper lies in shifting data augmentation to the latent space level and introducing the diffusion model for augmentation, providing a novel approach to address the domain generalization problem.

domain generalization;diffusion model;data augmentation

Published in 智能科学与技术学报

ISSN: 2096-6652 (Print)
Publisher: POSTS&TELECOM PRESS Co., LTD
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.cjist.com.cn

About the journal

Abstract

Keywords