An improved face attributes editing method based on DDIM

Libo He; Qingyang Chen; Yun Pang; Meijiao Wang; Yunyun Wu; Ling Liu; Zhenping Qiang

doi:10.1038/s41598-024-78378-3

Scientific Reports (Nov 2024)

An improved face attributes editing method based on DDIM

Libo He,
Qingyang Chen,
Yun Pang,
Meijiao Wang,
Yunyun Wu,
Ling Liu,
Zhenping Qiang

Affiliations

Libo He: Information Security College, Yunnan Police College
Qingyang Chen: College of Big Data and Intelligent Engineering, Southwest Forestry University
Yun Pang: College of Big Data and Intelligent Engineering, Southwest Forestry University
Meijiao Wang: Information Security College, Yunnan Police College
Yunyun Wu: Information Security College, Yunnan Police College
Ling Liu: Information Security College, Yunnan Police College
Zhenping Qiang: College of Big Data and Intelligent Engineering, Southwest Forestry University

DOI: https://doi.org/10.1038/s41598-024-78378-3
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 18

Abstract

Read online

Abstract The main advantage of DDIM is that it guarantees the quality of the generated images while increasing the efficiency of the generation by modifying the sampling strategy in the diffusion process. DiffusionRig, which addresses the problem of maintaining identity consistency by learning the person-specific facial prior in a tiny personalized dataset, is a successful representation of the DDIM strategy. Based on DiffusionRig, in this article, we propose an improved face attributes editing method based on DDIM to improve naturalness and accuracy of editing results in complex face attribute editing tasks and the generalization ability. Our method combines DDIM and DECA together and use two-stage training strategy. To reduce DiffusionRig’s limitations in handling face attribute editing tasks that require nonlinear understanding and fine-tuning, our method also introduces a channel attention mechanism and a depth-separable convolution technique in the training model. In the first stage we trained our model on the open FFHQ dataset, which consists of 30,000 high-resolution face image. In the second stage, the model was refined by using a tiny personalized dataset. Face attribute editing experiments,comparative experiment with DiffusionRig, and a series of ablation experiments have been performed. Then, the validity of our improved method is verified by qualitative and quantitative analysis.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal