Electronic Research Archive (Mar 2024)

Jointly learning and training: using style diversification to improve domain generalization for deepfake detection

  • Jicheng Li,
  • Beibei Liu,
  • Hao-Tian Wu,
  • Yongjian Hu,
  • Chang-Tsun Li

DOI
https://doi.org/10.3934/era.2024090
Journal volume & issue
Vol. 32, no. 3
pp. 1973–1997

Abstract

Most existing deepfake detection methods fail to maintain their performance when confronted with new test domains. To address this issue, we propose a generalizable deepfake detection system that implements style diversification by alternately learning a domain generalization (DG)-based detector and a stylized fake face synthesizer (SFFS). For the DG-based detector, we first adopt instance normalization- and batch normalization-based structures to extract local and global image statistics as style and content features, which are then leveraged to obtain a more diverse feature space. Subsequently, contrastive learning is used to emphasize common style features while suppressing domain-specific ones, and adversarial learning is performed to obtain domain-invariant features. These optimized features help the DG-based detector learn generalized classification features and also encourage the SFFS to simulate possibly unseen domain data. In return, the samples generated by the SFFS help the detector learn more generalized features from the augmented training data. Such a joint learning and training process enhances the feature representation capability of both the detector and the synthesizer for generalizable deepfake detection. Experimental results demonstrate that our method outperforms state-of-the-art competitors not only in intra-domain tests but, more notably, in cross-domain tests.
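To make the abstract's three ingredients concrete, here is a minimal PyTorch sketch (the function names and details are hypothetical illustrations, not the authors' released implementation): instance-norm statistics used as style descriptors, a generic supervised contrastive loss over those descriptors, and a gradient-reversal layer as the standard mechanism for adversarial learning of domain-invariant features.

```python
import torch
import torch.nn.functional as F


def style_stats(feat: torch.Tensor) -> torch.Tensor:
    """Instance-norm statistics (per-sample, per-channel mean/std) of a
    feature map, a common proxy for image 'style'; batch-level statistics
    (as used by batch normalization) play the role of shared 'content'.
    feat: (N, C, H, W) activations from an early conv block."""
    mu = feat.mean(dim=(2, 3))               # (N, C)
    sigma = feat.std(dim=(2, 3)) + 1e-6      # (N, C)
    return torch.cat([mu, sigma], dim=1)     # (N, 2C) style descriptor


def style_contrastive_loss(z: torch.Tensor, labels: torch.Tensor,
                           temperature: float = 0.1) -> torch.Tensor:
    """Generic supervised contrastive loss: descriptors sharing a label are
    pulled together (common style cues emphasized) while the rest are pushed
    apart (domain-specific cues suppressed)."""
    z = F.normalize(z, dim=1)
    sim = z @ z.t() / temperature                                   # (N, N)
    self_mask = torch.eye(z.size(0), dtype=torch.bool, device=z.device)
    pos_mask = labels.unsqueeze(0).eq(labels.unsqueeze(1)) & ~self_mask
    exp_sim = sim.exp().masked_fill(self_mask, 0.0)                 # drop self-pairs
    log_prob = sim - exp_sim.sum(dim=1, keepdim=True).log()
    loss = -log_prob.masked_fill(~pos_mask, 0.0).sum(1) / pos_mask.sum(1).clamp(min=1)
    return loss.mean()


class GradReverse(torch.autograd.Function):
    """Gradient reversal: identity on the forward pass, negated (scaled)
    gradient on the backward pass. Placed between a feature extractor and a
    domain classifier, it is the standard way to train features adversarially
    so that the domain cannot be predicted, i.e., domain-invariant features."""

    @staticmethod
    def forward(ctx, x, lam):
        ctx.lam = lam
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lam * grad_output, None


# Usage sketch (hypothetical tensors):
#   feats = backbone_block(images)                    # (N, C, H, W)
#   z = style_stats(feats)                            # style descriptors
#   L_con = style_contrastive_loss(z, domain_labels)  # contrastive term
#   rev = GradReverse.apply(pooled_features, 1.0)     # feed to domain classifier
```

In the abstract's joint scheme, losses of this kind would be combined with the binary real/fake classification objective, and the detector and the SFFS would be updated alternately, with the synthesizer's stylized samples fed back as augmented training data; the exact losses, weightings, and synthesizer architecture are given in the paper itself.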

Keywords