Robust Attentive Deep Neural Network for Detecting GAN-Generated Faces

Hui Guo; Shu Hu; Xin Wang; Ming-Ching Chang; Siwei Lyu

doi:10.1109/ACCESS.2022.3157297

IEEE Access (Jan 2022)

Robust Attentive Deep Neural Network for Detecting GAN-Generated Faces

Hui Guo,
Shu Hu,
Xin Wang,
Ming-Ching Chang,
Siwei Lyu

Affiliations

Hui Guo: ORCiD; Department of Computer Science, College of Engineering and Applied Sciences, University at Albany, State University of New York, Albany, NY, USA
Shu Hu: ORCiD; Department of Computer Science and Engineering, University at Buffalo, State University of New York, Buffalo, NY, USA
Xin Wang: Department of Computer Science and Engineering, University at Buffalo, State University of New York, Buffalo, NY, USA
Ming-Ching Chang: ORCiD; Department of Computer Science, College of Engineering and Applied Sciences, University at Albany, State University of New York, Albany, NY, USA
Siwei Lyu: ORCiD; Department of Computer Science and Engineering, University at Buffalo, State University of New York, Buffalo, NY, USA

DOI: https://doi.org/10.1109/ACCESS.2022.3157297
Journal volume & issue: Vol. 10
pp. 32574 – 32583

Abstract

Read online

Generative Adversarial Network (GAN) based techniques can generate and synthesize realistic faces that cause profound social concerns and security problems. Existing methods for detecting GAN-generated faces can perform well on limited public datasets. However, images from existing datasets do not represent real-world scenarios well enough in terms of view variations and data distributions, where real faces largely outnumber synthetic ones. The state-of-the-art methods do not generalize well in real-world problems and lack the interpretability of detection results. Performance of existing GAN-face detection models degrades accordingly when facing data imbalance issues. To address these shortcomings, we propose a robust, attentive, end-to-end framework that spots GAN-generated faces by analyzing eye inconsistencies. Our model automatically learns to identify inconsistent eye components by localizing and comparing artifacts between eyes. After the iris regions are extracted by Mask-RCNN, we design a Residual Attention Network (RAN) to examine the consistency between the corneal specular highlights of the two eyes. Our method can effectively learn from imbalanced data using a joint loss function combining the traditional cross-entropy loss with a relaxation of the ROC-AUC loss via Wilcoxon-Mann-Whitney (WMW) statistics. Comprehensive evaluations on a newly created FFHQ-GAN dataset in both balanced and imbalanced scenarios demonstrate the superiority of our method.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords