Analyzing temporal coherence for deepfake video detection

Muhammad Ahmad Amin; Yongjian Hu; Jiankun Hu

doi:10.3934/era.2024119

Electronic Research Archive (Mar 2024)

Analyzing temporal coherence for deepfake video detection

Muhammad Ahmad Amin,
Yongjian Hu,
Jiankun Hu

Affiliations

Muhammad Ahmad Amin: 1. School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510641, China
Yongjian Hu: 1. School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510641, China
Jiankun Hu: 2. School of Engineering and Information Technology, The University of New South Wales, Australian Defence Force Academy Canberra, ACT 2610, Australia

DOI: https://doi.org/10.3934/era.2024119
Journal volume & issue: Vol. 32, no. 4
pp. 2624 – 2641

Abstract

Read online

Current facial image manipulation techniques have caused public concerns while achieving impressive quality. However, these techniques are mostly bound to a single frame for synthesized videos and pay little attention to the most discriminatory temporal frequency artifacts between various frames. Detecting deepfake videos using temporal modeling still poses a challenge. To address this issue, we present a novel deepfake video detection framework in this paper that consists of two levels: temporal modeling and coherence analysis. At the first level, to fully capture temporal coherence over the entire video, we devise an efficient temporal facial pattern (TFP) mechanism that explores the color variations of forgery-sensitive facial areas by providing global and local-successive temporal views. The second level presents a temporal coherence analyzing network (TCAN), which consists of novel global temporal self-attention characteristics, high-resolution fine and low-resolution coarse feature extraction, and aggregation mechanisms, with the aims of long-range relationship modeling from a local-successive temporal perspective within a TFP and capturing the vital dynamic incoherence for robust detection. Thorough experiments on large-scale datasets, including FaceForensics++, DeepFakeDetection, DeepFake Detection Challenge, CelebDF-V2, and DeeperForensics, reveal that our paradigm surpasses current approaches and stays effective when detecting unseen sorts of deepfake videos.

Published in Electronic Research Archive

ISSN: 2688-1594 (Online)
Publisher: AIMS Press
Country of publisher: United States
LCC subjects: Science: Mathematics; Technology: Technology (General): Industrial engineering. Management engineering: Applied mathematics. Quantitative methods
Website: https://www.aimspress.com/journal/era

About the journal

Abstract

Keywords