SPA-Net: A Deep Learning Approach Enhanced Using a Span-Partial Structure and Attention Mechanism for Image Copy-Move Forgery Detection

Kaiqi Zhao; Xiaochen Yuan; Zhiyao Xie; Yan Xiang; Guoheng Huang; Li Feng

doi:10.3390/s23146430

Sensors (Jul 2023)

SPA-Net: A Deep Learning Approach Enhanced Using a Span-Partial Structure and Attention Mechanism for Image Copy-Move Forgery Detection

Kaiqi Zhao,
Xiaochen Yuan,
Zhiyao Xie,
Yan Xiang,
Guoheng Huang,
Li Feng

Affiliations

Kaiqi Zhao: School of Computer Science and Engineering, Macau University of Science and Technology, Macao 999078, China
Xiaochen Yuan: Faculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China
Zhiyao Xie: Faculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China
Yan Xiang: School of Computer Science and Engineering, Macau University of Science and Technology, Macao 999078, China
Guoheng Huang: School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
Li Feng: School of Computer Science and Engineering, Macau University of Science and Technology, Macao 999078, China

DOI: https://doi.org/10.3390/s23146430
Journal volume & issue: Vol. 23, no. 14
p. 6430

Abstract

Read online

With the wide application of visual sensors and development of digital image processing technology, image copy-move forgery detection (CMFD) has become more and more prevalent. Copy-move forgery is copying one or several areas of an image and pasting them into another part of the same image, and CMFD is an efficient means to expose this. There are improper uses of forged images in industry, the military, and daily life. In this paper, we present an efficient end-to-end deep learning approach for CMFD, using a span-partial structure and attention mechanism (SPA-Net). The SPA-Net extracts feature roughly using a pre-processing module and finely extracts deep feature maps using the span-partial structure and attention mechanism as a SPA-net feature extractor module. The span-partial structure is designed to reduce the redundant feature information, while the attention mechanism in the span-partial structure has the advantage of focusing on the tamper region and suppressing the original semantic information. To explore the correlation between high-dimension feature points, a deep feature matching module assists SPA-Net to locate the copy-move areas by computing the similarity of the feature map. A feature upsampling module is employed to upsample the features to their original size and produce a copy-move mask. Furthermore, the training strategy of SPA-Net without pretrained weights has a balance between copy-move and semantic features, and then the module can capture more features of copy-move forgery areas and reduce the confusion from semantic objects. In the experiment, we do not use pretrained weights or models from existing networks such as VGG16, which would bring the limitation of the network paying more attention to objects other than copy-move areas.To deal with this problem, we generated a SPANet-CMFD dataset by applying various processes to the benchmark images from SUN and COCO datasets, and we used existing copy-move forgery datasets, CMH, MICC-F220, MICC-F600, GRIP, Coverage, and parts of USCISI-CMFD, together with our generated SPANet-CMFD dataset, as the training set to train our model. In addition, the SPANet-CMFD dataset could play a big part in forgery detection, such as deepfakes. We employed the CASIA and CoMoFoD datasets as testing datasets to verify the performance of our proposed method. The Precision, Recall, and F1 are calculated to evaluate the CMFD results. Comparison results showed that our model achieved a satisfactory performance on both testing datasets and performed better than the existing methods.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords