Complex & Intelligent Systems (Dec 2024)

Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation

  • Chang-Teng Shi,
  • Meng-Jun Li,
  • Zhi Yong An

DOI
https://doi.org/10.1007/s40747-024-01673-z
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 19

Abstract

Face super-resolution technology can significantly enhance the resolution and quality of face images, which is crucial for applications such as surveillance, forensics, and face recognition. However, existing methods often fail to fully utilize multi-scale information and facial priors, resulting in poor recovery of facial structures in complex images. To address this issue, we propose a face super-resolution method based on iterative collaboration between a facial reconstruction network and a landmark estimation network. The method employs a Multi-Convolutional Attention Block for multi-scale feature extraction and introduces an Attention Fusion Block to enhance features using facial priors; features are then further refined by a Residual Window Attention Group. Furthermore, the facial reconstruction network and the landmark estimation network collaborate iteratively: at each step, landmark priors are used to generate higher-quality images, which in turn yield improved landmark estimates, thereby gradually enhancing performance. Evaluated on the standard 4×, 8×, and 16× super-resolution tasks on the CelebA and Helen datasets, the method demonstrates strong performance and achieves competitive SSIM, PSNR, and LPIPS scores. Specifically, in the 8× super-resolution experiment, the PSNR/SSIM/LPIPS on the CelebA dataset are 27.68 dB/0.8112/0.0866, outperforming existing state-of-the-art methods in terms of both accuracy and visual quality.
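
The iterative collaboration described above can be summarized as a simple alternating loop. The following is a minimal sketch under assumed interfaces (the module names `recon_net`, `landmark_net`, and the 8× upscaling factor are illustrative placeholders, not the authors' released code): the reconstruction network consumes the current landmark prior to produce a sharper image, which is then fed back to the landmark estimator to obtain a better prior for the next step.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class IterativeCollaborationSR(nn.Module):
    """Alternates between face reconstruction and landmark estimation."""

    def __init__(self, recon_net: nn.Module, landmark_net: nn.Module, num_steps: int = 3):
        super().__init__()
        self.recon_net = recon_net        # hypothetical face reconstruction network
        self.landmark_net = landmark_net  # hypothetical landmark estimation network
        self.num_steps = num_steps

    def forward(self, lr_image: torch.Tensor):
        # Initial landmark estimate from a bicubically upsampled LR input
        # (8x chosen here only as an example scale factor).
        sr_image = F.interpolate(lr_image, scale_factor=8, mode="bicubic", align_corners=False)
        landmarks = self.landmark_net(sr_image)
        for _ in range(self.num_steps):
            # Reconstruct a higher-quality image using the current landmark prior ...
            sr_image = self.recon_net(lr_image, landmarks)
            # ... then refine the landmark estimate on the sharper image.
            landmarks = self.landmark_net(sr_image)
        return sr_image, landmarks
```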

Keywords