Real‐world super‐resolution of face‐images from surveillance cameras

Andreas Aakerberg; Kamal Nasrollahi; Thomas B. Moeslund

doi:10.1049/ipr2.12359

IET Image Processing (Feb 2022)

Real‐world super‐resolution of face‐images from surveillance cameras

Andreas Aakerberg,
Kamal Nasrollahi,
Thomas B. Moeslund

Affiliations

Andreas Aakerberg: Visual Analysis and Perception Aalborg University Rendsburggade 14 Aalborg Denmark
Kamal Nasrollahi: Visual Analysis and Perception Aalborg University Rendsburggade 14 Aalborg Denmark
Thomas B. Moeslund: Visual Analysis and Perception Aalborg University Rendsburggade 14 Aalborg Denmark

DOI: https://doi.org/10.1049/ipr2.12359
Journal volume & issue: Vol. 16, no. 2
pp. 442 – 452

Abstract

Read online

Abstract Most existing face image Super‐Resolution (SR) methods assume that the Low‐Resolution (LR) images were artificially downsampled from High‐Resolution (HR) images with bicubic interpolation. This operation changes the natural image characteristics and reduces noise. Hence, SR methods trained on such data most often fail to produce good results when applied to real LR images. To solve this problem, a novel framework for the generation of realistic LR/HR training pairs is proposed. The framework estimates realistic blur kernels, noise distributions, and JPEG compression artifacts to generate LR images with similar image characteristics as the ones in the source domain. This allows to train an SR model using high‐quality face images as Ground‐Truth (GT). For better perceptual quality, a Generative Adversarial Network (GAN) based SR model is used, where the commonly used VGG‐loss [1] is exchanged with LPIPS‐loss [2]. Experimental results on both real and artificially corrupted face images show that our method results in more detailed reconstructions with less noise compared to the existing State‐of‐the‐Art (SoTA) methods. In addition, it is shown that the traditional non‐reference Image Quality Assessment (IQA) methods fail to capture this improvement and demonstrate that the more recent NIMA metric [3] correlates better with human perception via Mean Opinion Rank (MOR).

Published in IET Image Processing

ISSN: 1751-9659 (Print); 1751-9667 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Photography; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519667

About the journal