Toward Unbiased High-Quality Portraits through Latent-Space Evaluation

Doaa Almhaithawi; Alessandro Bellini; Tania Cerquitelli

doi:10.3390/jimaging10070157

Journal of Imaging (Jun 2024)

Toward Unbiased High-Quality Portraits through Latent-Space Evaluation

Doaa Almhaithawi,
Alessandro Bellini,
Tania Cerquitelli

Affiliations

Doaa Almhaithawi: Department of Control and Computer Engineering, Politecnico di Torino, 10129 Torino, Italy
Alessandro Bellini: Prime Lab, Mathema s.r.l., 50142 Florence, Italy
Tania Cerquitelli: Department of Control and Computer Engineering, Politecnico di Torino, 10129 Torino, Italy

DOI: https://doi.org/10.3390/jimaging10070157
Journal volume & issue: Vol. 10, no. 7
p. 157

Abstract

Read online

Images, texts, voices, and signals can be synthesized by latent spaces in a multidimensional vector, which can be explored without the hurdles of noise or other interfering factors. In this paper, we present a practical use case that demonstrates the power of latent space in exploring complex realities such as image space. We focus on DaVinciFace, an AI-based system that explores the StyleGAN2 space to create a high-quality portrait for anyone in the style of the Renaissance genius Leonardo da Vinci. The user enters one of their portraits and receives the corresponding Da Vinci-style portrait as an output. Since most of Da Vinci’s artworks depict young and beautiful women (e.g., “La Belle Ferroniere”, “Beatrice de’ Benci”), we investigate the ability of DaVinciFace to account for other social categorizations, including gender, race, and age. The experimental results evaluate the effectiveness of our methodology on 1158 portraits acting on the vector representations of the latent space to produce high-quality portraits that retain the facial features of the subject’s social categories, and conclude that sparser vectors have a greater effect on these features. To objectively evaluate and quantify our results, we solicited human feedback via a crowd-sourcing campaign. Analysis of the human feedback showed a high tolerance for the loss of important identity features in the resulting portraits when the Da Vinci style is more pronounced, with some exceptions, including Africanized individuals.

Published in Journal of Imaging

ISSN: 2313-433X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Photography; Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/jimaging

About the journal

Abstract

Keywords