Self-Supervised Spontaneous Latent-Based Facial Expression Sequence Generation

Chuin Hong Yap; Moi Hoon Yap; Adrian K. Davison; Ryan Cunningham

doi:10.1109/OJSP.2023.3275052

IEEE Open Journal of Signal Processing (Jan 2023)

Self-Supervised Spontaneous Latent-Based Facial Expression Sequence Generation

Chuin Hong Yap,
Moi Hoon Yap,
Adrian K. Davison,
Ryan Cunningham

Affiliations

Chuin Hong Yap: ORCiD; Department of Computing and Mathematics, Manchester Metropolitan University, Manchester, U.K.
Moi Hoon Yap: ORCiD; Department of Computing and Mathematics, Manchester Metropolitan University, Manchester, U.K.
Adrian K. Davison: ORCiD; Department of Computing and Mathematics, Manchester Metropolitan University, Manchester, U.K.
Ryan Cunningham: ORCiD; Department of Computing and Mathematics, Manchester Metropolitan University, Manchester, U.K.

DOI: https://doi.org/10.1109/OJSP.2023.3275052
Journal volume & issue: Vol. 4
pp. 304 – 312

Abstract

Read online

In this article, we investigate the spontaneity issue in facial expression sequence generation. Current leading methods in the field are commonly reliant on manually adjusted conditional variables to direct the model to generate a specific class of expression. We propose a neural network-based method which uses Gaussian noise to model spontaneity in the generation process, removing the need for manual control of conditional generation variables. Our model takes two sequential images as input, with additive noise, and produces the next image in the sequence. We trained two types of models: single-expression, and mixed-expression. With single-expression, unique facial movements of certain emotion class can be generated; with mixed expressions, fully spontaneous expression sequence generation can be achieved. We compared our method to current leading generation methods on a variety of publicly available datasets. Initial qualitative results show our method produces visually more realistic expressions and facial action unit (AU) trajectories; initial quantitative results using image quality metrics (SSIM and NIQE) show the quality of our generated images is higher. Our approach and results are novel in the field of facial expression generation, with potential wider applications to other sequence generation tasks.

Published in IEEE Open Journal of Signal Processing

ISSN: 2644-1322 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8782710

About the journal

Abstract

Keywords