Synthesising Facial Macro- and Micro-Expressions Using Reference Guided Style Transfer

Chuin Hong Yap; Ryan Cunningham; Adrian K. Davison; Moi Hoon Yap

doi:10.3390/jimaging7080142

Journal of Imaging (Aug 2021)

Affiliations

Chuin Hong Yap: Department of Computing and Mathematics, Manchester Metropolitan University, Manchester M15 6BH, UK
Ryan Cunningham: Department of Computing and Mathematics, Manchester Metropolitan University, Manchester M15 6BH, UK
Adrian K. Davison: Faculty of Biology, Medicine and Health, The University of Manchester, Manchester M13 9PL, UK
Moi Hoon Yap: Department of Computing and Mathematics, Manchester Metropolitan University, Manchester M15 6BH, UK

DOI: https://doi.org/10.3390/jimaging7080142
Journal volume & issue: Vol. 7, no. 8, p. 142

Abstract

Long video datasets of facial macro- and micro-expressions remain in strong demand given the current dominance of data-hungry deep learning methods. Few methods exist for generating long videos that contain micro-expressions. Moreover, there is a lack of performance metrics to quantify the quality of the generated data. To address these research gaps, we introduce a new approach to generate synthetic long videos and recommend assessment methods to inspect dataset quality. For synthetic long video generation, we use a state-of-the-art generative adversarial network style transfer method, StarGANv2. Using StarGANv2 pre-trained on the CelebA dataset, we transfer the style of a reference image from SAMM long videos (a facial micro- and macro-expression long video dataset) onto a source image from the FFHQ dataset to generate a synthetic dataset (SAMM-SYNTH). We evaluate SAMM-SYNTH by conducting an analysis based on the facial action units detected by OpenFace. For quantitative measurement, our findings show high correlation between the original and synthetic data on two Action Units (AUs), AU12 and AU6, with Pearson’s correlations of 0.74 and 0.72, respectively. This is further supported by the evaluation method proposed by OpenFace on those AUs, which also gives high scores of 0.85 and 0.59. Additionally, optical flow is used to visually compare the original facial movements and the transferred facial movements. With this article, we publish our dataset to enable future research and to increase the data pool for micro-expression research, especially for the spotting task.
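
To make the quantitative comparison above concrete, the following is a minimal sketch of how a Pearson correlation between per-frame AU intensities of an original video and its synthetic counterpart could be computed. It is not the authors' code: the function name `pearson_r` and the intensity values are hypothetical and chosen only for illustration, and the sketch assumes both sequences are already frame-aligned and of equal length.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("sequences must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of products of deviations (covariance up to a factor of 1/n).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Sums of squared deviations (variances up to a factor of 1/n).
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical per-frame AU12 intensities (illustrative values, not real data).
original_au12 = [0.0, 0.2, 0.9, 1.6, 1.1, 0.3]
synthetic_au12 = [0.1, 0.1, 0.7, 1.2, 1.0, 0.4]

r = pearson_r(original_au12, synthetic_au12)
print(f"Pearson r for AU12: {r:.2f}")
```

A value close to 1 would indicate that the AU intensity in the synthetic video rises and falls together with the original, which is the sense in which the reported correlations of 0.74 and 0.72 are read.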

Published in Journal of Imaging

ISSN: 2313-433X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Photography; Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/jimaging
