Skip to main navigation Skip to search Skip to main content

StyleMask: disentangling the style space of StyleGAN2 for neural face reenactment

  • Stella Bounareli
  • , Christos Tzelepis
  • , Vasileios Argyriou
  • , Ioannis Patras
  • , Georgios Tzimiropoulos
  • Kingston University
  • Queen Mary University of London

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper we address the problem of neural face reenactment, where, given a pair of a source and a target facial image, we need to transfer the target's pose (defined as the head pose and its facial expressions) to the source image, by preserving at the same time the source's identity characteristics (e.g., facial shape, hair style, etc), even in the challenging case where the source and the target faces belong to different identities. In doing so, we address some of the limitations of the state-of-the-art works, namely, a) that they depend on paired training data (i.e., source and target faces have the same identity), b) that they rely on labeled data during inference, and c) that they do not preserve identity in large head pose changes. More specifically, we propose a framework that, using unpaired randomly generated facial images, learns to disentangle the identity characteristics of the face from its pose by incorporating the recently introduced style space S [1] of StyleGAN2 [2], a latent representation space that exhibits remarkable disentanglement properties. By capitalizing on this, we learn to successfully mix a pair of source and target style codes using supervision from a 3D model. The resulting latent code, that is subsequently used for reenactment, consists of latent units corresponding to the facial pose of the target only and of units corresponding to the identity of the source only, leading to notable improvement in the reenactment performance compared to recent state-of-the-art methods. In comparison to state of the art, we quantitatively and qualitatively show that the proposed method produces higher quality results even on extreme pose variations. Finally, we report results on real images by first embedding them on the latent space of the pretrained generator. We make the code and the pretrained models publicly available at: https://github.com/StelaBou/StyleMask.

Original languageEnglish
Title of host publicationProceedings of the 2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition, FG 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Number of pages8
ISBN (Electronic)9798350345445
ISBN (Print)9798350345452
DOIs
Publication statusPublished - 16 Feb 2023
Event17th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2023 co-located with WACV 2023 - Waikoloa Beach, United States
Duration: 5 Jan 20238 Jan 2023

Publication series

NameProceedings of the International Conference on Automatic Face and Gesture Recognition
PublisherInstitute of Electrical and Electronics Engineers

Conference

Conference17th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2023 co-located with WACV 2023
Country/TerritoryUnited States
CityWaikoloa Beach
Period5/01/238/01/23

Fingerprint

Dive into the research topics of 'StyleMask: disentangling the style space of StyleGAN2 for neural face reenactment'. Together they form a unique fingerprint.

Cite this