PVP: Personalized Video Prior for Editable Dynamic Portraits using StyleGAN

Lin, Kai-En; Trevithick, Alex; Cheng, Keli; Sarkis, Michel; Ghafoorian, Mohsen; Bi, Ning; Reitmayr, Gerhard; Ramamoorthi, Ravi

PVP: Personalized Video Prior for Editable Dynamic Portraits using StyleGAN

dc.contributor.author	Lin, Kai-En	en_US
dc.contributor.author	Trevithick, Alex	en_US
dc.contributor.author	Cheng, Keli	en_US
dc.contributor.author	Sarkis, Michel	en_US
dc.contributor.author	Ghafoorian, Mohsen	en_US
dc.contributor.author	Bi, Ning	en_US
dc.contributor.author	Reitmayr, Gerhard	en_US
dc.contributor.author	Ramamoorthi, Ravi	en_US
dc.contributor.editor	Ritschel, Tobias	en_US
dc.contributor.editor	Weidlich, Andrea	en_US
dc.date.accessioned	2023-06-27T07:03:42Z
dc.date.available	2023-06-27T07:03:42Z
dc.date.issued	2023
dc.description.abstract	Portrait synthesis creates realistic digital avatars which enable users to interact with others in a compelling way. Recent advances in StyleGAN and its extensions have shown promising results in synthesizing photorealistic and accurate reconstruction of human faces. However, previous methods often focus on frontal face synthesis and most methods are not able to handle large head rotations due to the training data distribution of StyleGAN. In this work, our goal is to take as input a monocular video of a face, and create an editable dynamic portrait able to handle extreme head poses. The user can create novel viewpoints, edit the appearance, and animate the face. Our method utilizes pivotal tuning inversion (PTI) to learn a personalized video prior from a monocular video sequence. Then we can input pose and expression coefficients to MLPs and manipulate the latent vectors to synthesize different viewpoints and expressions of the subject. We also propose novel loss functions to further disentangle pose and expression in the latent space. Our algorithm shows much better performance over previous approaches on monocular video datasets, and it is also capable of running in real-time at 54 FPS on an RTX 3080.	en_US
dc.description.number	4
dc.description.sectionheaders	Video and Editing
dc.description.seriesinformation	Computer Graphics Forum
dc.description.volume	42
dc.identifier.doi	10.1111/cgf.14890
dc.identifier.issn	1467-8659
dc.identifier.pages	13 pages
dc.identifier.uri	https://doi.org/10.1111/cgf.14890
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14890
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	CCS Concepts: Computing methodologies -> Image-based rendering
dc.subject	Computing methodologies
dc.subject	Image
dc.subject	based rendering
dc.title	PVP: Personalized Video Prior for Editable Dynamic Portraits using StyleGAN	en_US

Files

Original bundle

Now showing 1 - 2 of 2

Name:: v42i4_10_14890.pdf
Size:: 49.67 MB
Format:: Adobe Portable Document Format
Description:

Download

Name:: pvp_supp_crc.pdf
Size:: 68.42 MB
Format:: Adobe Portable Document Format

Download

Collections

42-Issue 4
EGSR23: 34th Eurographics Symposium on Rendering