Unsupervised learning of style-aware facial animation from real acting performances

被引:6
作者
Paier, Wolfgang [1 ]
Hilsmann, Anna [1 ]
Eisert, Peter [1 ,2 ]
机构
[1] Fraunhofer Heinrich Hertz Inst, Berlin, Germany
[2] Humboldt Univ, Berlin, Germany
基金
欧盟地平线“2020”;
关键词
Facial animation; Neural rendering; Neural animation; Self-supervised learning; Dynamic textures; VIDEO; MODEL;
D O I
10.1016/j.gmod.2023.101199
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents a novel approach for text/speech-driven animation of a photo-realistic head model based on blend-shape geometry, dynamic textures, and neural rendering. Training a VAE for geometry and texture yields a parametric model for accurate capturing and realistic synthesis of facial expressions from latent feature vector. Our animation method is based on a conditional CNN that transforms text or speech into a sequence of animation parameters. In contrast to previous approaches, our animation model learns disentangling/synthesizing different acting-styles in an unsupervised manner, requiring only phonetic labels that describe the content of training sequences. For realistic real-time rendering, we train a U-Net that refines rasterization-based renderings by computing improved pixel colors and a foreground matte. We compare our framework qualitatively/quantitatively against recent methods for head modeling as well as facial animation and evaluate the perceived rendering/animation quality in a user-study, which indicates large improvements compared to state-of-the-art approaches.
引用
收藏
页数:13
相关论文
共 86 条
[1]   Neural Point-Based Graphics [J].
Aliev, Kara-Ali ;
Sevastopolsky, Artem ;
Kolos, Maria ;
Ulyanov, Dmitry ;
Lempitsky, Victor .
COMPUTER VISION - ECCV 2020, PT XXII, 2020, 12367 :696-712
[2]   Deep Relightable Appearance Models for Animatable Faces [J].
Bi, Sai ;
Lombardi, Stephen ;
Saito, Shunsuke ;
Simon, Tomas ;
Wei, Shih-En ;
Mcphail, Kevyn ;
Ramamoorthi, Ravi ;
Sheikh, Yaser ;
Saragih, Jason .
ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (04)
[3]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[4]  
Borshukov George, 2006, ACM SIGGRAPH 2006 SK
[5]   Online Modeling For Realtime Facial Animation [J].
Bouaziz, Sofien ;
Wang, Yangang ;
Pauly, Mark .
ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04)
[6]   FaceWarehouse: A 3D Facial Expression Database for Visual Computing [J].
Cao, Chen ;
Weng, Yanlin ;
Zhou, Shun ;
Tong, Yiying ;
Zhou, Kun .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (03) :413-425
[7]   Free-viewpoint video of human actors [J].
Carranza, J ;
Theobalt, C ;
Magnor, MA ;
Seidel, HP .
ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03) :569-577
[8]   4D video textures for interactive character appearance [J].
Casas, Dan ;
Volino, Marco ;
Collomosse, John ;
Hilton, Adrian .
COMPUTER GRAPHICS FORUM, 2014, 33 (02) :371-380
[9]   EXPRESSION-AWARE FACE RECONSTRUCTION VIA A DUAL-STREAM NETWORK [J].
Chai, Xiaoyu ;
Chen, Jun ;
Liang, Chao ;
Xu, Dongshu ;
Lin, Chia-Wen .
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[10]   Semantic Deep Face Models [J].
Chandran, Prashanth ;
Bradley, Derek ;
Gross, Markus ;
Beeler, Thabo .
2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, :345-354