Unsupervised learning of style-aware facial animation from real acting performances

被引：6

作者：

Paier, Wolfgang ^{[1
]}

Hilsmann, Anna ^{[1
]}

Eisert, Peter ^{[1
,2
]}

机构：

[1] Fraunhofer Heinrich Hertz Inst, Berlin, Germany

[2] Humboldt Univ, Berlin, Germany

来源：

GRAPHICAL MODELS | 2023年 / 129卷

基金：

欧盟地平线“2020”;

关键词：

Facial animation; Neural rendering; Neural animation; Self-supervised learning; Dynamic textures; VIDEO; MODEL;

D O I：

10.1016/j.gmod.2023.101199

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

This paper presents a novel approach for text/speech-driven animation of a photo-realistic head model based on blend-shape geometry, dynamic textures, and neural rendering. Training a VAE for geometry and texture yields a parametric model for accurate capturing and realistic synthesis of facial expressions from latent feature vector. Our animation method is based on a conditional CNN that transforms text or speech into a sequence of animation parameters. In contrast to previous approaches, our animation model learns disentangling/synthesizing different acting-styles in an unsupervised manner, requiring only phonetic labels that describe the content of training sequences. For realistic real-time rendering, we train a U-Net that refines rasterization-based renderings by computing improved pixel colors and a foreground matte. We compare our framework qualitatively/quantitatively against recent methods for head modeling as well as facial animation and evaluate the perceived rendering/animation quality in a user-study, which indicates large improvements compared to state-of-the-art approaches.

引用

页数：13

共 86 条

[1] Neural Point-Based Graphics [J].

Aliev, Kara-Ali ;

Sevastopolsky, Artem ;

Kolos, Maria ;

Ulyanov, Dmitry ;

Lempitsky, Victor .

COMPUTER VISION - ECCV 2020, PT XXII, 2020, 12367 :696-712

[2] Deep Relightable Appearance Models for Animatable Faces [J].

Bi, Sai ;

Lombardi, Stephen ;

Saito, Shunsuke ;

Simon, Tomas ;

Wei, Shih-En ;

Mcphail, Kevyn ;

Ramamoorthi, Ravi ;

Sheikh, Yaser ;

Saragih, Jason .

ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (04)

[3] A morphable model for the synthesis of 3D faces [J].

Blanz, V ;

Vetter, T .

SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194

[4]

Borshukov George, 2006, ACM SIGGRAPH 2006 SK

[5] Online Modeling For Realtime Facial Animation [J].

Bouaziz, Sofien ;

Wang, Yangang ;

Pauly, Mark .

ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04)

[6] FaceWarehouse: A 3D Facial Expression Database for Visual Computing [J].

Cao, Chen ;

Weng, Yanlin ;

Zhou, Shun ;

Tong, Yiying ;

Zhou, Kun .

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (03) :413-425

[7] Free-viewpoint video of human actors [J].

Carranza, J ;

Theobalt, C ;

Magnor, MA ;

Seidel, HP .

ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03) :569-577

[8] 4D video textures for interactive character appearance [J].

Casas, Dan ;

Volino, Marco ;

Collomosse, John ;

Hilton, Adrian .

COMPUTER GRAPHICS FORUM, 2014, 33 (02) :371-380

[9] EXPRESSION-AWARE FACE RECONSTRUCTION VIA A DUAL-STREAM NETWORK [J].

Chai, Xiaoyu ;

Chen, Jun ;

Liang, Chao ;

Xu, Dongshu ;

Lin, Chia-Wen .

2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,

[10] Semantic Deep Face Models [J].

Chandran, Prashanth ;

Bradley, Derek ;

Gross, Markus ;

Beeler, Thabo .

2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, :345-354

← 1 2 3 4 5 6 7 8 9 →