Learning to Generate Customized Dynamic 3D Facial Expressions

被引:13
作者
Potamias, Rolandos Alexandros [1 ]
Zheng, Jiali [1 ]
Ploumpis, Stylianos [1 ]
Bouritsas, Giorgos [1 ]
Ververas, Evangelos [1 ]
Zafeiriou, Stefanos [1 ]
机构
[1] Imperial Coll London, Dept Comp, London, England
来源
COMPUTER VISION - ECCV 2020, PT XXIX | 2020年 / 12374卷
基金
英国工程与自然科学研究理事会;
关键词
Expression generation; Facial animation; 4D synthesis; 4DFAB; Graph neural networks; DATABASE;
D O I
10.1007/978-3-030-58526-6_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in deep learning have significantly pushed the state-of-the-art in photorealistic video animation given a single image. In this paper, we extrapolate those advances to the 3D domain, by studying 3D image-to-video translation with a particular focus on 4D facial expressions. Although 3D facial generative models have been widely explored during the past years, 4D animation remains relatively unexplored. To this end, in this study we employ a deep mesh encoder-decoder like architecture to synthesize realistic high resolution facial expressions by using a single neutral frame along with an expression identification. In addition, processing 3D meshes remains a non-trivial task compared to data that live on grid-like structures, such as images. Given the recent progress in mesh processing with graph convolutions, we make use of a recently introduced learnable operator which acts directly on the mesh structure by taking advantage of local vertex orderings. In order to generalize to 4D facial expressions across subjects, we trained our model using a high resolution dataset with 4D scans of six facial expressions from 180 subjects. Experimental results demonstrate that our approach preserves the subject's identity information even for unseen subjects and generates high quality expressions. To the best of our knowledge, this is the first study tackling the problem of 4D facial expression synthesis.
引用
收藏
页码:278 / 294
页数:17
相关论文
共 49 条
[1]  
Alashkar T., 2014, 5 INT C EXH 3D BOD S
[2]  
Amberg B, 2008, IEEE INT CONF AUTOMA, P667
[3]  
Atwood J, 2016, ADV NEUR IN, V29
[4]  
Boscaini D, 2016, ADV NEUR IN, V29
[5]   Online Modeling For Realtime Facial Animation [J].
Bouaziz, Sofien ;
Wang, Yangang ;
Pauly, Mark .
ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04)
[6]   Neural 3D Morphable Models: Spiral Convolutional Networks for 3D Shape Representation Learning and Generation [J].
Bouritsas, Giorgos ;
Bokhnyak, Sergiy ;
Ploumpis, Stylianos ;
Bronstein, Michael ;
Zafeiriou, Stefanos .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7212-7221
[7]   Geometric Deep Learning Going beyond Euclidean data [J].
Bronstein, Michael M. ;
Bruna, Joan ;
LeCun, Yann ;
Szlam, Arthur ;
Vandergheynst, Pierre .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (04) :18-42
[8]  
Bruna J., 2014, P INT C LEARN REPR B
[9]   Displaced Dynamic Expression Regression for Real-time Facial Tracking and Animation [J].
Cao, Chen ;
Hou, Qiming ;
Zhou, Kun .
ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (04)
[10]  
Chang Y, 2005, LECT NOTES COMPUT SC, V3723, P293