Accurate 3D Face Reconstruction with Facial Component Tokens

被引：12

作者：

Zhang, Tianke ^{[1
,2
]}

Chu, Xuangeng ^{[2
]}

Liu, Yunfei ^{[2
]}

Lin, Lijian ^{[2
]}

Yang, Zhendong ^{[2
]}

Xu, Zhengzhuo ^{[1
,2
]}

Cao, Chengkun ^{[1
,2
]}

Yu, Fei ^{[3
]}

Zhou, Changyin ^{[3
]}

Yuan, Chun ^{[1
]}

Li, Yu ^{[2
]}

机构：

[1] Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China

[2] IDEA, Shenzhen, Peoples R China

[3] Vistring Inc, Hong Kong, Peoples R China

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

基金：

国家重点研发计划;

关键词：

MORPHABLE MODEL;

D O I：

10.1109/ICCV51070.2023.00829

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Accurately reconstructing 3D faces from monocular images and videos is crucial for various applications, such as digital avatar creation. However, the current deep learning-based methods face significant challenges in achieving accurate reconstruction with disentangled facial parameters and ensuring temporal stability in single-frame methods for 3D face tracking on video data. In this paper, we propose TokenFace, a transformer-based monocular 3D face reconstruction model. TokenFace uses separate tokens for different facial components to capture information about different facial parameters and employs temporal transformers to capture temporal information from video data. This design can naturally disentangle different facial components and is flexible to both 2D and 3D training data. Trained on hybrid 2D and 3D data, our model shows its power in accurately reconstructing faces from images and producing stable results for video data. Experimental results on popular benchmarks NoW and Stirling demonstrate that TokenFace achieves state-of-the-art performance, outperforming existing methods on all metrics by a large margin.

引用

页码：8999 / 9008

页数：10

共 50 条

[41] On Learning 3D Face Morphable Mode from In-the-Wild Images [J].

Tran, Luan ;

Liu, Xiaoming .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) :157-171

[42] From coin to 3D face sculpture portraits in the round of Roman emperors [J].

Castellani, Umberto ;

Bartolomioli, Riccardo ;

Marchioro, Giacomo ;

Calomino, Dario .

COMPUTERS & GRAPHICS-UK, 2024, 123

[43] Physically-guided Disentangled Implicit Rendering for 3D Face Modeling [J].

Zhang, Zhenyu ;

Ge, Yanhao ;

Tai, Ying ;

Cao, Weijian ;

Chen, Renwang ;

Liu, Kunlin ;

Tang, Hao ;

Huang, Xiaoming ;

Wang, Chengjie ;

Xie, Zhifeng ;

Huang, Dongjin .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :20321-20331

[44] Analysis of 3D Facial Dysmorphology in Genetic Syndromes from Unconstrained 2D Photographs [J].

Tu, Liyun ;

Porras, Antonio R. ;

Boyle, Alec ;

Linguraru, Marius George .

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2018, PT I, 2018, 11070 :347-355

[45] MPEG-4 compatible 3D facial animation based on morphable model [J].

Yin, BC ;

Wang, CZ ;

Shi, Q ;

Sun, YF .

PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, :4936-4941

[46] 3D Ear Reconstruction and Identification Method Research Based on Morphable Model [J].

Wang Shuai ;

Mu Zhichun ;

Li Chen ;

Zhang Feng .

PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, :3774-3778

[47] Face recognition based on geodesic preserving projection algorithm with 3D morphable model [J].

Bai, Xiaoming ;

Yin, Baocai ;

Shi, Qin ;

Sun, Yanfeng .

PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, :850-855

[48] Learning an Animatable Detailed 3D Face Model from In-The-Wild Images [J].

Feng, Yao ;

Feng, Haiwen ;

Black, Michael J. ;

Bolkart, Timo .

ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (04)

[49] Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection [J].

Zhang, Zhenyu ;

Ge, Yanhao ;

Chen, Renwang ;

Tai, Ying ;

Yan, Yan ;

Yang, Jian ;

Wang, Chengjie ;

Li, Jilin ;

Huang, Feiyue .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14209-14219

[50] Face Spoofing Detection based on 3D Lighting Environment Analysis of Image Pair [J].

Zhang, Xu ;

Hu, Xiyuan ;

Ma, Mingyang ;

Chen, Chen ;

Peng, Silong .

2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, :2995-3000

← 1 2 3 4 5 →