FaceLift: Semi-supervised 3D Facial Landmark Localization

被引:2
作者
Ferman, David [1 ]
Garrido, Pablo [1 ]
Bharaj, Gaurav [1 ]
机构
[1] Flawless AI, London, England
来源
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024 | 2024年
关键词
D O I
10.1109/CVPR52733.2024.00175
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D facial landmark localization has proven to be of particular use for applications, such as face tracking, 3D face modeling, and image-based 3D face reconstruction. In the supervised learning case, such methods usually rely on 3D landmark datasets derived from 3DMM-based registration that often lack spatial definition alignment, as compared with that chosen by hand-labeled human consensus, e.g., how are eyebrow landmarks defined? This creates a gap between landmark datasets generated via high-quality 2D human labels and 3DMMs, and it ultimately limits their effectiveness. To address this issue, we introduce a novel semi- supervised learning approach that learns 3D landmarks by directly lifting (visible) hand-labeled 2D landmarks and ensures better definition alignment, without the need for 3D landmark datasets. To lift 2D landmarks to 3D, we leverage 3D-aware GANs for better multi-view consistency learning and in-the-wild multi-frame videos for robust cross-generalization. Empirical experiments demonstrate that our method not only achieves better definition alignment between 2D- 3D landmarks but also outperforms other supervised learning 3D landmark localization methods on both 3DMM labeled and photogrammetric ground truth evaluation datasets. Project Page: https://davidcferman.github.io/FaceLift
引用
收藏
页码:1781 / 1791
页数:11
相关论文
共 62 条
[1]  
An S., 2023, CoRR abs/2310.20689
[2]  
[Anonymous], 2022, CVPR, DOI DOI 10.1109/CVPR52688.2022.01970
[3]  
[Anonymous], 2019, C COMP VIS PATT REC, DOI DOI 10.1109/CVPR.2019.01107
[4]  
[Anonymous], 2022, CVPR, DOI DOI 10.1109/CVPR52688.2022.01565
[5]  
[Anonymous], 2016, CVPR, DOI DOI 10.1109/CVPR.2016.374
[6]  
[Anonymous], 2022, ECCV, DOI DOI 10.1007/978-3-031-20077-938
[7]  
[Anonymous], 2021, CVPR, DOI DOI 10.1109/CVPR46437.2021.00337
[8]  
Ardisson Scott, 2022, arXiv
[9]  
Bagdanov AD., 2011, MM 11 PROC 2011 ACM, P79
[10]   Faster Than Real-time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses [J].
Bhagavatula, Chandrasekhar ;
Zhu, Chenchen ;
Luu, Khoa ;
Savvides, Marios .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4000-4009