FaceLift: Semi-supervised 3D Facial Landmark Localization

Cited by: 2

Authors:

Ferman, David [1]

Garrido, Pablo [1]

Bharaj, Gaurav [1]

Affiliations:

[1] Flawless AI, London, England

Source:

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024) | 2024

Keywords:

DOI:

10.1109/CVPR52733.2024.00175

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

3D facial landmark localization has proven to be of particular use for applications such as face tracking, 3D face modeling, and image-based 3D face reconstruction. In the supervised learning case, such methods usually rely on 3D landmark datasets derived from 3DMM-based registration that often lack spatial definition alignment, as compared with that chosen by hand-labeled human consensus, e.g., how are eyebrow landmarks defined? This creates a gap between landmark datasets generated via high-quality 2D human labels and 3DMMs, and it ultimately limits their effectiveness. To address this issue, we introduce a novel semi-supervised learning approach that learns 3D landmarks by directly lifting (visible) hand-labeled 2D landmarks and ensures better definition alignment, without the need for 3D landmark datasets. To lift 2D landmarks to 3D, we leverage 3D-aware GANs for better multi-view consistency learning and in-the-wild multi-frame videos for robust cross-generalization. Empirical experiments demonstrate that our method not only achieves better definition alignment between 2D-3D landmarks but also outperforms other supervised learning 3D landmark localization methods on both 3DMM-labeled and photogrammetric ground truth evaluation datasets. Project Page: https://davidcferman.github.io/FaceLift

Pages: 1781-1791

Page count: 11
