3DFaceMAE: Pre-training of Masked Autoencoder Using Patch-Based Random Masking Reconstruction and Super-resolution for 3D Face Recognition

被引:0
|
作者
Gao, Ziqi [1 ,2 ]
Li, Qiufu [1 ,2 ,3 ,4 ]
Shen, Linlin [1 ,2 ,3 ,4 ]
Yang, Junpeng [1 ,2 ]
机构
[1] Shenzhen Univ, Comp Vis Inst, Shenzhen 518060, Guangdong, Peoples R China
[2] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Guangdong, Peoples R China
[3] Shenzhen Inst Artificial Intelligence Robot Soc A, Shenzhen 518129, Guangdong, Peoples R China
[4] Shenzhen Univ, Guangdong Prov Key Lab Intelligent Informat Proc, Shenzhen 518060, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Face recognition; 3D point cloud;
D O I
10.1007/978-981-97-8795-1_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Compared to 2D face recognition, 3D face recognition exhibits stronger robustness against variations like pose and illumination. However, due to the limited training data, the accuracy of existing 3D face recognition methods is still unsatisfactory. In this paper, we introduce 3DFaceMAE, which is the first masked autoencoder (MAE) based 3D face recognition method using point clouds. Specifically, we first synthesize a large-scale 3D point cloud facial dataset and combine it with the small-scale real data. In the pre-training of 3DFaceMAE, we extract the key facial regions from the input 3D facial point cloud, using normal difference techniques, and reconstruct these key regions using patch-based random masking reconstruction and super-resolution. We finally fine-tune the encoder of 3DFaceMAE on the real 3D face point cloud data. In the experiments, we test 3DFaceMAE on three 3D face datasets, as high as 91.17% was achieved on the Lock3DFace dataset, which is the first reported result surpassing 90%. In addition, the experimental results indicate that 3DFaceMAE has strong cross-quality generalization performance. We also validate the effectiveness of different components of 3DFaceMAE through ablation study.
引用
收藏
页码:488 / 503
页数:16
相关论文
共 45 条
  • [31] Voxel-based 3D face reconstruction and its application to face recognition using sequential deep learning
    Sharma, Sahil
    Kumar, Vijay
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (25-26) : 17303 - 17330
  • [32] Development of 3D patient-based super-resolution digital breast phantoms using machine learning
    Caballo, Marco
    Fedon, Christian
    Brombal, Luca
    Mann, Ritse
    Longo, Renata
    Sechopoulos, Ioannis
    PHYSICS IN MEDICINE AND BIOLOGY, 2018, 63 (22):
  • [33] Fractal geometry-based automatic generation of large-scale image database for pre-training in 3D object recognition
    Yamada R.
    Okayasu K.
    Nakamura A.
    Kataoka H.
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2021, 87 (04): : 374 - 379
  • [34] Super-resolution Reconstruction and Denoising of 3D Millimetre-wave Images using a Complex-valued Convolutional Neural Network
    Sharmal, Rahul
    Zhangl, Jiaming
    Kumarl, Rupesh
    Deka, Bhabesh
    Fuscol, Vincent
    Yurdusevenl, Okan
    2023 17TH EUROPEAN CONFERENCE ON ANTENNAS AND PROPAGATION, EUCAP, 2023,
  • [35] Intensifying the spatial resolution of 3D thermal models from aerial imagery using deep learning-based image super-resolution
    Fallah, Alaleh
    Samadzadegan, Farhad
    Javan, Farzaneh Dadrass
    GEOCARTO INTERNATIONAL, 2022, 37 (26) : 13518 - 13538
  • [36] Three-Dimensional-Slice-Super-Resolution-Net: A Fast Few Shooting Learning Model for 3D Super-Resolution Using Slice-Up and Slice-Reconstruction
    Lin, Hongbin
    Xu, Qingfeng
    Xu, Handing
    Xu, Yanjie
    Zheng, Yiming
    Zhong, Yubin
    Nie, Zhenguo
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2024, 24 (01)
  • [37] State-of-the-art of 3D facial reconstruction methods for face recognition based on a single 2D training image per person
    Levine, Martin D.
    Yu, Yingfeng
    PATTERN RECOGNITION LETTERS, 2009, 30 (10) : 908 - 913
  • [38] Learning-Based Super-Resolution Imaging of Turbulent Flames in Both Time and 3D Space Using Double GAN Architectures
    Zheng, Chenxu
    Huang, Weiming
    Xu, Wenjiang
    FIRE-SWITZERLAND, 2024, 7 (08):
  • [39] Improving Seismic Fault Recognition with Self-Supervised Pre-Training: A Study of 3D Transformer-Based with Multi-Scale Decoding and Fusion
    Zhang, Zeren
    Chen, Ran
    Ma, Jinwen
    REMOTE SENSING, 2024, 16 (05)
  • [40] 3D whole heart k-space-based super-resolution cardiac T1 mapping using rotated stacks
    Hufnagel, Simone
    Schuenke, Patrick
    Schulz-Menger, Jeanette
    Schaeffter, Tobias
    Kolbitsch, Christoph
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (08):