3DFaceMAE: Pre-training of Masked Autoencoder Using Patch-Based Random Masking Reconstruction and Super-resolution for 3D Face Recognition

被引:0
|
作者
Gao, Ziqi [1 ,2 ]
Li, Qiufu [1 ,2 ,3 ,4 ]
Shen, Linlin [1 ,2 ,3 ,4 ]
Yang, Junpeng [1 ,2 ]
机构
[1] Shenzhen Univ, Comp Vis Inst, Shenzhen 518060, Guangdong, Peoples R China
[2] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Guangdong, Peoples R China
[3] Shenzhen Inst Artificial Intelligence Robot Soc A, Shenzhen 518129, Guangdong, Peoples R China
[4] Shenzhen Univ, Guangdong Prov Key Lab Intelligent Informat Proc, Shenzhen 518060, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Face recognition; 3D point cloud;
D O I
10.1007/978-981-97-8795-1_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Compared to 2D face recognition, 3D face recognition exhibits stronger robustness against variations like pose and illumination. However, due to the limited training data, the accuracy of existing 3D face recognition methods is still unsatisfactory. In this paper, we introduce 3DFaceMAE, which is the first masked autoencoder (MAE) based 3D face recognition method using point clouds. Specifically, we first synthesize a large-scale 3D point cloud facial dataset and combine it with the small-scale real data. In the pre-training of 3DFaceMAE, we extract the key facial regions from the input 3D facial point cloud, using normal difference techniques, and reconstruct these key regions using patch-based random masking reconstruction and super-resolution. We finally fine-tune the encoder of 3DFaceMAE on the real 3D face point cloud data. In the experiments, we test 3DFaceMAE on three 3D face datasets, as high as 91.17% was achieved on the Lock3DFace dataset, which is the first reported result surpassing 90%. In addition, the experimental results indicate that 3DFaceMAE has strong cross-quality generalization performance. We also validate the effectiveness of different components of 3DFaceMAE through ablation study.
引用
收藏
页码:488 / 503
页数:16
相关论文
共 45 条
  • [21] 3D Registration of pre-surgical prostate MRI and histopathology images via super-resolution volume reconstruction
    Sood, Rewa R.
    Shao, Wei
    Kunder, Christian
    Teslovich, Nikola C.
    Wang, Jeffrey B.
    Soerensen, Simon J. C.
    Madhuripan, Nikhil
    Jawahar, Anugayathri
    Brooks, James D.
    Ghanouni, Pejman
    Fan, Richard E.
    Sonn, Geoffrey A.
    Rusu, Mirabela
    MEDICAL IMAGE ANALYSIS, 2021, 69
  • [22] 3D landmark-based face restoration for recognition using variational autoencoder and triplet loss
    Sharma, Sahil
    Kumar, Vijay
    IET BIOMETRICS, 2021, 10 (01) : 87 - 98
  • [23] DYNAMIC 3D PET RECONSTRUCTION FOR KINETIC ANALYSIS USING PATCH-BASED LOW-RANK PENALTY
    Kim, K. S.
    Son, Y. D.
    Cho, Z. H.
    Ra, J. B.
    Ye, J. C.
    2012 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE RECORD (NSS/MIC), 2012, : 3430 - 3433
  • [24] Accurate Multiple View 3D Reconstruction Using Patch-Based Stereo for Large-Scale Scenes
    Shen, Shuhan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (05) : 1901 - 1914
  • [25] Super-resolution reconstruction of single anisotropic 3D MR images using residual convolutional neural network
    Du, Jinglong
    He, Zhongshi
    Wang, Lulu
    Gholipour, Ali
    Zhou, Zexun
    Chen, Dingding
    Jia, Yuanyuan
    NEUROCOMPUTING, 2020, 392 : 209 - 220
  • [26] Research on Super-Resolution Enhancement Technology Using Improved Transformer Network and 3D Reconstruction of Wheat Grains
    Tian, Yijun
    Zhang, Jinning
    Zhang, Zhongjie
    Wu, Jianjun
    IEEE ACCESS, 2024, 12 : 62882 - 62898
  • [27] 3D reconstruction and face recognition using kernel-based ICA and neural networks
    Kuo, Shye-Chorng
    Lin, Cheng-Jian
    Liao, Jan-Ray
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 5406 - 5415
  • [28] 3D-MRI super-resolution reconstruction using multi-modality based on multi-resolution CNN
    Kang, Li
    Tang, Bin
    Huang, Jianjun
    Li, Jianping
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 248
  • [29] 3D crack recognition in Engineered Cementitious Composites (ECC) based on super-resolution reconstruction and semantic segmentation of X-ray Computed Microtomography
    Hao, Zhexin
    Lu, Cong
    Dong, Biqin
    Li, Victor C.
    COMPOSITES PART B-ENGINEERING, 2024, 285
  • [30] Voxel-based 3D face reconstruction and its application to face recognition using sequential deep learning
    Sahil Sharma
    Vijay Kumar
    Multimedia Tools and Applications, 2020, 79 : 17303 - 17330