Decoding the third dimension in the metaverse: A comprehensive method for reconstructing 2D NFT portraits into 3D models

被引:1
|
作者
Deng, Erqiang [1 ]
You, Li [2 ]
Khan, Fazlullah [3 ]
Zhu, Guosong [1 ]
Qin, Zhen [1 ]
Kumari, Saru [4 ]
Xiong, Hu [1 ]
Alturki, Ryan [5 ]
机构
[1] Univ Elect Sci & Technol China, Network & Data Secur Key Lab Sichuan Prov, Chengdu, Peoples R China
[2] Erasmus MC, Dept Mol Genet, Rotterdam, Netherlands
[3] Univ Nottingham Ningbo China, Fac Sci & Engn, Sch Comp Sci, Ningbo 315104, Zhejiang, Peoples R China
[4] Chaudhary Charan Singh Univ, Dept Math, Meerut 250004, Uttar Pradesh, India
[5] Umm Al Qura Univ, Coll Comp, Dept Software Engn, Mecca, Saudi Arabia
基金
中国国家自然科学基金;
关键词
Metaverse; NFT; 3D reconstruction; Decoupling autoencoder;
D O I
10.1016/j.asoc.2024.111964
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the Metaverse, 3D modeling techniques and autoencoders offer a novel approach for handling 2D portraits of Non-Fungible Tokens (NFTs). These techniques have significant applications in the metaverse, a virtual, shared, and persistently online space that combines the real world, virtual reality, and augmented reality. Within the metaverse, NFTs can represent virtual items and assets, and 3D modeling techniques can be used to create three-dimensional models of these virtual items and assets. In this paper, we propose a novel method of inferring 3D structure and texture from 2D Non-Fungible Token (NFT) portraits using image-decoupled autoencoders. By implementing 3D facial modeling, depth values are associated with each pixel in the canonical view, thereby modeling 3D faces with fine textures and accurate structures from 2D NFT portraits. The input image is decomposed into four elements: depth map, albedo image, light direction, and viewpoint, all of which are used in the 3D reconstruction process. Asymmetry in NFT portraits is also addressed, and a symmetry confidence map is used to record the symmetry prediction probability for each pixel. In the experimental section, datasets including human faces and anime faces are used to better adapt to the diverse styles of NFT images. The Adam optimizer is used for training, and a set of new evaluation metrics, including cosine similarity, PSNR, SSIM, and LPIPS, are used to assess the quality of texture reconstruction. The proposed method achieves state-of-the-art performance in 3D facial reconstruction and performs exceptionally well in 3D facial reconstruction of anime faces compared to other methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Security and Privacy Protection Obstacles with 3D Reconstructed Models of People in Applications and the Metaverse: A Survey
    Vladimirov, Ivaylo
    Nenova, Maria
    Nikolova, Desislava
    Terneva, Zornitsa
    2022 57TH INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION, COMMUNICATION AND ENERGY SYSTEMS AND TECHNOLOGIES (ICEST), 2022, : 88 - 91
  • [32] 3D Shape from Silhouette Points in Registered 2D Images Using Conjugate Gradient Method
    Szymczak, Andrzej
    Hoff, William
    Mahfouz, Mohamed
    MEDICAL IMAGING 2010: IMAGE PROCESSING, 2010, 7623
  • [33] Double Reference Guided Interactive 2D and 3D Caricature Generation
    Huang, Xin
    Liang, Dong
    Cai, Hongrui
    Bai, Yunfeng
    Zhang, Juyong
    Tian, Feng
    Jia, Jinyuan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, 21 (01)
  • [34] Study on 3D Model Reconstruction of Vehicles from 2D Images
    Zhang, Huaishan
    Gao, Guanbin
    Li, Bo
    MACHINE DESIGN AND MANUFACTURING ENGINEERING III, 2014, : 625 - 628
  • [35] A 2D image 3D reconstruction function adaptive denoising algorithm
    Wang, Feng
    Ni, Weichuan
    Liu, Shaojiang
    Xu, Zhiming
    Qiu, Zemin
    Wan, Zhiping
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [36] A Review of Deep Learning Techniques for 3D Reconstruction of 2D Images
    Yuniarti, Anny
    Suciati, Nanik
    PROCEEDINGS OF 2019 12TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2019, : 327 - 331
  • [37] Joint 2D and 3D Semantic Segmentation with Consistent Instance Semantic
    Wan, Yingcai
    Fang, Lijin
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2024, E107A (08) : 1309 - 1318
  • [38] The Precise 3D Reconstruction of Human Faces Based on 2D Photograph
    Suo, Xiaoyuan
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 1185 - 1190
  • [39] Rotational-symmetry in a 3D scene and its 2D image
    Sawada, Tadamasa
    Zaidi, Qasim
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2018, 87 : 108 - 125
  • [40] A 2D image 3D reconstruction function adaptive denoising algorithm
    Wang F.
    Ni W.
    Liu S.
    Xu Z.
    Qiu Z.
    Wan Z.
    PeerJ Computer Science, 2023, 9 : 1 - 17