Decoding the third dimension in the metaverse: A comprehensive method for reconstructing 2D NFT portraits into 3D models

被引:1
|
作者
Deng, Erqiang [1 ]
You, Li [2 ]
Khan, Fazlullah [3 ]
Zhu, Guosong [1 ]
Qin, Zhen [1 ]
Kumari, Saru [4 ]
Xiong, Hu [1 ]
Alturki, Ryan [5 ]
机构
[1] Univ Elect Sci & Technol China, Network & Data Secur Key Lab Sichuan Prov, Chengdu, Peoples R China
[2] Erasmus MC, Dept Mol Genet, Rotterdam, Netherlands
[3] Univ Nottingham Ningbo China, Fac Sci & Engn, Sch Comp Sci, Ningbo 315104, Zhejiang, Peoples R China
[4] Chaudhary Charan Singh Univ, Dept Math, Meerut 250004, Uttar Pradesh, India
[5] Umm Al Qura Univ, Coll Comp, Dept Software Engn, Mecca, Saudi Arabia
基金
中国国家自然科学基金;
关键词
Metaverse; NFT; 3D reconstruction; Decoupling autoencoder;
D O I
10.1016/j.asoc.2024.111964
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the Metaverse, 3D modeling techniques and autoencoders offer a novel approach for handling 2D portraits of Non-Fungible Tokens (NFTs). These techniques have significant applications in the metaverse, a virtual, shared, and persistently online space that combines the real world, virtual reality, and augmented reality. Within the metaverse, NFTs can represent virtual items and assets, and 3D modeling techniques can be used to create three-dimensional models of these virtual items and assets. In this paper, we propose a novel method of inferring 3D structure and texture from 2D Non-Fungible Token (NFT) portraits using image-decoupled autoencoders. By implementing 3D facial modeling, depth values are associated with each pixel in the canonical view, thereby modeling 3D faces with fine textures and accurate structures from 2D NFT portraits. The input image is decomposed into four elements: depth map, albedo image, light direction, and viewpoint, all of which are used in the 3D reconstruction process. Asymmetry in NFT portraits is also addressed, and a symmetry confidence map is used to record the symmetry prediction probability for each pixel. In the experimental section, datasets including human faces and anime faces are used to better adapt to the diverse styles of NFT images. The Adam optimizer is used for training, and a set of new evaluation metrics, including cosine similarity, PSNR, SSIM, and LPIPS, are used to assess the quality of texture reconstruction. The proposed method achieves state-of-the-art performance in 3D facial reconstruction and performs exceptionally well in 3D facial reconstruction of anime faces compared to other methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] A fast 3D scene reconstructing method using continuous video
    Bo-Yi Sung
    Chang-Hong Lin
    EURASIP Journal on Image and Video Processing, 2017
  • [22] RECONSTRUCTING PART-LEVEL 3D MODELS FROM A SINGLE IMAGE
    Shi, Dingfeng
    Zhao, Yifan
    Li, Jia
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [23] Reconstructing 3D Face Models by Incremental Aggregation and Refinement of Depth Frames
    Pala, Pietro
    Berretti, Stefano
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (01)
  • [24] Reconstructing human cerebral vasculature in 3D with high frame rate, freehand 2D Doppler ultrasound using optical tracking
    Verhoef, Luuk
    Soloukey, Sadaf
    Mastik, Frits
    Generowicz, Bastian S.
    Vincent, Arnaud J. P. E.
    Bos, Eelke M.
    Schouten, Joost W.
    Dirven, Clemens M. F.
    De Zeeuw, Chris I.
    Koekkoek, Sebastiaan K. E.
    Klein, Stefan
    Kruizinga, Pieter
    2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS), 2022,
  • [25] 3D SHAPE RECONSTRUCTION FROM 2D ISAR MEASUREMENTS
    Sun, Jing
    Shang, She
    Xu, Jia-Dong
    2012 INTERNATIONAL CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (LCWAMTIP), 2012, : 25 - 28
  • [26] 2D TO 3D CONVERSION OF SPORTS CONTENT USING PANORAMAS
    Schnyder, Lars
    Wang, Oliver
    Smolic, Aljoscha
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [27] 3D Reconstruction of Garment from a Single 2D Image
    Liu, Hongyan
    PROCEEDINGS OF THE FIBER SOCIETY 2009 SPRING CONFERENCE, VOLS I AND II, 2009, : 1165 - 1168
  • [28] 3D Kidney Reconstruction from 2D Ultrasound Images
    Teresa Alvarez-Gutierrez, Mariana
    Rodrigo Mejia-Rodriguez, Aldo
    Alejandro Cruz-Guerrero, Ines
    Roman Arce-Santana, Edgar
    VIII LATIN AMERICAN CONFERENCE ON BIOMEDICAL ENGINEERING AND XLII NATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, 2020, 75 : 393 - 400
  • [29] 2D/3D Image Converter Based on Overlapping Line
    Fan, Yu-Cheng
    Chiu, Yi-Chih
    Chang, Li-Cheng
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST 2022), 2022,
  • [30] Constraint-based beautification and dimensioning of 3D polyhedral models reconstructed from 2D sketches
    Zou, H. L.
    Lee, Y. T.
    COMPUTER-AIDED DESIGN, 2007, 39 (11) : 1025 - 1036