Decoding the third dimension in the metaverse: A comprehensive method for reconstructing 2D NFT portraits into 3D models

被引：1

作者：

Deng, Erqiang ^{[1
]}

You, Li ^{[2
]}

Khan, Fazlullah ^{[3
]}

Zhu, Guosong ^{[1
]}

Qin, Zhen ^{[1
]}

Kumari, Saru ^{[4
]}

Xiong, Hu ^{[1
]}

Alturki, Ryan ^{[5
]}

机构：

[1] Univ Elect Sci & Technol China, Network & Data Secur Key Lab Sichuan Prov, Chengdu, Peoples R China

[2] Erasmus MC, Dept Mol Genet, Rotterdam, Netherlands

[3] Univ Nottingham Ningbo China, Fac Sci & Engn, Sch Comp Sci, Ningbo 315104, Zhejiang, Peoples R China

[4] Chaudhary Charan Singh Univ, Dept Math, Meerut 250004, Uttar Pradesh, India

[5] Umm Al Qura Univ, Coll Comp, Dept Software Engn, Mecca, Saudi Arabia

来源：

APPLIED SOFT COMPUTING | 2024年 / 165卷

基金：

中国国家自然科学基金;

关键词：

Metaverse; NFT; 3D reconstruction; Decoupling autoencoder;

D O I：

10.1016/j.asoc.2024.111964

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the Metaverse, 3D modeling techniques and autoencoders offer a novel approach for handling 2D portraits of Non-Fungible Tokens (NFTs). These techniques have significant applications in the metaverse, a virtual, shared, and persistently online space that combines the real world, virtual reality, and augmented reality. Within the metaverse, NFTs can represent virtual items and assets, and 3D modeling techniques can be used to create three-dimensional models of these virtual items and assets. In this paper, we propose a novel method of inferring 3D structure and texture from 2D Non-Fungible Token (NFT) portraits using image-decoupled autoencoders. By implementing 3D facial modeling, depth values are associated with each pixel in the canonical view, thereby modeling 3D faces with fine textures and accurate structures from 2D NFT portraits. The input image is decomposed into four elements: depth map, albedo image, light direction, and viewpoint, all of which are used in the 3D reconstruction process. Asymmetry in NFT portraits is also addressed, and a symmetry confidence map is used to record the symmetry prediction probability for each pixel. In the experimental section, datasets including human faces and anime faces are used to better adapt to the diverse styles of NFT images. The Adam optimizer is used for training, and a set of new evaluation metrics, including cosine similarity, PSNR, SSIM, and LPIPS, are used to assess the quality of texture reconstruction. The proposed method achieves state-of-the-art performance in 3D facial reconstruction and performs exceptionally well in 3D facial reconstruction of anime faces compared to other methods.

引用

页数：8

共 50 条

[21] A fast 3D scene reconstructing method using continuous video
Bo-Yi Sung
Chang-Hong Lin
EURASIP Journal on Image and Video Processing, 2017
[22] RECONSTRUCTING PART-LEVEL 3D MODELS FROM A SINGLE IMAGE
Shi, Dingfeng
Zhao, Yifan
Li, Jia
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[23] Reconstructing 3D Face Models by Incremental Aggregation and Refinement of Depth Frames
Pala, Pietro
Berretti, Stefano
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (01)
[24] Reconstructing human cerebral vasculature in 3D with high frame rate, freehand 2D Doppler ultrasound using optical tracking
Verhoef, Luuk
Soloukey, Sadaf
Mastik, Frits
Generowicz, Bastian S.
Vincent, Arnaud J. P. E.
Bos, Eelke M.
Schouten, Joost W.
Dirven, Clemens M. F.
De Zeeuw, Chris I.
Koekkoek, Sebastiaan K. E.
Klein, Stefan
Kruizinga, Pieter
2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS), 2022,
[25] 3D SHAPE RECONSTRUCTION FROM 2D ISAR MEASUREMENTS
Sun, Jing
Shang, She
Xu, Jia-Dong
2012 INTERNATIONAL CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (LCWAMTIP), 2012, : 25 - 28
[26] 2D TO 3D CONVERSION OF SPORTS CONTENT USING PANORAMAS
Schnyder, Lars
Wang, Oliver
Smolic, Aljoscha
2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
[27] 3D Reconstruction of Garment from a Single 2D Image
Liu, Hongyan
PROCEEDINGS OF THE FIBER SOCIETY 2009 SPRING CONFERENCE, VOLS I AND II, 2009, : 1165 - 1168
[28] 3D Kidney Reconstruction from 2D Ultrasound Images
Teresa Alvarez-Gutierrez, Mariana
Rodrigo Mejia-Rodriguez, Aldo
Alejandro Cruz-Guerrero, Ines
Roman Arce-Santana, Edgar
VIII LATIN AMERICAN CONFERENCE ON BIOMEDICAL ENGINEERING AND XLII NATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, 2020, 75 : 393 - 400
[29] 2D/3D Image Converter Based on Overlapping Line
Fan, Yu-Cheng
Chiu, Yi-Chih
Chang, Li-Cheng
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST 2022), 2022,
[30] Constraint-based beautification and dimensioning of 3D polyhedral models reconstructed from 2D sketches
Zou, H. L.
Lee, Y. T.
COMPUTER-AIDED DESIGN, 2007, 39 (11) : 1025 - 1036

← 1 2 3 4 5 →