Decoding the third dimension in the metaverse: A comprehensive method for reconstructing 2D NFT portraits into 3D models

被引：1

作者：

Deng, Erqiang ^{[1
]}

You, Li ^{[2
]}

Khan, Fazlullah ^{[3
]}

Zhu, Guosong ^{[1
]}

Qin, Zhen ^{[1
]}

Kumari, Saru ^{[4
]}

Xiong, Hu ^{[1
]}

Alturki, Ryan ^{[5
]}

机构：

[1] Univ Elect Sci & Technol China, Network & Data Secur Key Lab Sichuan Prov, Chengdu, Peoples R China

[2] Erasmus MC, Dept Mol Genet, Rotterdam, Netherlands

[3] Univ Nottingham Ningbo China, Fac Sci & Engn, Sch Comp Sci, Ningbo 315104, Zhejiang, Peoples R China

[4] Chaudhary Charan Singh Univ, Dept Math, Meerut 250004, Uttar Pradesh, India

[5] Umm Al Qura Univ, Coll Comp, Dept Software Engn, Mecca, Saudi Arabia

来源：

APPLIED SOFT COMPUTING | 2024年 / 165卷

基金：

中国国家自然科学基金;

关键词：

Metaverse; NFT; 3D reconstruction; Decoupling autoencoder;

D O I：

10.1016/j.asoc.2024.111964

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the Metaverse, 3D modeling techniques and autoencoders offer a novel approach for handling 2D portraits of Non-Fungible Tokens (NFTs). These techniques have significant applications in the metaverse, a virtual, shared, and persistently online space that combines the real world, virtual reality, and augmented reality. Within the metaverse, NFTs can represent virtual items and assets, and 3D modeling techniques can be used to create three-dimensional models of these virtual items and assets. In this paper, we propose a novel method of inferring 3D structure and texture from 2D Non-Fungible Token (NFT) portraits using image-decoupled autoencoders. By implementing 3D facial modeling, depth values are associated with each pixel in the canonical view, thereby modeling 3D faces with fine textures and accurate structures from 2D NFT portraits. The input image is decomposed into four elements: depth map, albedo image, light direction, and viewpoint, all of which are used in the 3D reconstruction process. Asymmetry in NFT portraits is also addressed, and a symmetry confidence map is used to record the symmetry prediction probability for each pixel. In the experimental section, datasets including human faces and anime faces are used to better adapt to the diverse styles of NFT images. The Adam optimizer is used for training, and a set of new evaluation metrics, including cosine similarity, PSNR, SSIM, and LPIPS, are used to assess the quality of texture reconstruction. The proposed method achieves state-of-the-art performance in 3D facial reconstruction and performs exceptionally well in 3D facial reconstruction of anime faces compared to other methods.

引用

页数：8

共 50 条

[31] Security and Privacy Protection Obstacles with 3D Reconstructed Models of People in Applications and the Metaverse: A Survey
Vladimirov, Ivaylo
Nenova, Maria
Nikolova, Desislava
Terneva, Zornitsa
2022 57TH INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION, COMMUNICATION AND ENERGY SYSTEMS AND TECHNOLOGIES (ICEST), 2022, : 88 - 91
[32] 3D Shape from Silhouette Points in Registered 2D Images Using Conjugate Gradient Method
Szymczak, Andrzej
Hoff, William
Mahfouz, Mohamed
MEDICAL IMAGING 2010: IMAGE PROCESSING, 2010, 7623
[33] Double Reference Guided Interactive 2D and 3D Caricature Generation
Huang, Xin
Liang, Dong
Cai, Hongrui
Bai, Yunfeng
Zhang, Juyong
Tian, Feng
Jia, Jinyuan
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, 21 (01)
[34] Study on 3D Model Reconstruction of Vehicles from 2D Images
Zhang, Huaishan
Gao, Guanbin
Li, Bo
MACHINE DESIGN AND MANUFACTURING ENGINEERING III, 2014, : 625 - 628
[35] A 2D image 3D reconstruction function adaptive denoising algorithm
Wang, Feng
Ni, Weichuan
Liu, Shaojiang
Xu, Zhiming
Qiu, Zemin
Wan, Zhiping
PEERJ COMPUTER SCIENCE, 2023, 9
[36] A Review of Deep Learning Techniques for 3D Reconstruction of 2D Images
Yuniarti, Anny
Suciati, Nanik
PROCEEDINGS OF 2019 12TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2019, : 327 - 331
[37] Joint 2D and 3D Semantic Segmentation with Consistent Instance Semantic
Wan, Yingcai
Fang, Lijin
IEICE TRANSACTIONS ON COMMUNICATIONS, 2024, E107A (08) : 1309 - 1318
[38] The Precise 3D Reconstruction of Human Faces Based on 2D Photograph
Suo, Xiaoyuan
2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 1185 - 1190
[39] Rotational-symmetry in a 3D scene and its 2D image
Sawada, Tadamasa
Zaidi, Qasim
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2018, 87 : 108 - 125
[40] A 2D image 3D reconstruction function adaptive denoising algorithm
Wang F.
Ni W.
Liu S.
Xu Z.
Qiu Z.
Wan Z.
PeerJ Computer Science, 2023, 9 : 1 - 17

← 1 2 3 4 5 →