A Coarse-to-Fine Transformer-Based Network for 3D Reconstruction from Non-Overlapping Multi-View Images

Cited by: 1

Authors:

Shan, Yue [1]

Xiao, Jun [1]

Liu, Lupeng [1]

Wang, Yunbiao [1]

Yu, Dongbo [1]

Zhang, Wenniu [1]

Affiliations:

[1] Univ Chinese Acad & Sci, Sch Artificial Intelligence, 19 Yuquan Rd, Beijing 100049, Peoples R China

Source:

REMOTE SENSING | 2024 / Vol. 16 / Issue 05

Funding:

National Natural Science Foundation of China;

Keywords:

point cloud reconstruction; Transformer; non-overlapping; multi-view; shape;

DOI:

10.3390/rs16050901

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science];

Discipline classification code:

08 ; 0830 ;

Abstract:

Reconstructing 3D structures from non-overlapping multi-view images is a crucial yet difficult task in 3D computer vision: without overlapping regions between views, it is hard to establish feature correspondences and infer depth. Previous methods, whether they generate the surface mesh or the volume of an object, struggle to ensure both the accuracy of detailed topology and the integrity of the overall structure. In this paper, we introduce a novel coarse-to-fine Transformer-based reconstruction network that generates precise point clouds from multiple input images taken at sparse, non-overlapping viewpoints. Specifically, we first employ a general point cloud generation architecture, enhanced by an adaptive centroid constraint, to produce a coarse point cloud of the object. A Transformer-based refinement module then applies a deformation to each point. We design an attention-based encoder that encodes both image projection features and point cloud geometric features, along with a decoder that calculates deformation residuals. Experiments on ShapeNet demonstrate that the proposed method outperforms competing methods.
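The refinement stage described in the abstract (encode per-point image and geometric features with attention, decode a deformation residual, add it to the coarse point) can be illustrated with a minimal pure-Python sketch. This is not the authors' network: the single attention head, the random weights, the feature sizes, and every name here (`refine`, `random_matrix`, `w_q`, `w_k`, `w_v`, `w_out`) are assumptions chosen only to show the data flow.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng, scale=0.1):
    """Hypothetical stand-in for learned weights."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def refine(coarse, image_feats, w_q, w_k, w_v, w_out):
    """Return (refined points, residuals) for a coarse point cloud.

    Assumed shapes: coarse is N x 3, image_feats is N x C,
    w_q / w_k / w_v are (3 + C) x H, and w_out is H x 3.
    """
    # One token per point: geometric part (xyz) plus image projection feature.
    tokens = [list(p) + list(f) for p, f in zip(coarse, image_feats)]
    q, k, v = matmul(tokens, w_q), matmul(tokens, w_k), matmul(tokens, w_v)
    hidden = len(q[0])
    # Single-head scaled dot-product self-attention across all points.
    k_t = [list(col) for col in zip(*k)]
    scores = matmul(q, k_t)
    attn = [softmax([s / math.sqrt(hidden) for s in row]) for row in scores]
    encoded = matmul(attn, v)
    # Linear "decoder": one 3D deformation residual per point.
    residuals = matmul(encoded, w_out)
    refined = [[c + r for c, r in zip(p, res)] for p, res in zip(coarse, residuals)]
    return refined, residuals


rng = random.Random(0)
n_points, feat_dim, hidden_dim = 8, 4, 6
coarse = [[rng.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(n_points)]
image_feats = random_matrix(n_points, feat_dim, rng, scale=1.0)
w_q = random_matrix(3 + feat_dim, hidden_dim, rng)
w_k = random_matrix(3 + feat_dim, hidden_dim, rng)
w_v = random_matrix(3 + feat_dim, hidden_dim, rng)
w_out = random_matrix(hidden_dim, 3, rng)
refined, residuals = refine(coarse, image_feats, w_q, w_k, w_v, w_out)
```

Predicting a residual rather than an absolute position means that a decoder outputting zeros leaves the coarse cloud unchanged, so the refinement can only build on the coarse stage's global structure.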


Pages: 18
