A Coarse-to-Fine Transformer-Based Network for 3D Reconstruction from Non-Overlapping Multi-View Images

被引：1

作者：

Shan, Yue ^{[1
]}

Xiao, Jun ^{[1
]}

Liu, Lupeng ^{[1
]}

Wang, Yunbiao ^{[1
]}

Yu, Dongbo ^{[1
]}

Zhang, Wenniu ^{[1
]}

机构：

[1] Univ Chinese Acad & Sci, Sch Artificial Intelligence, 19 Yuquan Rd, Beijing 100049, Peoples R China

来源：

REMOTE SENSING | 2024年 / 16卷 / 05期

基金：

中国国家自然科学基金;

关键词：

point cloud reconstruction; Transformer; non-overlapping; multi-view; POINT CLOUD RECONSTRUCTION; SHAPE;

D O I：

10.3390/rs16050901

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Reconstructing 3D structures from non-overlapping multi-view images is a crucial task in the field of 3D computer vision, since it is difficult to establish feature correspondences and infer depth from overlapping parts of views. Previous methods, whether generating the surface mesh or volume of an object, face challenges in simultaneously ensuring the accuracy of detailed topology and the integrity of the overall structure. In this paper, we introduce a novel coarse-to-fine Transformer-based reconstruction network to generate precise point clouds from multiple input images at sparse and non-overlapping viewpoints. Specifically, we firstly employ a general point cloud generation architecture enhanced by the concept of adaptive centroid constraint for the coarse point cloud corresponding to the object. Subsequently, a Transformer-based refinement module applies deformation to each point. We design an attention-based encoder to encode both image projection features and point cloud geometric features, along with a decoder to calculate deformation residuals. Experiments on ShapeNet demonstrate that our proposed method outperforms other competing methods.

引用

页数：18

共 50 条

[21] 3D model retrieval based on multi-view attentional convolutional neural network
An-An Liu
He-Yu Zhou
Meng-Jie Li
Wei-Zhi Nie
Multimedia Tools and Applications, 2020, 79 : 4699 - 4711
[22] Learning View-Based Graph Convolutional Network for Multi-View 3D Shape Analysis
Wei, Xin
Yu, Ruixuan
Sun, Jian
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7525 - 7541
[23] Transformer-Based Stereo-Aware 3D Object Detection From Binocular Images
Sun, Hanqing
Pang, Yanwei
Cao, Jiale
Xie, Jin
Li, Xuelong
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (12) : 19675 - 19687
[24] Polarimetric multi-view 3D reconstruction based on parallax angle and zenith angle optimization
Zhang Rui-Hua
Shi Bo-Xin
Yang Jin-Fa
Zhao Hong-Ying
Zuo Zheng-Kong
JOURNAL OF INFRARED AND MILLIMETER WAVES, 2021, 40 (01) : 133 - 142
[25] A multi-view recurrent neural network for 3D mesh segmentation
Le, Truc
Bui, Giang
Duan, Ye
COMPUTERS & GRAPHICS-UK, 2017, 66 : 103 - 112
[26] 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network
Zhang, Le
Sun, Jian
Zheng, Qiang
SENSORS, 2018, 18 (11)
[27] Multi-view dual attention network for 3D object recognition
Wang, Wenju
Cai, Yu
Wang, Tao
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04) : 3201 - 3212
[28] Multi-view 3D reconstruction and modeling of the unknown 3D scenes using genetic algorithms
Merras, Mostafa
Saaidi, Abderrahim
El Akkad, Nabil
Satori, Khalid
SOFT COMPUTING, 2018, 22 (19) : 6271 - 6289
[29] Multi-view dual attention network for 3D object recognition
Wenju Wang
Yu Cai
Tao Wang
Neural Computing and Applications, 2022, 34 : 3201 - 3212
[30] Multi-View 3D Scene Abstraction From Drone-Captured RGB Images
Jeong, Wooseong
Kim, Jihun
Kweon, Hyeokjun
Yoon, Kuk-Jin
IEEE ACCESS, 2025, 13 : 27641 - 27656

← 1 2 3 4 5 →