A Coarse-to-Fine Transformer-Based Network for 3D Reconstruction from Non-Overlapping Multi-View Images

被引：1

作者：

Shan, Yue ^{[1
]}

Xiao, Jun ^{[1
]}

Liu, Lupeng ^{[1
]}

Wang, Yunbiao ^{[1
]}

Yu, Dongbo ^{[1
]}

Zhang, Wenniu ^{[1
]}

机构：

[1] Univ Chinese Acad & Sci, Sch Artificial Intelligence, 19 Yuquan Rd, Beijing 100049, Peoples R China

来源：

REMOTE SENSING | 2024年 / 16卷 / 05期

基金：

中国国家自然科学基金;

关键词：

point cloud reconstruction; Transformer; non-overlapping; multi-view; POINT CLOUD RECONSTRUCTION; SHAPE;

D O I：

10.3390/rs16050901

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Reconstructing 3D structures from non-overlapping multi-view images is a crucial task in the field of 3D computer vision, since it is difficult to establish feature correspondences and infer depth from overlapping parts of views. Previous methods, whether generating the surface mesh or volume of an object, face challenges in simultaneously ensuring the accuracy of detailed topology and the integrity of the overall structure. In this paper, we introduce a novel coarse-to-fine Transformer-based reconstruction network to generate precise point clouds from multiple input images at sparse and non-overlapping viewpoints. Specifically, we firstly employ a general point cloud generation architecture enhanced by the concept of adaptive centroid constraint for the coarse point cloud corresponding to the object. Subsequently, a Transformer-based refinement module applies deformation to each point. We design an attention-based encoder to encode both image projection features and point cloud geometric features, along with a decoder to calculate deformation residuals. Experiments on ShapeNet demonstrate that our proposed method outperforms other competing methods.

引用

页数：18

共 50 条

[31] Multi-View Normal Field Integration for 3D Reconstruction of Mirroring Objects
Weinmann, Michael
Osep, Aljosa
Ruiters, Roland
Klein, Reinhard
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2504 - 2511
[32] Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection
Chen, Zehui
Li, Zhenyu
Zhang, Shiquan
Fang, Liangji
Jiang, Qinhong
Zhao, Feng
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5999 - 6008
[33] Wide-Baseline Multi-View Video Segmentation for 3D Reconstruction
Sarim, Muhammad
Hilton, Adrian
Guillemaut, Jean-Yves
Kim, Hansung
Takai, Takeshi
PROCEEDINGS OF THE 2010 ACM WORKSHOP ON 3D VIDEO PROCESSING (3DVP'10), 2010, : 13 - 18
[34] Efficient Hierarchical Multi-view Fusion Transformer for 3D Human Pose Estimation
Zhou, Kangkang
Zhang, Lijun
Lu, Feng
Zhou, Xiang-Dong
Shi, Yu
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7512 - 7520
[35] Research on Viewpoint Planning Method for Multi-view Image 3D Reconstruction
Shi, Yun
Zhu, Yanyan
MANUFACTURING TECHNOLOGY, 2023, 23 (04): : 532 - 537
[36] Hierarchical Denoising Method of Crop 3D Point Cloud Based on Multi-view Image Reconstruction
Chen, Lei
Yuan, Yuan
Song, Shide
COMPUTER AND COMPUTING TECHNOLOGIES IN AGRICULTURE XI, PT I, 2019, 545 : 416 - 427
[37] Dynamic Grouping With Multi-Manifold Attention for Multi-View 3D Object Reconstruction
Kalitsios, Georgios
Konstantinidis, Dimitrios
Daras, Petros
Dimitropoulos, Kosmas
IEEE ACCESS, 2024, 12 : 160690 - 160699
[38] Multi-View Hierarchical Fusion Network for 3D Object Retrieval and Classification
Liu, An-An
Hu, Nian
Song, Dan
Guo, Fu-Bin
Zhou, He-Yu
Hao, Tong
IEEE ACCESS, 2019, 7 : 153021 - 153030
[39] HAND-3D-STUDIO: A NEW MULTI-VIEW SYSTEM FOR 3D HAND RECONSTRUCTION
Zhao, Zhengyi
Wang, Tianyao
Xia, Siyu
Wang, Yangang
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2478 - 2482
[40] An Improved Multi-View Convolutional Neural Network for 3D Object Retrieval
He, Xinwei
Bai, Song
Chu, Jiajia
Bai, Xiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7917 - 7930

← 1 2 3 4 5 →