A Coarse-to-Fine Transformer-Based Network for 3D Reconstruction from Non-Overlapping Multi-View Images

被引:1
|
作者
Shan, Yue [1 ]
Xiao, Jun [1 ]
Liu, Lupeng [1 ]
Wang, Yunbiao [1 ]
Yu, Dongbo [1 ]
Zhang, Wenniu [1 ]
机构
[1] Univ Chinese Acad & Sci, Sch Artificial Intelligence, 19 Yuquan Rd, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
point cloud reconstruction; Transformer; non-overlapping; multi-view; POINT CLOUD RECONSTRUCTION; SHAPE;
D O I
10.3390/rs16050901
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Reconstructing 3D structures from non-overlapping multi-view images is a crucial task in the field of 3D computer vision, since it is difficult to establish feature correspondences and infer depth from overlapping parts of views. Previous methods, whether generating the surface mesh or volume of an object, face challenges in simultaneously ensuring the accuracy of detailed topology and the integrity of the overall structure. In this paper, we introduce a novel coarse-to-fine Transformer-based reconstruction network to generate precise point clouds from multiple input images at sparse and non-overlapping viewpoints. Specifically, we firstly employ a general point cloud generation architecture enhanced by the concept of adaptive centroid constraint for the coarse point cloud corresponding to the object. Subsequently, a Transformer-based refinement module applies deformation to each point. We design an attention-based encoder to encode both image projection features and point cloud geometric features, along with a decoder to calculate deformation residuals. Experiments on ShapeNet demonstrate that our proposed method outperforms other competing methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] A 3D Reconstruction Method with Color Reproduction from Multi-band and Multi-view Images
    Ito, Shuya
    Ito, Koichi
    Aoki, Takafumi
    Tsuchida, Masaru
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT II, 2017, 10117 : 236 - 247
  • [32] A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images
    Gao, Jian
    Liu, Jin
    Ji, Shunping
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 195 : 446 - 461
  • [33] Edge-guided 3D reconstruction from multi-view sketches and RGB images
    Shi, Wuzhen
    Yin, Aixue
    Li, Yingxiang
    Wen, Yang
    PATTERN RECOGNITION, 2025, 163
  • [34] 3D SURFACE RECONSTRUCTION FROM MULTI-VIEW AND MULTI-DATE GOOGLE EARTH SATELLITE IMAGES WITH 3D HOMOGRAPHY-BASED PROJECTIVE RECONSTRUCTION
    Lee, M. J.
    Park, S. Y.
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 43-B2 : 135 - 140
  • [35] 3D building model reconstruction from multi-view aerial images and LiDAR data
    Cheng, Liang
    Gong, Jianya
    Li, Manchun
    Liu, Yongxue
    Song, Xiaogang
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2009, 38 (06): : 494 - 501
  • [36] Reconstruction of complete 3D object model from multi-view range images.
    Hung, YP
    Chen, CS
    Hsieh, IB
    Fuh, CS
    THREE-DIMENSIONAL IMAGE CAPTURE AND APPLICATIONS II, 1999, 3640 : 138 - 145
  • [37] Cortical Volumetry using 3D Reconstruction of Metacarpal Bone from Multi-view Images
    Jayakar, Avinash D.
    Sambath, Gautham
    Areeckal, Anu Shaju
    David, Sumam S.
    2018 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2018, : 79 - 83
  • [38] Automatic 3D building reconstruction from multi-view aerial images with deep learning
    Yu, Dawen
    Ji, Shunping
    Liu, Jin
    Wei, Shiqing
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 171 : 155 - 170
  • [39] 3D reconstruction of human bodies from single-view and multi-view images: A systematic review
    Correia, Helena A.
    Brito, Jose Henrique
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 239
  • [40] Region-focused multi-view transformer-based generative adversarial network for cardiac cine MRI reconstruction
    Lyu, Jun
    Li, Guangyuan
    Wang, Chengyan
    Qin, Chen
    Wang, Shuo
    Dou, Qi
    Qin, Jing
    MEDICAL IMAGE ANALYSIS, 2023, 85