A Coarse-to-Fine Transformer-Based Network for 3D Reconstruction from Non-Overlapping Multi-View Images

被引:1
|
作者
Shan, Yue [1 ]
Xiao, Jun [1 ]
Liu, Lupeng [1 ]
Wang, Yunbiao [1 ]
Yu, Dongbo [1 ]
Zhang, Wenniu [1 ]
机构
[1] Univ Chinese Acad & Sci, Sch Artificial Intelligence, 19 Yuquan Rd, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
point cloud reconstruction; Transformer; non-overlapping; multi-view; POINT CLOUD RECONSTRUCTION; SHAPE;
D O I
10.3390/rs16050901
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Reconstructing 3D structures from non-overlapping multi-view images is a crucial task in the field of 3D computer vision, since it is difficult to establish feature correspondences and infer depth from overlapping parts of views. Previous methods, whether generating the surface mesh or volume of an object, face challenges in simultaneously ensuring the accuracy of detailed topology and the integrity of the overall structure. In this paper, we introduce a novel coarse-to-fine Transformer-based reconstruction network to generate precise point clouds from multiple input images at sparse and non-overlapping viewpoints. Specifically, we firstly employ a general point cloud generation architecture enhanced by the concept of adaptive centroid constraint for the coarse point cloud corresponding to the object. Subsequently, a Transformer-based refinement module applies deformation to each point. We design an attention-based encoder to encode both image projection features and point cloud geometric features, along with a decoder to calculate deformation residuals. Experiments on ShapeNet demonstrate that our proposed method outperforms other competing methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] 3D model retrieval based on multi-view attentional convolutional neural network
    An-An Liu
    He-Yu Zhou
    Meng-Jie Li
    Wei-Zhi Nie
    Multimedia Tools and Applications, 2020, 79 : 4699 - 4711
  • [22] Learning View-Based Graph Convolutional Network for Multi-View 3D Shape Analysis
    Wei, Xin
    Yu, Ruixuan
    Sun, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7525 - 7541
  • [23] Transformer-Based Stereo-Aware 3D Object Detection From Binocular Images
    Sun, Hanqing
    Pang, Yanwei
    Cao, Jiale
    Xie, Jin
    Li, Xuelong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (12) : 19675 - 19687
  • [24] Polarimetric multi-view 3D reconstruction based on parallax angle and zenith angle optimization
    Zhang Rui-Hua
    Shi Bo-Xin
    Yang Jin-Fa
    Zhao Hong-Ying
    Zuo Zheng-Kong
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2021, 40 (01) : 133 - 142
  • [25] A multi-view recurrent neural network for 3D mesh segmentation
    Le, Truc
    Bui, Giang
    Duan, Ye
    COMPUTERS & GRAPHICS-UK, 2017, 66 : 103 - 112
  • [26] 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network
    Zhang, Le
    Sun, Jian
    Zheng, Qiang
    SENSORS, 2018, 18 (11)
  • [27] Multi-view dual attention network for 3D object recognition
    Wang, Wenju
    Cai, Yu
    Wang, Tao
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04) : 3201 - 3212
  • [28] Multi-view 3D reconstruction and modeling of the unknown 3D scenes using genetic algorithms
    Merras, Mostafa
    Saaidi, Abderrahim
    El Akkad, Nabil
    Satori, Khalid
    SOFT COMPUTING, 2018, 22 (19) : 6271 - 6289
  • [29] Multi-view dual attention network for 3D object recognition
    Wenju Wang
    Yu Cai
    Tao Wang
    Neural Computing and Applications, 2022, 34 : 3201 - 3212
  • [30] Multi-View 3D Scene Abstraction From Drone-Captured RGB Images
    Jeong, Wooseong
    Kim, Jihun
    Kweon, Hyeokjun
    Yoon, Kuk-Jin
    IEEE ACCESS, 2025, 13 : 27641 - 27656