A Coarse-to-Fine Transformer-Based Network for 3D Reconstruction from Non-Overlapping Multi-View Images

被引:1
|
作者
Shan, Yue [1 ]
Xiao, Jun [1 ]
Liu, Lupeng [1 ]
Wang, Yunbiao [1 ]
Yu, Dongbo [1 ]
Zhang, Wenniu [1 ]
机构
[1] Univ Chinese Acad & Sci, Sch Artificial Intelligence, 19 Yuquan Rd, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
point cloud reconstruction; Transformer; non-overlapping; multi-view; POINT CLOUD RECONSTRUCTION; SHAPE;
D O I
10.3390/rs16050901
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Reconstructing 3D structures from non-overlapping multi-view images is a crucial task in the field of 3D computer vision, since it is difficult to establish feature correspondences and infer depth from overlapping parts of views. Previous methods, whether generating the surface mesh or volume of an object, face challenges in simultaneously ensuring the accuracy of detailed topology and the integrity of the overall structure. In this paper, we introduce a novel coarse-to-fine Transformer-based reconstruction network to generate precise point clouds from multiple input images at sparse and non-overlapping viewpoints. Specifically, we firstly employ a general point cloud generation architecture enhanced by the concept of adaptive centroid constraint for the coarse point cloud corresponding to the object. Subsequently, a Transformer-based refinement module applies deformation to each point. We design an attention-based encoder to encode both image projection features and point cloud geometric features, along with a decoder to calculate deformation residuals. Experiments on ShapeNet demonstrate that our proposed method outperforms other competing methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Multi-View Normal Field Integration for 3D Reconstruction of Mirroring Objects
    Weinmann, Michael
    Osep, Aljosa
    Ruiters, Roland
    Klein, Reinhard
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2504 - 2511
  • [32] Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection
    Chen, Zehui
    Li, Zhenyu
    Zhang, Shiquan
    Fang, Liangji
    Jiang, Qinhong
    Zhao, Feng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5999 - 6008
  • [33] Wide-Baseline Multi-View Video Segmentation for 3D Reconstruction
    Sarim, Muhammad
    Hilton, Adrian
    Guillemaut, Jean-Yves
    Kim, Hansung
    Takai, Takeshi
    PROCEEDINGS OF THE 2010 ACM WORKSHOP ON 3D VIDEO PROCESSING (3DVP'10), 2010, : 13 - 18
  • [34] Efficient Hierarchical Multi-view Fusion Transformer for 3D Human Pose Estimation
    Zhou, Kangkang
    Zhang, Lijun
    Lu, Feng
    Zhou, Xiang-Dong
    Shi, Yu
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7512 - 7520
  • [35] Research on Viewpoint Planning Method for Multi-view Image 3D Reconstruction
    Shi, Yun
    Zhu, Yanyan
    MANUFACTURING TECHNOLOGY, 2023, 23 (04): : 532 - 537
  • [36] Hierarchical Denoising Method of Crop 3D Point Cloud Based on Multi-view Image Reconstruction
    Chen, Lei
    Yuan, Yuan
    Song, Shide
    COMPUTER AND COMPUTING TECHNOLOGIES IN AGRICULTURE XI, PT I, 2019, 545 : 416 - 427
  • [37] Dynamic Grouping With Multi-Manifold Attention for Multi-View 3D Object Reconstruction
    Kalitsios, Georgios
    Konstantinidis, Dimitrios
    Daras, Petros
    Dimitropoulos, Kosmas
    IEEE ACCESS, 2024, 12 : 160690 - 160699
  • [38] Multi-View Hierarchical Fusion Network for 3D Object Retrieval and Classification
    Liu, An-An
    Hu, Nian
    Song, Dan
    Guo, Fu-Bin
    Zhou, He-Yu
    Hao, Tong
    IEEE ACCESS, 2019, 7 : 153021 - 153030
  • [39] HAND-3D-STUDIO: A NEW MULTI-VIEW SYSTEM FOR 3D HAND RECONSTRUCTION
    Zhao, Zhengyi
    Wang, Tianyao
    Xia, Siyu
    Wang, Yangang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2478 - 2482
  • [40] An Improved Multi-View Convolutional Neural Network for 3D Object Retrieval
    He, Xinwei
    Bai, Song
    Chu, Jiajia
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7917 - 7930