A Coarse-to-Fine Transformer-Based Network for 3D Reconstruction from Non-Overlapping Multi-View Images

被引:1
|
作者
Shan, Yue [1 ]
Xiao, Jun [1 ]
Liu, Lupeng [1 ]
Wang, Yunbiao [1 ]
Yu, Dongbo [1 ]
Zhang, Wenniu [1 ]
机构
[1] Univ Chinese Acad & Sci, Sch Artificial Intelligence, 19 Yuquan Rd, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
point cloud reconstruction; Transformer; non-overlapping; multi-view; POINT CLOUD RECONSTRUCTION; SHAPE;
D O I
10.3390/rs16050901
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Reconstructing 3D structures from non-overlapping multi-view images is a crucial task in the field of 3D computer vision, since it is difficult to establish feature correspondences and infer depth from overlapping parts of views. Previous methods, whether generating the surface mesh or volume of an object, face challenges in simultaneously ensuring the accuracy of detailed topology and the integrity of the overall structure. In this paper, we introduce a novel coarse-to-fine Transformer-based reconstruction network to generate precise point clouds from multiple input images at sparse and non-overlapping viewpoints. Specifically, we firstly employ a general point cloud generation architecture enhanced by the concept of adaptive centroid constraint for the coarse point cloud corresponding to the object. Subsequently, a Transformer-based refinement module applies deformation to each point. We design an attention-based encoder to encode both image projection features and point cloud geometric features, along with a decoder to calculate deformation residuals. Experiments on ShapeNet demonstrate that our proposed method outperforms other competing methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Disentangling 3D/4D Facial Affect Recognition With Faster Multi-View Transformer
    Behzad, Muzammil
    Li, Xiaobai
    Zhao, Guoying
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1913 - 1917
  • [42] On Multiple-view Matrix Based 3D Reconstruction from Multiple-view Images
    Huang, Huimin
    Zhao, Ruibin
    Pang, Mingyong
    2018 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2018, : 114 - 119
  • [43] Learning to Implicitly Represent 3D Human Body From Multi-scale Features and Multi-view Images
    Li, Zhongguo
    Oskarsson, Magnus
    Heyden, Anders
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8968 - 8975
  • [44] Multi-view 3D scene reconstruction using ant colony optimization techniques
    Chrysostomou, Dimitrios
    Gasteratos, Antonios
    Nalpantidis, Lazaros
    Sirakoulis, Georgios C.
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2012, 23 (11)
  • [45] Detailed 3D human body reconstruction from multi-view images combining voxel super-resolution and learned implicit representation
    Li, Zhongguo
    Oskarsson, Magnus
    Heyden, Anders
    APPLIED INTELLIGENCE, 2022, 52 (06) : 6739 - 6759
  • [46] High-Accuracy Mapping Design Based on Multi-view Images and 3D LiDAR Point Clouds
    Chen, Jian-Hong
    Lin, Guo-Han
    Yelamandala, Chitra Meghala
    Fan, Yu-Cheng
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2020, : 90 - 91
  • [47] Multi-view indoor human behavior recognition based on 3D skeleton
    Peng, Ling
    Lu, Tongwei
    Min, Feng
    MIPPR 2015: PATTERN RECOGNITION AND COMPUTER VISION, 2015, 9813
  • [48] 3D Object Detection based on Multi-View Feature Point Matching
    Yang, Tian
    Sang, Xinzhu
    Chen, Duo
    Guo, Nan
    Wang, Peng
    Yu, Xunbo
    Yan, Binbin
    Wang, Kuiru
    Yu, Chongxiu
    AI IN OPTICS AND PHOTONICS (AOPC 2019), 2019, 11342
  • [49] 3D Object Retrieval Based on Multi-View Latent Variable Model
    Liu, An-An
    Nie, Wei-Zhi
    Su, Yu-Ting
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (03) : 868 - 880
  • [50] NRTR: Neuron Reconstruction With Transformer From 3D Optical Microscopy Images
    Wang, Yijun
    Lang, Rui
    Li, Rui
    Zhang, Junsong
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (02) : 886 - 898