CIGNet: Category-and-Intrinsic-Geometry Guided Network for 3D coarse-to-fine reconstruction

被引:4
|
作者
Gao, Junna [1 ]
Kong, Dehui [1 ]
Wang, Shaofan [1 ]
Li, Jinghua [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
基金
中国国家自然科学基金;
关键词
3D reconstruction; 3D refinement; Graph convolutional network; Category prior; Geometry perception; OBJECT RECONSTRUCTION;
D O I
10.1016/j.neucom.2023.126607
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object reconstruction from arbitrary view intensity images is a challenging but meaningful research topic in computer vision. The main limitations of existing approaches are that they lack complete and efficient prior information and might not be able to deal with serious occlusion or partial observation of 3D objects, which may produce incomplete and unreliable reconstructions. To reconstruct structure and recover missing or unseen parts of objects, category prior and intrinsic geometry relation are particularly useful and necessary during the 3D reconstruction process. In this paper, we propose Category-and-Intrinsic-Geometry Guided Network (CIGNet) for 3D coarse-to-fine reconstruction from arbitrary view intensity images by leveraging category prior and intrinsic geometry relation. CIGNet combines a category prior guided reconstruction module with an intrinsic geometry relation guided refinement module. In the first reconstruction module, we leverage semantic class context by adding a supervision term over object categories to output coarse reconstructed results. In the second refinement module, we model the coarse 3D volumetric data as 2D slices and consider intrinsic geometry relations between them to design graph structures of coarse 3D volumes to finish the graph based refinement. CIGNet can accomplish high-quality 3D reconstruction tasks by exploring the intra-category characteristics of objects as well as the intrinsic geometry relations of each object, both of which serve as useful complements to the visual information of images, in a coarse-to-fine fashion. Extensive quantitative and qualitative experiments on a synthetic dataset ShapeNet and real-world datasets Pix3D, Statue Model Repository, and BlendedMVS indicate that CIGNet outperforms several state-of-the-art methods in terms of accuracy and detail recovery.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Photon-Efficient 3D Reconstruction with A Coarse-to-Fine Neural Network
    Guo, Shangwei
    Lai, Zhengchao
    Li, Jun
    Han, Shaokun
    OPTICS AND LASERS IN ENGINEERING, 2022, 159
  • [2] A Coarse-to-Fine Model for 3D Pose Estimation and Sub-category Recognition
    Mottaghi, Roozbeh
    Xiang, Yu
    Savarese, Silvio
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 418 - 426
  • [3] Coarse-to-fine pipeline for 3D wireframe reconstruction from point cloud
    Tan, Xuefeng
    Zhang, Dejun
    Tian, Long
    Wu, Yiqi
    Chen, Yilin
    COMPUTERS & GRAPHICS-UK, 2022, 106 : 288 - 298
  • [4] Coarse-to-fine cascaded 3D hand reconstruction based on SSGC and MHSA
    Yang, Wenji
    Xie, Liping
    Qian, Wenbin
    Wu, Canghai
    Yang, Hongyun
    VISUAL COMPUTER, 2025, 41 (01): : 11 - 24
  • [5] Coarse-to-Fine 3D Human Pose Estimation
    Guo, Yu
    Zhao, Lin
    Zhang, Shanshan
    Yang, Jian
    IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 579 - 592
  • [6] Coarse-to-fine multiview 3d face reconstruction using multiple geometrical features
    Dai, Peng
    Wang, Xue
    Zhang, Weihang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (01) : 939 - 966
  • [7] 3D mouse brain reconstruction from histology using a coarse-to-fine approach
    Yushkevich, Paul A.
    Avants, Brian B.
    Ng, Lydia
    Hawrylycz, Michael
    Burstein, Pablo D.
    Zhang, Hui
    Gee, James C.
    BIOMEDICAL IMAGE REGISTRATION, PROCEEDINGS, 2006, 4057 : 230 - 237
  • [8] Coarse-to-fine multiview 3d face reconstruction using multiple geometrical features
    Peng Dai
    Xue Wang
    Weihang Zhang
    Multimedia Tools and Applications, 2018, 77 : 939 - 966
  • [9] Coarse-to-fine stereo vision with accurate 3D boundaries
    Sizintsev, Mikhail
    Wildes, Richard P.
    IMAGE AND VISION COMPUTING, 2010, 28 (03) : 352 - 366
  • [10] A coarse-to-fine keypoint detection method for 3D model
    1600, International Frequency Sensor Association, 46 Thorny Vineway, Toronto, ON M2J 4J2, Canada (160):