Enhanced 3D Shape Reconstruction With Knowledge Graph of Category Concept

被引:1
|
作者
Sun, Guofei [1 ]
Wong, Yongkang [2 ]
Kankanhalli, Mohan S. [3 ]
Li, Xiangdong [4 ]
Geng, Weidong [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD & CG, Coll Comp Sci & Technol, Zheda Rd 38, Hangzhou 310027, Zhejiang, Peoples R China
[2] Natl Univ Singapore, Sch Comp, 3 Res Link,I4-0 Bldg, Singapore 117602, Singapore
[3] Natl Univ Singapore, Sch Comp, 11 Comp Dr,AS6 Bldg, Singapore 117416, Singapore
[4] Zhejiang Univ, Coll Comp Sci & Technol, Zheda Rd 38, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; 3D reconstruction; conceptual knowledge; SIMULTANEOUS LOCALIZATION;
D O I
10.1145/3491224
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reconstructing three-dimensional (3D) objects from images has attracted increasing attention due to its wide applications in computer vision and robotic tasks. Despite the promising progress of recent deep learning-based approaches, which directly reconstruct the full 3D shape without considering the conceptual knowledge of the object categories, existing models have limited usage and usually create unrealistic shapes. 3D objects have multiple forms of representation, such as 3D volume, conceptual knowledge, and so on. In this work, we show that the conceptual knowledge for a category of objects, which represents objects as prototype volumes and is structured by graph, can enhance the 3D reconstruction pipeline. We propose a novel multimodal framework that explicitly combines graph-based conceptual knowledge with deep neural networks for 3D shape reconstruction from a single RGB image. Our approach represents conceptual knowledge of a specific category as a structure-based knowledge graph. Specifically, conceptual knowledge acts as visual priors and spatial relationships to assist the 3D reconstruction framework to create realistic 3D shapes with enhanced details. Our 3D reconstruction framework takes an image as input. It first predicts the conceptual knowledge of the object in the image, then generates a 3D object based on the input image and the predicted conceptual knowledge. The generated 3D object satisfies the following requirements: (1) it is consistent with the predicted graph in concept, and (2) consistent with the input image in geometry. Extensive experiments on public datasets (i.e., ShapeNet, Pix3D, and Pascal3D+) with 13 object categories show that (1) our method outperforms the state-of-the-art methods, (2) our prototype volume-based conceptual knowledge representation is more effective, and (3) our pipeline-agnostic approach can enhance the reconstruction quality of various 3D shape reconstruction pipelines.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] 3D shape knowledge graph for cross-domain 3D shape retrieval
    Chang, Rihao
    Ma, Yongtao
    Hao, Tong
    Wang, Weijie
    Nie, Weizhi
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (05) : 1199 - 1216
  • [2] 3D Model Retrieval Based on a 3D Shape Knowledge Graph
    Nie, Weizhi
    Wang, Ya
    Song, Dan
    Li, Wenhui
    IEEE Access, 2020, 8 : 142632 - 142641
  • [3] 3D Model Retrieval Based on a 3D Shape Knowledge Graph
    Nie, Weizhi
    Wang, Ya
    Song, Dan
    Li, Wenhui
    IEEE ACCESS, 2020, 8 : 142632 - 142641
  • [4] 3D shape reconstruction from focus and enhanced focus volume
    Wang, Yuezong
    Zhang, Lu
    Chen, Jiqiang
    Zhang, Jialun
    MEASUREMENT, 2025, 245
  • [5] 3D shape modeling and reconstruction
    Wang, HY
    Ghosh, BK
    WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL 4, PROCEEDINGS, 1998, : 677 - 681
  • [6] Aspects of 3D Shape Reconstruction
    Stiller, Peter F.
    Arnold, Gregory
    Ferrara, Matthew
    COMPUTATIONAL IMAGING VII, 2009, 7246
  • [7] Category-Sensitive Incremental Learning for Image-Based 3D Shape Reconstruction
    Zhong, Yijie
    Sun, Zhengxing
    Luo, Shoutong
    Sun, Yunhan
    Zhang, Wei
    MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 231 - 244
  • [8] The Surprising Positive Knowledge Transfer in Continual 3D Object Shape Reconstruction
    Anh Thai
    Stojanov, Stefan
    Huang, Zixuan
    Rehg, James M.
    2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 209 - 218
  • [9] Reducing Intricacy of 3D Space for 3D Shape Reconstruction
    Muhammad, Mannan Saeed
    Choi, Tae-Sun
    INFRARED SYSTEMS AND PHOTOELECTRONIC TECHNOLOGY III, 2008, 7055
  • [10] Single-Image 3D Human Pose and Shape Estimation Enhanced by Clothed 3D Human Reconstruction
    Liu, Leyuan
    Gao, Yunqi
    Sun, Jianchi
    Chen, Jingying
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 33 - 44