SS3DNet-AF: A Single-Stage, Single-View 3D Reconstruction Network with Attention-Based Fusion

被引:0
作者
Shoukat, Muhammad Awais [1 ]
Sargano, Allah Bux [1 ]
Malyshev, Alexander [2 ]
You, Lihua [3 ]
Habib, Zulfiqar [1 ]
机构
[1] Comsats Univ Islamabad, Dept Comp Sci, Lahore Campus, Lahore 54000, Pakistan
[2] Univ Bergen, Dept Math, N-5020 Bergen, Norway
[3] Bournemouth Univ, Natl Ctr Comp Animat, Poole BH12 5BB, England
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 23期
关键词
SS3DNet-AF; 3D reconstruction; attention-based fusion; point clouds; IMAGE;
D O I
10.3390/app142311424
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Learning object shapes from a single image is challenging due to variations in scene content, geometric structures, and environmental factors, which create significant disparities between 2D image features and their corresponding 3D representations, hindering the effective training of deep learning models. Existing learning-based approaches can be divided into two-stage and single-stage methods, each with limitations. Two-stage methods often rely on generating intermediate proposals by searching for similar structures across the entire dataset, a process that is computationally expensive due to the large search space and high-dimensional feature-matching requirements, further limiting flexibility to predefined object categories. In contrast, single-stage methods directly reconstruct 3D shapes from images without intermediate steps, but they struggle to capture complex object geometries due to high feature loss between image features and 3D shapes and limit their ability to represent intricate details. To address these challenges, this paper introduces SS3DNet-AF, a single-stage, single-view 3D reconstruction network with an attention-based fusion (AF) mechanism to enhance focus on relevant image features, effectively capturing geometric details and generalizing across diverse object categories. The proposed method is quantitatively evaluated using the ShapeNet dataset, demonstrating its effectiveness in achieving accurate 3D reconstructions while overcoming the computational challenges associated with traditional approaches.
引用
收藏
页数:16
相关论文
共 41 条
  • [1] Pixel2point: 3D Object Reconstruction From a Single Image Using CNN and Initial Sphere
    Afifi, Ahmed J.
    Magnusson, Jannes
    Soomro, Toufique A.
    Hellwich, Olaf
    [J]. IEEE ACCESS, 2021, 9 : 110 - 121
  • [2] Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age
    Cadena, Cesar
    Carlone, Luca
    Carrillo, Henry
    Latif, Yasir
    Scaramuzza, Davide
    Neira, Jose
    Reid, Ian
    Leonard, John J.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2016, 32 (06) : 1309 - 1332
  • [3] 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction
    Choy, Christopher B.
    Xu, Danfei
    Gwak, Jun Young
    Chen, Kevin
    Savarese, Silvio
    [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 628 - 644
  • [4] Eigen D., 2014, arXiv
  • [5] A Point Set Generation Network for 3D Object Reconstruction from a Single Image
    Fan, Haoqiang
    Su, Hao
    Guibas, Leonidas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2463 - 2471
  • [6] Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network
    Feng, Yao
    Wu, Fan
    Shao, Xiaohu
    Wang, Yanfeng
    Zhou, Xi
    [J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 557 - 574
  • [7] Learning a Predictable and Generative Vector Representation for Objects
    Girdhar, Rohit
    Fouhey, David F.
    Rodriguez, Mikel
    Gupta, Abhinav
    [J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 484 - 499
  • [8] Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era
    Han, Xian-Feng
    Laga, Hamid
    Bennamoun, Mohammed
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1578 - 1604
  • [9] Hartley R., 2003, MULTIPLE VIEW GEOMET, DOI 10.1016/S0143-8166(01)00145-2
  • [10] Single-View Reconstruction via Joint Analysis of Image and Shape Collections
    Huang, Qixing
    Wang, Hai
    Koltun, Vladlen
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (04):