IFA-Net: Isomerous Feature-aware Network for Single-view 3D Reconstruction

被引：0

作者：

Zhang, Zecheng ^{[1
]}

Han, Xianfeng ^{[1
]}

Xiao, Guoqian ^{[1
]}

机构：

[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China

来源：

2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年

关键词：

3D Reconstruction; Single-view; Vision transformer; Convolutional neural networks; Feature-aware;

D O I：

10.1109/IJCNN54540.2023.10191001

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Single-view 3D reconstruction has long been an intractable and fundamental problem in computer vision. Objects with complex topological structures are difficult to be accurately reconstructed, which makes the existing methods suffer from blurred shape boundaries between multiple components in the object. Recently, convolutional neural network and vision transformer have begun to appear in the field of 3D reconstruction and have been widely used with excellent performance. However, the existing transformer-based methods mainly focus on the global long-term context dependency, and ignore the local details of the part space features, resulting in poor reconstruction of the detail part. In this paper, we propose a novel dual-branch network architecture, called IFA-Net, to capture local spatial perception information and retain global structural features for singleview 3D reconstruction. In addition, we propose an isomerous feature-aware module, which enables the dynamic fusion of different resolution features under the two branches. Thus, high-fidelity and detail-rich 3D object reconstruction can be achieved. Extensive experimental results demonstrate that our method is able to produce high-quality voxels, particularly with diverse topologies, as compared with the state-of-the-art methods.

引用

页数：8

共 27 条

[1] Bozic A, 2021, ADV NEUR IN
[2] Learning Implicit Fields for Generative Shape Modeling
Chen, Zhiqin
Zhang, Hao
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5932 - 5941
[3] 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction
Choy, Christopher B.
Xu, Danfei
Gwak, Jun Young
Chen, Kevin
Savarese, Silvio
[J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 628 - 644
[4] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[5] A Papier-Mache Approach to Learning 3D Surface Generation
Groueix, Thibault
Fisher, Matthew
Kim, Vladimir G.
Russell, Bryan C.
Aubry, Mathieu
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 216 - 224
[6] Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era
Han, Xian-Feng
Laga, Hamid
Bennamoun, Mohammed
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1578 - 1604
[7] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[8] Hendrycks D, 2020, Arxiv, DOI arXiv:1606.08415
[9] Kar A, 2017, ADV NEUR IN, V30
[10] Kingma D. P., 2014, arXiv

← 1 2 3 →