Single-view 3D reconstruction method based on neural radiation field occlusion optimization

被引:0
|
作者
Chen, Zhijie [1 ]
Deng, Huiping [1 ]
Xiang, Sen [1 ]
Wu, Jin [1 ]
机构
[1] Wuhan Univ Sci & Technol, Sch Informat Sci & Engn, Wuhan 430081, Peoples R China
基金
中国国家自然科学基金;
关键词
neural radiation field; shading; null convolution; transformer;
D O I
10.37188/CJLCD.2023-0362
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
Single-view 3D reconstruction aims to restore the three-dimensional-dimensional geometry of an object or scene based on a single 2D image. The limited information provided by a single view often results in occlusion, , leading to blurred image features and inhibiting the accurate recovery of object appearance details. This paper introduces a framework, , called neural radiation field (NeRF ), that effectively addresses the occlusion problem by utilizing both global and local context information of the image. The proposed approach employs the Vision Transformer to capture long-range-range correlations and learn global features from the image. The Vision Transformer is combined with the SE channel attention mechanism module to prevent information loss and redundancy across multiple layers. Additionally, , a convolutional neural network is utilized to extract pixel-aligned local image features. The dilated convolution of the dilated pyramid pooling structure is employed to increase the receptive field, , extract multi-scale context information, , and provide more details for restoring occluded areas. Lastly, , a density feature aggregation module based on the Transformer architecture is designed to minimize inaccuracies in density prediction due to occlusion. Experimental results on the ShapeNet-NMR-NMR dataset demonstrate the method's 's ability to produce new views with enhanced details and exhibit strong generalization capabilities when applied to unseen objects.
引用
收藏
页码:1402 / 1410
页数:9
相关论文
共 18 条
  • [1] FWD: Real-time Novel View Synthesis with Forward Warping and Depth
    Cao, Ang
    Rockwell, Chris
    Johnson, Justin
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15692 - 15703
  • [2] MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo
    Chen, Anpei
    Xu, Zexiang
    Zhao, Fuqiang
    Zhang, Xiaoshuai
    Xiang, Fanbo
    Yu, Jingyi
    Su, Hao
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14104 - 14113
  • [3] Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes
    Chibane, Julian
    Bansal, Aayush
    Lazova, Verica
    Pons-Moll, Gerard
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7907 - 7916
  • [4] Dosovitskiy A., 2021, P ICLR
  • [5] Fast and Explicit Neural View Synthesis
    Guo, Pengsheng
    Bautista, Miguel Angel
    Colburn, Alex
    Yang, Liang
    Ulbricht, Daniel
    Susskind, Joshua M.
    Shan, Qi
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 11 - 20
  • [6] Vision Transformer for NeRF-Based View Synthesis from a Single Input Image
    Lin, Kai-En
    Lin Yen-Chen
    Lai, Wei-Sheng
    Lin, Tsung-Yi
    Shih, Yi-Chang
    Ramamoorthi, Ravi
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 806 - 815
  • [7] Neural Rays for Occlusion-aware Image-based Rendering
    Liu, Yuan
    Peng, Sida
    Liu, Lingjie
    Wang, Qianqian
    Wang, Peng
    Theobalt, Christian
    Zhou, Xiaowei
    Wang, Wenping
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7814 - 7823
  • [8] Mildenhall B, 2022, COMMUN ACM, V65, P99, DOI 10.1145/3503250
  • [9] Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision
    Niemeyer, Michael
    Mescheder, Lars
    Oechsle, Michael
    Geiger, Andreas
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3501 - 3512
  • [10] Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations
    Sajjadi, Mehdi S. M.
    Meyer, Henning
    Pot, Etienne
    Bergmann, Urs
    Greff, Klaus
    Radwan, Noha
    Vora, Suhani
    Lucic, Mario
    Duckworth, Daniel
    Dosovitskiy, Alexey
    Uszkoreit, Jakob
    Funkhouser, Thomas
    Tagliasacchi, Andrea
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6219 - 6228