On the Road to Large-Scale 3D Monocular Scene Reconstruction using Deep Implicit Functions

被引:1
|
作者
Roddick, Thomas [1 ]
Biggs, Benjamin [1 ]
Reino, Daniel Olmeda [2 ]
Cipolla, Roberto [1 ]
机构
[1] Univ Cambridge, Cambridge, England
[2] Toyota Motor Europe, Brussels, Belgium
关键词
D O I
10.1109/ICCVW54120.2021.00322
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autonomous driving relies on building detailed models of a vehicles surroundings, including all hazards, obstacles and other road users. At present, much of the autonomous driving literature reduces the world to a collection of parametric 3D boxes. While this framework is sufficient for many driving scenarios, other important scene details (e.g. overhanging structures, open car doors, debris, potholes etc.) are not modelled. Recently deep implicit functions have been shown to be suitable for representing fine grained details at arbitrarily high resolutions using images alone. However, they have predominantly been employed in constrained situations, such as reconstructing individual objects or small-scale scenes. In this work we explore the application of deep implicit functions to larger scenes in the context of real-world autonomous driving scenarios. In particular we focus on the challenging case where only monocular images are available at test time. While most implicit function networks rely on watertight meshes for training, these are not in general available for real world scenes. We therefore propose an alternative training scheme using LiDAR to provide approximate ground truth occupancy supervision. We also show that incorporating priors such as pre-detected object bounding boxes can improve the quality of reconstruction. Our method is evaluated on a real-world autonomous driving dataset.
引用
收藏
页码:2875 / 2884
页数:10
相关论文
共 50 条
  • [21] Efficient convex optimization-based texture mapping for large-scale 3D scene reconstruction
    Sheng, Xin
    Yuan, Jing
    Tao, Wenbing
    Tao, Bo
    Liu, Liman
    INFORMATION SCIENCES, 2021, 556 : 143 - 159
  • [22] Long Range Pooling for 3D Large-Scale Scene Understanding
    Li, Xiang-Li
    Guo, Meng-Hao
    Mu, Tai-Jiang
    Martin, Ralph R.
    Hu, Shi-Min
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10300 - 10311
  • [23] Deep Implicit Moving Least-Squares Functions for 3D Reconstruction
    Liu, Shi-Lin
    Guo, Hao-Xiang
    Pan, Hao
    Wang, Peng-Shuai
    Tong, Xin
    Liu, Yang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1788 - 1797
  • [24] Power Bundle Adjustment for Large-Scale 3D Reconstruction
    Weber, Simon
    Demmel, Nikolaus
    Chan, Tin Chon
    Cremers, Daniel
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 281 - 289
  • [25] Large-scale outdoor 3D reconstruction on a mobile device
    Schoeps, Thomas
    Sattler, Torsten
    Hane, Christian
    Pollefeys, Marc
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 157 : 151 - 166
  • [26] A monocular thoracoscopic 3D scene reconstruction framework based on NeRF
    Han, Juntao
    Zhang, Ziming
    Tan, Wenjun
    Wang, Yufei
    Li, Mingxiao
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2025,
  • [27] Drone-NeRF: Efficient NeRF based 3D scene reconstruction for large-scale drone survey
    Jia, Zhihao
    Wang, Bing
    Chen, Changhao
    IMAGE AND VISION COMPUTING, 2024, 143
  • [28] Neural Surfel Reconstruction: Addressing Loop Closure Challenges in Large-Scale 3D Neural Scene Mapping
    Cui, Jiadi
    Zhang, Jiajie
    Kneip, Laurent
    Schwertfeger, Soren
    SENSORS, 2024, 24 (21)
  • [29] Fast and Seamless Large-scale Aerial 3D Reconstruction using Graph Framework
    Xie, Xiuchuan
    Yang, Tao
    Li, Jing
    Ren, Qiang
    Zhang, Yanning
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS PROCESSING (ICIGP 2018), 2018, : 126 - 130
  • [30] ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans
    Dai, Angela
    Ritchie, Daniel
    Bokeloh, Martin
    Reed, Scott
    Sturm, Juergen
    Niessner, Matthias
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4578 - 4587