On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks

Cited by: 8
Authors
Jung, HyunJun [1]
Ruhkamp, Patrick [1,2]
Zhai, Guangyao [1]
Brasch, Nikolas [1]
Li, Yitong [1]
Verdie, Yannick [1,3]
Song, Jifei [3]
Zhou, Yiren [3]
Armagan, Anil [3]
Ilic, Slobodan [1,4]
Leonardis, Ales [3]
Navab, Nassir [1]
Busam, Benjamin [1,2]
Affiliations
[1] Tech Univ Munich, Munich, Germany
[2] Dwe Ai, Munich, Germany
[3] Huawei Noahs Ark Lab, Montreal, PQ, Canada
[4] Siemens AG, Munich, Germany
Source
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023
DOI
10.1109/CVPR52729.2023.00082
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Learning-based methods to solve dense 3D vision problems typically train on 3D sensor data. The respectively used principle of measuring distances provides advantages and drawbacks. These are typically not compared nor discussed in the literature due to a lack of multi-modal datasets. Texture-less regions are problematic for structure from motion and stereo, reflective material poses issues for active sensing, and distances for translucent objects are intricate to measure with existing hardware. Training on inaccurate or corrupt data induces model bias and hampers generalisation capabilities. These effects remain unnoticed if the sensor measurement is considered as ground truth during the evaluation. This paper investigates the effect of sensor errors for the dense 3D vision tasks of depth estimation and reconstruction. We rigorously show the significant impact of sensor characteristics on the learned predictions and notice generalisation issues arising from various technologies in everyday household environments. For evaluation, we introduce a carefully designed dataset comprising measurements from commodity sensors, namely D-ToF, I-ToF, passive/active stereo, and monocular RGB+P. Our study quantifies the considerable sensor noise impact and paves the way to improved dense vision estimates and targeted data fusion.
Pages: 780-791
Page count: 12