On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks

被引:8
|
作者
Jung, HyunJun [1 ]
Ruhkamp, Patrick [1 ,2 ]
Zhai, Guangyao [1 ]
Brasch, Nikolas [1 ]
Li, Yitong [1 ]
Verdie, Yannick [1 ,3 ]
Song, Jifei [3 ]
Zhou, Yiren [3 ]
Armagan, Anil [3 ]
Ilic, Slobodan [1 ,4 ]
Leonardis, Ales [3 ]
Navab, Nassir [1 ]
Busam, Benjamin [1 ,2 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] Dwe Ai, Munich, Germany
[3] Huawei Noahs Ark Lab, Montreal, PQ, Canada
[4] Siemens AG, Munich, Germany
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年
关键词
D O I
10.1109/CVPR52729.2023.00082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning-based methods to solve dense 3D vision problems typically train on 3D sensor data. The respectively used principle of measuring distances provides advantages and drawbacks. These are typically not compared nor discussed in the literature due to a lack of multi-modal datasets. Texture-less regions are problematic for structure from motion and stereo, reflective material poses issues for active sensing, and distances for translucent objects are intricate to measure with existing hardware. Training on inaccurate or corrupt data induces model bias and hampers generalisation capabilities. These effects remain unnoticed if the sensor measurement is considered as ground truth during the evaluation. This paper investigates the effect of sensor errors for the dense 3D vision tasks of depth estimation and reconstruction. We rigorously show the significant impact of sensor characteristics on the learned predictions and notice generalisation issues arising from various technologies in everyday household environments. For evaluation, we introduce a carefully designed dataset1 comprising measurements from commodity sensors, namely D-ToF, I-ToF, passive/active stereo, and monocular RGB+P. Our study quantifies the considerable sensor noise impact and paves the way to improved dense vision estimates and targeted data fusion.
引用
收藏
页码:780 / 791
页数:12
相关论文
共 50 条
  • [21] Accurate recovery of dense depth map for 3D motion based coding
    Georgis, N
    Kittler, J
    Bober, M
    EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 2000, 11 (02): : 219 - 232
  • [23] A Quality Metric for 3D LiDAR Point Cloud based on Vision Tasks
    Zhao, Heng
    Yang, Qi
    Zhu, Wenjie
    Xu, Yiling
    2020 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2020,
  • [24] A dynamic programming algorithm applied to omnidirectional vision for dense 3D reconstruction
    Boutteau, R.
    Savatier, X.
    Ertaud, J. Y.
    2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 927 - 931
  • [25] An efficient dense descriptor applied to 3D vision implemented on parallel computing
    Rosas-Miranda, Dario I.
    Ponomaryov, Volodymyr I.
    Robles-Gonzalez, Cesar M. A.
    REAL-TIME IMAGE AND VIDEO PROCESSING 2018, 2018, 10670
  • [26] Dense 3D Semantic SLAM of traffic environment based on stereo vision
    Li, Linhui
    Liu, Zhijie
    Ozguner, Umit
    Lian, Jing
    Zhou, Yafu
    Zhao, Yibing
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 965 - 970
  • [27] Evaluation of an appearance-based 3D face tracker using dense 3D data
    Fadi Dornaika
    Angel D. Sappa
    Machine Vision and Applications, 2008, 19 : 427 - 441
  • [28] Evaluation of an appearance-based 3D face tracker using dense 3D data
    Dornaika, Fadi
    Sappa, Angel D.
    MACHINE VISION AND APPLICATIONS, 2008, 19 (5-6) : 427 - 441
  • [29] Accurate 3D shape measurement of multiple separate objects with stereo vision
    Kieu, Hien
    Pan, Tongyan
    Wang, Zhaoyang
    Le, Minh
    Nguyen, Hieu
    Vo, Minh
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2014, 25 (03)
  • [30] Accurate Indoor 3D Location Based on MEMS/Vision by Using A Smartphone
    Shu, Mingcong
    Chen, Guoliang
    Zhang, Zhenghua
    Xu, Lei
    2022 IEEE 12TH INTERNATIONAL CONFERENCE ON INDOOR POSITIONING AND INDOOR NAVIGATION (IPIN 2022), 2022,