Self-supervised real-time depth restoration for consumer-grade sensors

被引:0
作者
Duarte, Alexandre [1 ]
Fernandes, Francisco [2 ]
Pereira, Joao M. [1 ,2 ]
Moreira, Catarina [2 ,4 ]
Nascimento, Jacinto C. [1 ,3 ]
Jorge, Joaquim [1 ,2 ]
机构
[1] Univ Lisboa IST UL, Inst Super Tecn, P-1000029 Lisbon, Portugal
[2] Inst Engn Sistemas & Comp Invest & Desenvolvimento, P-1000029 Lisbon, Portugal
[3] Univ Tecn Lisboa IST UL, ISR, Inst Super Tecn, P-1049001 Lisbon, Portugal
[4] Univ Technol Sydney, Human Technol Inst, Sydney, Australia
关键词
Deep learning; Self-supervised learning; Image denoising; Image reconstruction; RGB-D sensors; CALIBRATION;
D O I
10.1007/s11554-024-01491-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depth maps produced by consumer-grade sensors suffer from inaccurate measurements and missing data from either system or scene-specific sources. Data-driven denoising algorithms can mitigate such problems; however, they require vast amounts of ground truth depth data. Recent research has tackled this limitation using self-supervised learning techniques, but it requires multiple RGB-D sensors. Moreover, most existing approaches focus on denoising single isolated depth maps or specific subjects of interest highlighting a need for methods that can effectively denoise depth maps in real-time dynamic environments. This paper extends state-of-the-art approaches for depth-denoising commodity depth devices, proposing SelfReDepth, a self-supervised deep learning technique for depth restoration, via denoising and hole-filling by inpainting of full-depth maps captured with RGB-D sensors. The algorithm targets depth data in video streams, utilizing multiple sequential depth frames coupled with color data to achieve high-quality depth videos with temporal coherence. Finally, SelfReDepth is designed to be compatible with various RGB-D sensors and usable in real-time scenarios as a pre-processing step before applying other depth-dependent algorithms. Our results demonstrate our approach's real-time performance on real-world datasets shows that it outperforms state-of-the-art methods in denoising and restoration performance at over 30 fps on Commercial Depth Cameras, with potential benefits for augmented and mixed-reality applications.
引用
收藏
页数:14
相关论文
共 64 条
  • [1] Metzler CA, 2020, Arxiv, DOI arXiv:1805.10531
  • [2] Robust Intrinsic and Extrinsic Calibration of RGB-D Cameras
    Basso, Filippo
    Menegatti, Emanuele
    Pretto, Alberto
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2018, 34 (05) : 1315 - 1332
  • [3] Batson J, 2019, PR MACH LEARN RES, V97
  • [4] Bertalmio M., 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, pI, DOI 10.1109/CVPR.2001.990497
  • [5] Buyue Zhang, 2007, Proceedings 2007 IEEE International Conference on Image Processing, ICIP 2007, P417
  • [6] Improved Noise2Noise Denoising with Limited Data
    Calvarons, Adria Font
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 796 - 805
  • [7] Capecci M, 2016, 2016 3RD IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS, P256, DOI 10.1109/BHI.2016.7455883
  • [8] Cha SM, 2021, Arxiv, DOI arXiv:1905.10488
  • [9] Chambolle A, 2004, J MATH IMAGING VIS, V20, P89
  • [10] Matterport3D: Learning from RGB-D Data in Indoor Environments
    Chang, Angel
    Dai, Angela
    Funkhouser, Thomas
    Halber, Maciej
    Niessner, Matthias
    Savva, Manolis
    Song, Shuran
    Zeng, Andy
    Zhang, Yinda
    [J]. PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 667 - 676