Unsupervised Hierarchical Iterative Tile Refinement Network With 3D Planar Segmentation Loss

被引:1
作者
Yang, Ruizhi [1 ,2 ,3 ]
Li, Xingqiang [1 ,2 ,4 ]
Cong, Rigang [1 ,2 ,4 ]
Du, Jinsong [1 ,2 ,4 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
[2] Key Lab Intelligent Detect & Equipment Technol Lia, Shenyang 110169, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Training; Feature extraction; Task analysis; Real-time systems; Network architecture; Image edge detection; Real-time stereo matching; unsupervised learning; unsupervised loss function; robotic vision;
D O I
10.1109/LRA.2024.3359545
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Unsupervised real-time stereo matching is of great research value in robot navigation due to its independence from ground truth and real-time efficiency. The core challenge lies in the design of loss functions that can provide accurate guidance and the efficient network architectures. The commonly used photometric loss is prone to provide incorrect guidance because of the influence of reflection, left-right color inconsistency, low texture, and occlusion. As a weak supplement, the smoothness loss can ameliorate the multi-solution problems caused by low texture, but it is not effective for strong incorrect guidance caused by the other problems. In order to provide more accurate and powerful supplementary guidance, a 3D planar segmentation loss is proposed with advancements in addressing the strong incorrect guidance problem, which could be generally integrated into traditional unsupervised training losses. Furthermore, the real-time stereo matching approach of the hierarchical iterative tile refinement network is applied to unsupervised stereo matching, with necessary modifications to address the detrimental architectures that hinder its performance in unsupervised training. Experimental results verify the effectiveness of the 3D planar segmentation loss and the network modification. The proposed pipeline achieves competitive accuracy compared to existing unsupervised stereo matching methods while maintaining real-time efficiency.
引用
收藏
页码:2678 / 2685
页数:8
相关论文
共 32 条
[2]   Pyramid Stereo Matching Network [J].
Chang, Jia-Ren ;
Chen, Yong-Sheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5410-5418
[3]   Learning Depth with Convolutional Spatial Propagation Network [J].
Cheng, Xinjing ;
Wang, Peng ;
Yang, Ruigang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (10) :2361-2379
[4]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[5]   Digging Into Self-Supervised Monocular Depth Estimation [J].
Godard, Clement ;
Mac Aodha, Oisin ;
Firman, Michael ;
Brostow, Gabriel .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3827-3837
[6]   Occlusions, Motion and Depth Boundaries with a Generic Network for Disparity, Optical Flow or Scene Flow Estimation [J].
Ilg, Eddy ;
Saikia, Tonmoy ;
Keuper, Margret ;
Brox, Thomas .
COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 :626-643
[7]   StereoNet: Guided Hierarchical Refinement for Real-Time Edge-Aware Depth Prediction [J].
Khamis, Sameh ;
Fanello, Sean ;
Rhemann, Christoph ;
Kowdle, Adarsh ;
Valentin, Julien ;
Izadi, Shahram .
COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 :596-613
[8]  
Kingma D. P., 2015, P 2015 INT C LEARNIN
[9]  
Lagisetty R, 2013, IEEE INTL CONF CONTR, P605, DOI 10.1109/CCA.2013.6662816
[10]   Occlusion Aware Stereo Matching via Cooperative Unsupervised Learning [J].
Li, Ang ;
Yuan, Zejian .
COMPUTER VISION - ACCV 2018, PT VI, 2019, 11366 :197-213