Guard-Net: Lightweight Stereo Matching Network via Global and Uncertainty-Aware Refinement for Autonomous Driving

被引:3
作者
Liu, Yujun [1 ]
Zhang, Xiangchen [1 ]
Luo, Yang [1 ]
Hao, Qiaoqiao [1 ]
Su, Jinhe [1 ]
Cai, Guorong [1 ]
机构
[1] Jimei Univ, Sch Comp Engn, Xiamen 361021, Peoples R China
基金
中国国家自然科学基金;
关键词
Correlation; Autonomous vehicles; Costs; Uncertainty; Solid modeling; Transformers; Optimization; Stereo matching; global feature; disparity refinement; intelligent transportation; autonomous driving;
D O I
10.1109/TITS.2024.3357841
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Stereo matching is a prominent research area in autonomous driving and computer vision. Despite significant progress made by learning-based methods, accurately predicting disparities in hazardous regions, which is crucial for ensuring safe vehicle operation, remains challenging. The limitations of methods based on Convolutional Neural Networks (CNNs) are most noticeable in textureless regions and repetitive patterns, leading to unreliable predictions. Furthermore, calculating disparities for boundaries and thin structures, where the disparity jump phenomenon is prominent remains difficult. To address these issues, we propose a lightweight stereo matching architecture that focuses on obtaining real-time and high-precision disparity maps in hazardous areas. We exploit an efficient global enhanced path to provide global representations in ill-posed regions, where CNN-based approaches often struggle. Second, our model integrates local and global features to generate more reliable cost volume. Finally, our innovative uncertainty-aware module refines disparity, making full use of high-frequency detailed information and uncertainty attention, effectively preserving complex structures. Comprehensive experimental studies on SceneFlow demonstrate our method outperforms state-of-the-art methods, achieving an End-Point Error (EPE) of 0.47 with only 3.60M parameters. The effectiveness of our method speed-accuracy trade-off is further confirmed by competitive results obtained from the KITTI 2012 and KITTI 2015 experiments. Code is available at: https://github.com/YJLCV/Guard-Net.
引用
收藏
页码:10260 / 10273
页数:14
相关论文
共 65 条
  • [1] Adam P., 2017, P 31 C NEUR INF PROC, P1, DOI DOI 10.1145/3434309
  • [2] Correlate-and-Excite: Real-Time Stereo Matching via Guided Cost Volume Excitation
    Bangunharcana, Antyanta
    Cho, Jae Won
    Lee, Seokju
    Kweon, In So
    Kim, Kyung-Soo
    Kim, Soohyun
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3542 - 3548
  • [3] Perception in Disparity: An Efficient Navigation Framework for Autonomous Vehicles With Stereo Cameras
    Cao, Teng
    Xiang, Zhi-Yu
    Liu, Ji-Lin
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (05) : 2935 - 2948
  • [4] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [5] Pyramid Stereo Matching Network
    Chang, Jia-Ren
    Chen, Yong-Sheng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5410 - 5418
  • [6] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
    Chen, Jierun
    Kao, Shiu-Hong
    He, Hao
    Zhuo, Weipeng
    Wen, Song
    Lee, Chul-Ho
    Chan, S. -H. Gary
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
  • [7] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587]
  • [8] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [9] Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation
    Chen, Liyan
    Wang, Weihan
    Mordohai, Philippos
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17235 - 17244
  • [10] Multi-Dimensional Cooperative Network for Stereo Matching
    Chen, Wei
    Jia, Xiaogang
    Wu, Mingfei
    Liang, Zhengfa
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01): : 581 - 587