Guard-Net: Lightweight Stereo Matching Network via Global and Uncertainty-Aware Refinement for Autonomous Driving

Cited by: 3

Authors:

Liu, Yujun [1]

Zhang, Xiangchen [1]

Luo, Yang [1]

Hao, Qiaoqiao [1]

Su, Jinhe [1]

Cai, Guorong [1]

Affiliations:

[1] Jimei Univ, Sch Comp Engn, Xiamen 361021, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024 / Vol. 25 / No. 08

Funding:

National Natural Science Foundation of China;

Keywords:

Correlation; Autonomous vehicles; Costs; Uncertainty; Solid modeling; Transformers; Optimization; Stereo matching; global feature; disparity refinement; intelligent transportation; autonomous driving;

DOI:

10.1109/TITS.2024.3357841

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813;

Abstract:

Stereo matching is a prominent research area in autonomous driving and computer vision. Despite significant progress made by learning-based methods, accurately predicting disparities in hazardous regions, which is crucial for ensuring safe vehicle operation, remains challenging. The limitations of methods based on Convolutional Neural Networks (CNNs) are most noticeable in textureless regions and repetitive patterns, leading to unreliable predictions. Furthermore, calculating disparities for boundaries and thin structures, where the disparity jump phenomenon is prominent, remains difficult. To address these issues, we propose a lightweight stereo matching architecture that focuses on obtaining real-time and high-precision disparity maps in hazardous areas. First, we exploit an efficient global enhanced path to provide global representations in ill-posed regions, where CNN-based approaches often struggle. Second, our model integrates local and global features to generate a more reliable cost volume. Finally, our innovative uncertainty-aware module refines the disparity, making full use of high-frequency detail information and uncertainty attention, effectively preserving complex structures. Comprehensive experimental studies on SceneFlow demonstrate that our method outperforms state-of-the-art methods, achieving an End-Point Error (EPE) of 0.47 with only 3.60M parameters. The effectiveness of our method's speed-accuracy trade-off is further confirmed by competitive results obtained from the KITTI 2012 and KITTI 2015 experiments. Code is available at: https://github.com/YJLCV/Guard-Net.
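For readers unfamiliar with the End-Point Error (EPE) metric quoted in the abstract, the following is a minimal sketch of how it is commonly computed: the mean absolute difference between predicted and ground-truth disparity over valid pixels. The function name `end_point_error`, the `max_disp` threshold of 192, and the validity rule `0 < gt < max_disp` are assumptions made for illustration; they are not taken from the Guard-Net paper or its repository.

```python
from typing import Sequence


def end_point_error(
    pred: Sequence[Sequence[float]],
    gt: Sequence[Sequence[float]],
    max_disp: float = 192.0,  # assumed validity threshold, not from the paper
) -> float:
    """Mean absolute disparity error over valid ground-truth pixels."""
    total = 0.0
    count = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            # Assumption: a pixel is valid when its ground-truth disparity
            # is positive and below max_disp.
            if 0.0 < g < max_disp:
                total += abs(p - g)
                count += 1
    if count == 0:
        raise ValueError("no valid ground-truth pixels")
    return total / count


# Toy 2x2 disparity maps; the pixel with ground truth 0.0 is treated as invalid.
pred = [[10.0, 20.5], [31.0, 5.0]]
gt = [[10.5, 20.0], [30.0, 0.0]]
epe = end_point_error(pred, gt)
print(f"EPE = {epe:.4f} px")
```

Under this definition, an EPE of 0.47 means the predicted disparity is off by slightly less than half a pixel on average across the valid pixels.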


Pages: 10260-10273

Page count: 14

