Real-Time Stereo Matching Network Based on 3D Channel and Disparity Attention for Edge Devices Toward Autonomous Driving

被引:2
作者
Liang, Bifa [1 ]
Yang, Hong [2 ]
Huang, Jinhao [1 ]
Liu, Cheng [1 ]
Yang, Ru [2 ]
机构
[1] Guangzhou Univ, Sch Elect & Commun Engn, Guangzhou 510006, Peoples R China
[2] Guangzhou Univ, Sch Mech & Elect Engn, Guangzhou 510006, Peoples R China
关键词
Real-time stereo matching; autonomous driving; edge device; group-wise L1 distance & group-wise correlation; attention-based 3D cost aggregation;
D O I
10.1109/ACCESS.2023.3297052
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stereo matching is an important component technology that constitutes the 3D perception capability of autonomous vehicles. On resource-constrained edge devices, it is very important to compute in real-time with very low time. However, most stereo matching networks focus on generating disparity maps on high-end GPUs, which do not meet the real-time requirements on edge devices. To solve this problem, a new stereo matching network is proposed in this paper to achieve real-time stereo matching on edge devices. The proposed network greatly improves the inference speed by constructing a low-resolution feature extractor, and by using multi-stage residual methods for stereo matching. In particular, we propose a method that combines the group-wise L1 distance & the group-wise correlation cost volume and an effective attention-based 3D cost aggregation method. Our network achieves a good balance between speed and accuracy on the KITTI 2012 and KITTI 2015 datasets. The proposed network achieves 2.77% and 3.44% accuracy (D1-all) on KITTI 2012 and KITTI 2015, respectively. With TensorRT, the proposed network achieves 31.8 FPS and outperforms the real-time results of most state-of-the-art networks on NVIDIA Jetson Nano edge devices.
引用
收藏
页码:76781 / 76792
页数:12
相关论文
共 35 条
  • [31] DrivingStereo: A Large-Scale Dataset for Stereo Matching in Autonomous Driving Scenarios
    Yang, Guorun
    Song, Xiao
    Huang, Chaoqin
    Deng, Zhidong
    Shi, Jianping
    Zhou, Bolei
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 899 - 908
  • [32] SegStereo: Exploiting Semantic Information for Disparity Estimation
    Yang, Guorun
    Zhao, Hengshuang
    Shi, Jianping
    Deng, Zhidong
    Jia, Jiaya
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 660 - 676
  • [33] A Stereo Attention Module for Stereo Image Super-Resolution
    Ying, Xinyi
    Wang, Yingqian
    Wang, Longguang
    Sheng, Weidong
    An, Wei
    Guo, Yulan
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 496 - 500
  • [34] Zbontar J, 2016, J MACH LEARN RES, V17
  • [35] GA-Net: Guided Aggregation Net for End-to-end Stereo Matching
    Zhang, Feihu
    Prisacariu, Victor
    Yang, Ruigang
    Torr, Philip H. S.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 185 - 194