Coarse-to-Fine Stereo Matching Network Based on Multi-Scale Structural Information Filtrating

被引:0
|
作者
Bi, Yuanwei [1 ]
Li, Chuanbiao [1 ]
Zheng, Qiang [1 ]
Wang, Guohui [1 ]
Xu, Shidong [1 ]
Wang, Weiyuan [1 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264003, Peoples R China
基金
中国国家自然科学基金;
关键词
INDEX TERMS Stereo matching; convolutional neural network; structural information filtrating; contextual information; ill-posed regions;
D O I
10.1109/ACCESS.2023.3294441
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stereo vision measurement is widely applied in tasks such as autonomous driving and 3D scene reconstruction. Accurately obtaining the disparity of stereo images relies on effective stereo matching algorithms. Compared with the traditional algorithm, the stereo matching algorithm based on convolutional neural networks (CNNs) demonstrates higher accuracy. In this paper, we propose Cs-Net, a coarse-to-fine stereo matching framework that incorporates structural information filtering, aiming to obtain accurate disparity maps. The proposed framework specifically addresses the challenge of accurate disparity estimation, and improves stereo matching in ill-posed regions, such as texture-less and reflective surfaces. To effectively tackle this challenge, the proposed framework incorporates several key modules. First, a contextual attention feature extraction module is introduced, which plays a crucial role in obtaining context information for ill-posed region. Second, a structural attention weight generation module is designed to alleviate the stereo matching errors caused by lack of structural information, and the structure boundary generated by the proposed module is proved to be related to stereo matching errors. Furthermore, a two-stage cost aggregation module is used to regularize the initial cost volume and effectively aggregate the depth information to alleviate matching errors. In the ablation experiments studies, compared to baseline algorithm (GwcNet), Cs-Net can improve D3 and EPE metrics by 14.4% and 0.16 px on the KITTI2015 validation dataset, respectively. Additionally, in the reflective regions of the KITTI2012 benchmark, compared to baseline algorithm, the D3 and D5 metrics of Cs-Net reduced by 15.3% and 20.1%. Additionally, on the DriveStereo dataset, Cs-Net exhibited significant reductions in the D3 and EPE metrics compared to the baseline algorithm, achieving a decrease of 23.5% and 0.09 px, respectively.
引用
收藏
页码:83692 / 83702
页数:11
相关论文
共 50 条
  • [41] IMPROVEMENT OF STEREO MATCHING OF AERIAL PHOTOGRAPHS USING COARSE-TO-FINE AREA CORRELATION.
    Mori, Chuji
    Hattori, Susumu
    Uchida, Osamu
    Doboku Gakkai Rombun-Hokokushu/Proceedings of the Japan Society of Civil Engineers, 1985, (359 /IV-3): : 61 - 70
  • [42] Cascaded Multi-scale and Multi-dimension Convolutional Neural Network for Stereo Matching
    Lu, Haihua
    Xu, Hai
    Zhang, Li
    Ma, Yanbo
    Zhao, Yong
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [43] Unsupervised Change Detection from Remotely Sensed Images Based on Multi-Scale Visual Saliency Coarse-to-Fine Fusion
    He, Pengfei
    Zhao, Xiangwei
    Shi, Yuli
    Cai, Liping
    REMOTE SENSING, 2021, 13 (04) : 1 - 18
  • [44] MOTION STEREO METHOD BASED ON COARSE-TO-FINE CONTROL STRATEGY.
    Xu, Gang
    Tsuji, Saburo
    Asada, Minoru
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 1987, PAMI-9 (02) : 332 - 336
  • [45] Multi-scale inputs and context-aware aggregation network for stereo matching
    Shi, Liqing
    Xiong, Taiping
    Cui, Gengshen
    Pan, Minghua
    Cheng, Nuo
    Wu, Xiangjie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 75171 - 75194
  • [46] Lightweight multi-scale convolutional neural network for real time stereo matching
    Xue, Yanbing
    Zhang, Doudou
    Li, Leida
    Li, Shiyin
    Wang, Yuxin
    IMAGE AND VISION COMPUTING, 2022, 124
  • [47] Coarse-to-fine airway segmentation using multi information fusion network and CNN-based region growing
    Guo, Jinquan
    Fu, Rongda
    Pan, Lin
    Zheng, Shaohua
    Huang, Liqin
    Zheng, Bin
    He, Bingwei
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 215
  • [48] Coarse-to-Fine Multi-camera Network Topology Estimation
    Xing, Chang
    Bai, Sichen
    Zhou, Yi
    Zhou, Zhong
    Wu, Wei
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 981 - 990
  • [49] MULTI-SCALE CONTEXT-GUIDED LUMBAR SPINE DISEASE IDENTIFICATION WITH COARSE-TO-FINE LOCALIZATION AND CLASSIFICATION
    Chen, Zifan
    Zhao, Jie
    Yu, Hao
    Zhang, Yue
    Zhang, Li
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [50] ClusterGNN: Cluster-based Coarse-to-Fine Graph Neural Network for Efficient Feature Matching
    Shi, Yan
    Cai, Jun-Xiong
    Shavit, Yoli
    Mu, Tai-Jiang
    Feng, Wensen
    Zhang, Kai
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12507 - 12516