Coarse-to-Fine Stereo Matching Network Based on Multi-Scale Structural Information Filtrating

被引:0
|
作者
Bi, Yuanwei [1 ]
Li, Chuanbiao [1 ]
Zheng, Qiang [1 ]
Wang, Guohui [1 ]
Xu, Shidong [1 ]
Wang, Weiyuan [1 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264003, Peoples R China
基金
中国国家自然科学基金;
关键词
INDEX TERMS Stereo matching; convolutional neural network; structural information filtrating; contextual information; ill-posed regions;
D O I
10.1109/ACCESS.2023.3294441
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stereo vision measurement is widely applied in tasks such as autonomous driving and 3D scene reconstruction. Accurately obtaining the disparity of stereo images relies on effective stereo matching algorithms. Compared with the traditional algorithm, the stereo matching algorithm based on convolutional neural networks (CNNs) demonstrates higher accuracy. In this paper, we propose Cs-Net, a coarse-to-fine stereo matching framework that incorporates structural information filtering, aiming to obtain accurate disparity maps. The proposed framework specifically addresses the challenge of accurate disparity estimation, and improves stereo matching in ill-posed regions, such as texture-less and reflective surfaces. To effectively tackle this challenge, the proposed framework incorporates several key modules. First, a contextual attention feature extraction module is introduced, which plays a crucial role in obtaining context information for ill-posed region. Second, a structural attention weight generation module is designed to alleviate the stereo matching errors caused by lack of structural information, and the structure boundary generated by the proposed module is proved to be related to stereo matching errors. Furthermore, a two-stage cost aggregation module is used to regularize the initial cost volume and effectively aggregate the depth information to alleviate matching errors. In the ablation experiments studies, compared to baseline algorithm (GwcNet), Cs-Net can improve D3 and EPE metrics by 14.4% and 0.16 px on the KITTI2015 validation dataset, respectively. Additionally, in the reflective regions of the KITTI2012 benchmark, compared to baseline algorithm, the D3 and D5 metrics of Cs-Net reduced by 15.3% and 20.1%. Additionally, on the DriveStereo dataset, Cs-Net exhibited significant reductions in the D3 and EPE metrics compared to the baseline algorithm, achieving a decrease of 23.5% and 0.09 px, respectively.
引用
收藏
页码:83692 / 83702
页数:11
相关论文
共 50 条
  • [1] A coarse-to-fine registration network based on affine transformation and multi-scale pyramid
    Li, Dongming
    Li, Yingjian
    Li, Jinxing
    Lu, Guangming
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [2] Multi-Scale Coarse-to-Fine Transformer for Frame Interpolation
    Li, Chen
    Song, Li
    Zou, Xueyi
    Guo, Jiaming
    Yan, Youliang
    Zhang, Wenjun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5201 - 5209
  • [3] Anisotropic stereo matching with multi-scale information
    Li Y.
    Wu M.
    Liu K.
    Yu W.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (09): : 2920 - 2928
  • [4] Adaptive Coarse-to-Fine Interactor for Multi-Scale Object Detection
    Li, Zekun
    Liu, Yufan
    Li, Bing
    Hu, Weiming
    Zhou, Xue
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] Detail Preserving Coarse-to-Fine Matching for Stereo Matching and Optical Flow
    Deng, Yong
    Xiao, Jimin
    Zhou, Steven Zhiying
    Feng, Jiashi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5835 - 5847
  • [6] Coarse-to-fine multi-scale attention-guided network for multi-exposure image fusion
    Hao Zhao
    Jingrun Zheng
    Xiaoke Shang
    Wei Zhong
    Jinyuan Liu
    The Visual Computer, 2024, 40 : 1697 - 1710
  • [7] Coarse-to-fine multi-scale attention-guided network for multi-exposure image fusion
    Zhao, Hao
    Zheng, Jingrun
    Shang, Xiaoke
    Zhong, Wei
    Liu, Jinyuan
    VISUAL COMPUTER, 2024, 40 (03): : 1697 - 1710
  • [8] Coarse-to-Fine Point Cloud Shape Complementation Based on Multi-Scale Feature Fusion
    Zhang D.
    Wang Y.
    Tan X.
    Wu Y.
    Chen Y.
    He F.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (04): : 523 - 532
  • [9] A coarse-to-fine multi-scale feature hybrid low-dose CT denoising network
    Han, Zefang
    Hong, Shangguan
    Xiong, Zhang
    Cui, Xueying
    Yue, Wang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 118
  • [10] Multi-scale Coarse-to-Fine Segmentation for Screening Pancreatic Ductal Adenocarcinoma
    Zhu, Zhuotun
    Xia, Yingda
    Xie, Lingxi
    Fishman, Elliot K.
    Yuille, Alan L.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT VI, 2019, 11769 : 3 - 12