Coarse-to-Fine Stereo Matching Network Based on Multi-Scale Structural Information Filtrating

Cited by: 0

Authors:

Bi, Yuanwei [1]

Li, Chuanbiao [1]

Zheng, Qiang [1]

Wang, Guohui [1]

Xu, Shidong [1]

Wang, Weiyuan [1]

Affiliation:

[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264003, Peoples R China

Source:

IEEE Access | 2023 / Vol. 11

Funding:

National Natural Science Foundation of China

Keywords:

Stereo matching; convolutional neural network; structural information filtrating; contextual information; ill-posed regions

DOI:

10.1109/ACCESS.2023.3294441

Chinese Library Classification:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Stereo vision measurement is widely applied in tasks such as autonomous driving and 3D scene reconstruction. Accurately obtaining the disparity of stereo images relies on effective stereo matching algorithms. Compared with traditional algorithms, stereo matching algorithms based on convolutional neural networks (CNNs) achieve higher accuracy. In this paper, we propose Cs-Net, a coarse-to-fine stereo matching framework that incorporates structural information filtering to obtain accurate disparity maps. The framework targets disparity estimation in ill-posed regions, such as texture-less and reflective surfaces, through three key modules. First, a contextual attention feature extraction module obtains context information for ill-posed regions. Second, a structural attention weight generation module alleviates the stereo matching errors caused by a lack of structural information; the structure boundaries generated by this module are shown to be related to stereo matching errors. Third, a two-stage cost aggregation module regularizes the initial cost volume and aggregates depth information to alleviate matching errors. In ablation studies on the KITTI2015 validation dataset, Cs-Net improves the D3 and EPE metrics by 14.4% and 0.16 px, respectively, over the baseline algorithm (GwcNet). In the reflective regions of the KITTI2012 benchmark, Cs-Net reduces the D3 and D5 metrics by 15.3% and 20.1%, respectively, relative to the baseline. On the DriveStereo dataset, Cs-Net reduces the D3 and EPE metrics by 23.5% and 0.09 px, respectively, compared with the baseline.
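The abstract reports results as EPE and D3 without defining them. The sketch below shows one common reading, which is an assumption and not taken from the paper: EPE is the mean absolute disparity error in pixels, and D3 is the share of valid pixels whose error exceeds 3 px. The function name `disparity_metrics`, the `threshold_px` parameter, and the convention that a ground-truth value of 0 marks a missing measurement are all hypothetical choices made for illustration.

```python
import numpy as np


def disparity_metrics(pred, gt, threshold_px=3.0):
    """Return (EPE in px, fraction of valid pixels with error > threshold_px).

    Assumed definitions, not confirmed by the paper.
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    # Assumption: a ground-truth disparity of 0 means "no measurement".
    valid = gt > 0
    if not valid.any():
        raise ValueError("no valid ground-truth pixels")
    # Absolute disparity error over valid pixels only.
    err = np.abs(pred[valid] - gt[valid])
    epe = float(err.mean())
    outlier_rate = float((err > threshold_px).mean())
    return epe, outlier_rate


# Toy 2x2 disparity maps; the bottom-left pixel has no ground truth.
gt = np.array([[10.0, 20.0], [0.0, 40.0]])
pred = np.array([[10.5, 24.0], [7.0, 40.5]])
epe, d3 = disparity_metrics(pred, gt)
print(f"EPE = {epe:.3f} px, D3 = {100 * d3:.1f}%")
```

Passing `threshold_px=5.0` gives the analogous D5 figure under the same assumed definition.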

Pages: 83692-83702

Page count: 11
