CGCANET: CONTEXT-GUIDED COST AGGREGATION NETWORK FOR ROBUST STEREO MATCHING

被引：1

作者：

Sun, Wenmei ^{[1
]}

Zheng, Yuan ^{[1
]}

机构：

[1] Inner Mongolia Univ, Coll Comp Sci, Hohhot, Peoples R China

来源：

COMPUTING AND INFORMATICS | 2024年 / 43卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Stereo matching; cost computation; cost aggregation; disparity refine- ment;

D O I：

10.31577/cai_2024_2_505

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Stereo matching methods based on Convolutional Neural Network (CNN) have achieved a significant progress in recent years. However, they still cannot work well on generalization performance across a variety of datasets due to their poor robustness. In view of this, we aim to enhance the robustness in three main steps of stereo matching, namely cost computation, cost aggregation, and disparity refinement. For cost computation, we propose an atrous pyramid grouping convolution (APGC) module, which combines local context information with multi -scale features generated from CNN backbone, aiming to obtain a more discriminative feature representation. For cost aggregation, we provide a multi -scale cost aggregation (MSCA) module, which sufficiently and effectively fuses multiple cost volumes at three different scales into the 3D hourglass networks to improve initial disparity estimation. In addition, we present a disparity refinement (DR) module that employs the color guidance of left input image and several convolutional residual blocks to obtain a more accurate disparity estimation. With such three modules, we propose an end -to -end context -guided cost aggregation network (CGCANet) for robust stereo matching. To evaluate the performance of the proposed modules and CGCANet, we conduct comprehensive experiments on the challenging SceneFlow, KITTI 2015 and KITTI 2012 datasets, with a consistent and competitive improvement over the existing stereo matching methods.

引用

页码：505 / 528

页数：24

共 40 条

[1] Correlate-and-Excite: Real-Time Stereo Matching via Guided Cost Volume Excitation [J].

Bangunharcana, Antyanta ;

Cho, Jae Won ;

Lee, Seokju ;

Kweon, In So ;

Kim, Kyung-Soo ;

Kim, Soohyun .

2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :3542-3548

[2] Pyramid Stereo Matching Network [J].

Chang, Jia-Ren ;

Chen, Yong-Sheng .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5410-5418

[3]

Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709

[4] COST AFFINITY LEARNING NETWORK FOR STEREO MATCHING [J].

Chen, Shenglun ;

Li, Baopu ;

Wang, Wei ;

Zhang, Hong ;

Li, Haojie ;

Wang, Zhihui .

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :2120-2124

[5] PGNet: Panoptic parsing guided deep stereo matching [J].

Chen, Shuya ;

Xiang, Zhiyu ;

Qiao, Chengyu ;

Chen, Yiman ;

Bai, Tingming .

NEUROCOMPUTING, 2021, 463 :609-622

[6] SGNet: Semantics Guided Deep Stereo Matching [J].

Chen, Shuya ;

Xiang, Zhiyu ;

Qiao, Chengyu ;

Chen, Yiman ;

Bai, Tingming .

COMPUTER VISION - ACCV 2020, PT I, 2021, 12622 :106-122

[7] Learning Depth with Convolutional Spatial Propagation Network [J].

Cheng, Xinjing ;

Wang, Peng ;

Yang, Ruigang .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (10) :2361-2379

[8]

Du Xianzhi, 2019, arXiv

[9]

Duta IC, 2020, ARXIV

[10]

Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074

← 1 2 3 4 →