Cost Volume Aggregation in Stereo Matching Revisited: A Disparity Classification Perspective

被引:1
作者
Wang, Yun [1 ,2 ]
Wang, Longguang [3 ]
Li, Kunhong [4 ]
Zhang, Yongjian [4 ]
Wu, Dapeng Oliver [5 ]
Guo, Yulan [4 ]
机构
[1] Sun Yat Sen Univ SYSU, Sch Elect & Commun Engn, Shenzhen Campus, Shenzhen 518107, Peoples R China
[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Aviat Univ Air Force, Coll Elect Sci & Technol, Changchun 130022, Peoples R China
[4] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen Campus, Shenzhen 518107, Peoples R China
[5] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Stereo matching; depth estimation; disparity classification; cost volume; NETWORK; DEPTH;
D O I
10.1109/TIP.2024.3484251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cost aggregation plays a critical role in existing stereo matching methods. In this paper, we revisit cost aggregation in stereo matching from disparity classification and propose a generic yet efficient Disparity Context Aggregation (DCA) module to improve the performance of CNN-based methods. Our approach is based on an insight that a coarse disparity class prior is beneficial to disparity regression. To obtain such a prior, we first classify pixels in an image into several disparity classes and treat pixels within the same class as homogeneous regions. We then generate homogeneous region representations and incorporate these representations into the cost volume to suppress irrelevant information while enhancing the matching ability for cost aggregation. With the help of homogeneous region representations, efficient and informative cost aggregation can be achieved with only a shallow 3D CNN. Our DCA module is fully-differentiable and well-compatible with different network architectures, which can be seamlessly plugged into existing networks to improve performance with small additional overheads. It is demonstrated that our DCA module can effectively exploit disparity class priors to improve the performance of cost aggregation. Based on our DCA, we design a highly accurate network named DCANet, which achieves state-of-the-art performance on several benchmarks.
引用
收藏
页码:6425 / 6438
页数:14
相关论文
共 62 条
[1]   LocalBins: Improving Depth Estimation by Learning Local Distributions [J].
Bhat, Shariq Farooq ;
Alhashim, Ibraheem ;
Wonka, Peter .
COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 :480-496
[2]   AdaBins: Depth Estimation Using Adaptive Bins [J].
Bhat, Shariq Farooq ;
Alhashim, Ibraheem ;
Wonka, Peter .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4008-4017
[3]   Monocular Depth Estimation With Augmented Ordinal Depth Relationships [J].
Cao, Yuanzhouhan ;
Zhao, Tianqi ;
Xian, Ke ;
Shen, Chunhua ;
Cao, Zhiguo ;
Xu, Shugong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (08) :2674-2682
[4]   Pyramid Stereo Matching Network [J].
Chang, Jia-Ren ;
Chen, Yong-Sheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5410-5418
[5]   On the Over-Smoothing Problem of CNN Based Disparity Estimation [J].
Chen, Chuangrong ;
Chen, Xiaozhi ;
Cheng, Hui .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8996-9004
[6]   Two-Branch Deconvolutional Network With Application in Stereo Matching [J].
Cheng, Chunbo ;
Li, Hong ;
Zhang, Liming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 (327-340) :327-340
[7]   Learning Depth with Convolutional Spatial Propagation Network [J].
Cheng, Xinjing ;
Wang, Peng ;
Yang, Ruigang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (10) :2361-2379
[8]   Detail Preserving Coarse-to-Fine Matching for Stereo Matching and Optical Flow [J].
Deng, Yong ;
Xiao, Jimin ;
Zhou, Steven Zhiying ;
Feng, Jiashi .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :5835-5847
[9]   Confusing Image Quality Assessment: Toward Better Augmented Reality Experience [J].
Duan, Huiyu ;
Min, Xiongkuo ;
Zhu, Yucheng ;
Zhai, Guangtao ;
Yang, Xiaokang ;
Le Callet, Patrick .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :7206-7221
[10]   Road Surface 3D Reconstruction Based on Dense Subpixel Disparity Map Estimation [J].
Fan, Rui ;
Ai, Xiao ;
Dahnoun, Naim .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (06) :3025-3035