Accurate and Efficient Stereo Matching via Attention Concatenation Volume

被引:21
作者
Xu, Gangwei [1 ]
Wang, Yun [1 ]
Cheng, Junda [1 ]
Tang, Jinhui [2 ]
Yang, Xin [1 ]
机构
[1] Huazhong Univ Sci & Technol, Dept Elect Informat & Commun, Wuhan 430074, Peoples R China
[2] Nanjing Univ Sci & Technol, Nanjing 210095, Peoples R China
基金
中国国家自然科学基金;
关键词
Costs; Correlation; Three-dimensional displays; Volume measurement; Real-time systems; Solid modeling; Aggregates; Stereo matching; cost volume construction; attention concatenation volume; attention filtering; PROPAGATION;
D O I
10.1109/TPAMI.2023.3335480
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stereo matching is a fundamental building block for many vision and robotics applications. An informative and concise cost volume representation is vital for stereo matching of high accuracy and efficiency. In this article, we present a novel cost volume construction method, named attention concatenation volume (ACV), which generates attention weights from correlation clues to suppress redundant information and enhance matching-related information in the concatenation volume. The ACV can be seamlessly embedded into most stereo matching networks, the resulting networks can use a more lightweight aggregation network and meanwhile achieve higher accuracy. We further design a fast version of ACV to enable real-time performance, named Fast-ACV, which generates high likelihood disparity hypotheses and the corresponding attention weights from low-resolution correlation clues to significantly reduce computational and memory cost and meanwhile maintain a satisfactory accuracy. Furthermore, we design a highly accurate network ACVNet and a real-time network Fast-ACVNet based on our ACV and Fast-ACV respectively, which achieve state-of-the-art performance on several benchmarks.
引用
收藏
页码:2461 / 2474
页数:14
相关论文
共 37 条
[1]   Correlate-and-Excite: Real-Time Stereo Matching via Guided Cost Volume Excitation [J].
Bangunharcana, Antyanta ;
Cho, Jae Won ;
Lee, Seokju ;
Kweon, In So ;
Kim, Kyung-Soo ;
Kim, Soohyun .
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :3542-3548
[2]   PMBP: PatchMatch Belief Propagation for Correspondence Field Estimation [J].
Besse, Frederic ;
Rother, Carsten ;
Fitzgibbon, Andrew ;
Kautz, Jan .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 110 (01) :2-13
[3]   Pyramid Stereo Matching Network [J].
Chang, Jia-Ren ;
Chen, Yong-Sheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5410-5418
[4]   Coatrsnet: Fully Exploiting Convolution and Attention for Stereo Matching by Region Separation [J].
Cheng, Junda ;
Xu, Gangwei ;
Guo, Peng ;
Yang, Xin .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (01) :56-73
[5]   Deep Stereo using Adaptive Thin Volume Representation with Uncertainty Awareness [J].
Cheng, Shuo ;
Xu, Zexiang ;
Zhu, Shilin ;
Li, Zhuwen ;
Li, Li Erran ;
Ramamoorthi, Ravi ;
Su, Hao .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2521-2531
[6]   Learning Depth with Convolutional Spatial Propagation Network [J].
Cheng, Xinjing ;
Wang, Peng ;
Yang, Ruigang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (10) :2361-2379
[7]  
Cheng Xuelian, Hierarchical Neural Architecture Search for Deep Stereo Matching
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]   DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch [J].
Duggal, Shivam ;
Wang, Shenlong ;
Ma, Wei-Chiu ;
Hu, Rui ;
Urtasun, Raquel .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4383-4392
[10]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074