3D Convolution Channel Compression for Stereo Matching

Cited by: 0
Authors
Wang, Tengfei [1]
Lu, Yang [1,2]
Zhang, Zhou [1]
Wei, Xing [1,3]
Wei, Zhen [1,3]
Affiliations
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
[2] Anhui Mine IOT & Secur Monitoring Technol Key Lab, Hefei 230088, Peoples R China
[3] Hefei Univ Technol, Intelligent Mfg Inst, Hefei 230009, Peoples R China
Source
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024 | 2024 / Vol. 14865
Keywords
Stereo Matching; Disparity Estimation; Depth Estimation; Model Compression; Cost Aggregation; SCENE FLOW
DOI
10.1007/978-981-97-5591-2_5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, many deep models for stereo matching have employed 3D convolutions for cost aggregation to achieve better performance. However, this approach requires substantial computational and memory resources, which limits the deployment of such models on edge devices. In this paper, we analyze the challenges of channel compression in stereo matching models and adopt a straightforward and efficient compression method. Our method focuses on the channel compression of the cost aggregation module, enabling the model to achieve acceleration on existing hardware and significantly reducing computational cost. We set a hyperparameter that decides the compression rate and compress the channels of each layer in the cost aggregation module according to it. This method ensures consistency across irregular skip connections. We extensively test our method on GwcNet, PSMNet and CFNet on multiple datasets, achieving promising results. We believe that the proposed compression method allows stereo matching models to better balance computational cost and accuracy across various degrees of channel compression, making them more suitable for deployment on edge devices.
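The idea of compressing every layer by one shared hyperparameter can be illustrated with a minimal sketch. It is not the authors' implementation: the layer table below is a hypothetical hourglass-style aggregation block, `keep_ratio` is assumed to mean the fraction of channels kept (the abstract does not define the hyperparameter precisely), and the helper names are invented for illustration. Because the same rule is applied to every width, two tensors that had equal channel counts before compression still match afterwards, which is what a skip connection needs.

```python
# Hypothetical sketch: uniform channel compression of a 3D cost aggregation block.
# Layer widths and names are illustrative assumptions, not taken from the paper.

def compress_channels(channels: int, keep_ratio: float) -> int:
    """Scale a channel count by keep_ratio (assumed: fraction of channels kept)."""
    return max(1, round(channels * keep_ratio))


def conv3d_params(c_in: int, c_out: int, kernel: int = 3) -> int:
    """Weight count of a bias-free 3D convolution with a cubic kernel."""
    return c_in * c_out * kernel ** 3


# (name, in_channels, out_channels) of an assumed hourglass-style block.
LAYERS = [
    ("conv1", 32, 64),
    ("conv2", 64, 64),
    ("conv3", 64, 128),
    ("conv4", 128, 128),
    ("deconv5", 128, 64),  # output is added to conv2's output (skip connection)
    ("deconv6", 64, 32),   # output is added to the block input (skip connection)
]


def compress_block(layers, keep_ratio):
    """Apply the same compression rule to every layer's input and output width."""
    return [
        (name, compress_channels(c_in, keep_ratio), compress_channels(c_out, keep_ratio))
        for name, c_in, c_out in layers
    ]


def total_params(layers):
    """Sum the convolution weights over all layers of the block."""
    return sum(conv3d_params(c_in, c_out) for _, c_in, c_out in layers)


compressed = compress_block(LAYERS, keep_ratio=0.5)
widths = {name: (c_in, c_out) for name, c_in, c_out in compressed}
```

In this sketch the weight count of each 3D convolution depends on the product of its input and output widths, so halving every width cuts the block's weights to roughly a quarter. How the paper trades this reduction against accuracy is not stated in the abstract.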
Pages: 49-61
Number of pages: 13