An improved binocular stereo matching algorithm based on AANet

被引:0
作者
Ge Yang
Yuting Liao
机构
[1] Beijing Normal University,Advanced Institute of Natural Sciences, Key Laboratory of Intelligent Multimedia Technology
[2] Peking University,Engineering Lab On Intelligent Perception for Internet of Things (ELIP), Shenzhen Graduate School
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Three-dimensional displays; Stereo matching; Feature extraction; Neural networks; Adaptation models; Training;
D O I
暂无
中图分类号
学科分类号
摘要
Stereo matching is an important part of establishing stereo vision. Parallax information obtained by stereo matching directly affects the three-dimensional information of an object. End-to-end stereo matching algorithms can directly derive parallax maps from the designed network. However, at the same time, the structure of the network is complex, and a large number of parameters take up much memory. The network increases the device burden, which increases the time required to obtain the parallax map, lowering the efficiency of the network movement. Thus, an improved stereo matching algorithm based on AANet (adaptive aggregation network for efficient stereo matching) is proposed in this paper: AEDNet (adaptive end-to-end depth network for stereo matching). In the feature extraction module, the network simplifies the network structure by limiting the convolution kernel size to obtain the features with low abstraction. In cost aggregation, the intra-scale aggregation module is used to achieve adaptive cost aggregation through deformable convolution, and the inter-scale aggregation module uses the traditional cross-scale aggregation method to compensate for the missing global information to a certain extent. The network is verified the performance on the KITTI dataset. The results show that the algorithm can still complete stereo matching efficiently and accurately and obtain a better disparity map when the network is simplified. These provide preconditions for accurate three-dimensional reconstruction.
引用
收藏
页码:40987 / 41003
页数:16
相关论文
共 96 条
[1]  
Aleotti F(2020)Learning End-to-End Scene Flow by Distilling Single Tasks Knowledge[C] Nat Conf Artif Intell 34 10435-10442
[2]  
Poggi M(2022)Local Similarity-Based Spatial-Spectral Fusion Hyperspectral Image Classification With Deep CNN and Gabor Filtering[J] IEEE Trans Geosci Remote Sens 60 5514215-5514215
[3]  
Tosi F(2021)End-to-End Learning for Omnidirectional Stereo Matching With Uncertainty Prior[J] IEEE Trans Pattern Anal Mach Intell 43 3850-3862
[4]  
Bhatti Uzair Aslam(2022)Multi-Dimensional Cooperative Network for Stereo Matching[J] IEEE Robot Autom Lett 7 581-587
[5]  
Zhaoyuan Yu(2021)SRH-Net: Stacked Recurrent Hourglass Network for Stereo Matching[J] IEEE Robot Autom Lett 6 8005-8012
[6]  
Chanussot Jocelyn(2021)Adversarial Confidence Estimation Networks for Robust Stereo Matching[J] IEEE Trans Intell Transp Syst 22 6875-6889
[7]  
Zeeshan Zeeshan(2022)A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation[J] IEEE Trans Pattern Anal Mach Intell 44 1738-1764
[8]  
Yuan Linwang(2022)A High-Throughput Depth Estimation Processor for Accurate Semiglobal Stereo Matching Using Pipelined Inter-Pixel Aggregation[J] IEEE Trans Circuits Syst Vid Technol 32 411-422
[9]  
Luo Wen(2021)Stereo Matching Using Multi-Level Cost Volume and Multi-Scale Feature Constancy[J] IEEE Trans Pattern Anal Mach Intell 43 300-315
[10]  
Nawaz Saqib Ali(2022)Adaptive Cost Volume Fusion Network for Multi-Modal Depth Estimation in Changing Environments[J] IEEE Robot Autom Lett 7 5095-5102