A comparative study on deep-learning methods for dense image matching of multi-angle and multi-date remote sensing stereo-images

Cited by: 22
Authors
Albanwan, Hessah [1, 2]
Qin, Rongjun [1, 2, 3, 4]
Affiliations
[1] Ohio State Univ, Geospatial Data Analyt Lab, 218B Bolz Hall,2036 Neil Ave, Columbus, OH 43210 USA
[2] Ohio State Univ, Dept Civil Environm & Geodet Engn, Columbus, OH 43210 USA
[3] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA
[4] Ohio State Univ, Translat Data Analyt Inst, Columbus, OH 43210 USA
Keywords
convolutional neural network; deep learning; dense image matching; Geometry and Context Network; LEAStereo; Pyramid Stereo Matching Network; BUILDING CHANGE DETECTION; DIGITAL SURFACE MODEL; GENERATION; QUALITY
DOI
10.1111/phor.12430
Chinese Library Classification (CLC) number
P9 [Physical Geography]
Discipline classification code
0705; 070501
Abstract
Deep-learning (DL) stereomatching methods have gained great attention for remote sensing satellite datasets. However, most existing studies base their assessments on only one or a few stereo-images and lack a systematic evaluation of how robust DL methods are on satellite stereo-images with varying radiometric and geometric configurations. This paper evaluates four DL stereomatching methods on hundreds of multi-date, multi-site satellite stereopairs with varying geometric configurations, against the traditional, well-practised Census-semi-global matching (SGM), to comprehensively understand their accuracy, robustness, generalisation capabilities and practical potential. The DL methods include a learning-based cost metric using convolutional neural networks (MC-CNN) followed by SGM, and three end-to-end (E2E) learning models: Geometry and Context Network (GCNet), Pyramid Stereo Matching Network (PSMNet) and LEAStereo. Our experiments show that E2E algorithms can achieve the upper limits of geometric accuracy but may not generalise well to unseen data. The learning-based cost metric and Census-SGM are rather robust and consistently achieve acceptable results. All DL algorithms are robust to the geometric configurations of stereopairs and are less sensitive to them than Census-SGM, while learning-based cost metrics can generalise to satellite images when trained on different datasets (airborne or ground-view).
Pages: 385-409
Number of pages: 25