Efficient Discrimination and Localization of Multimodal Remote Sensing Images Using CNN-Based Prediction of Localization Uncertainty

被引:12
|
作者
Uss, Mykhail [1 ]
Vozel, Benoit [2 ]
Lukin, Vladimir [1 ]
Chehdi, Kacem [2 ]
机构
[1] Natl Aerosp Univ, Dept Informat Commun Technol, UA-61070 Kharkiv, Ukraine
[2] Univ Rennes 1, IETR UMR CNRS 6164, F-22305 Lannion, France
关键词
multimodal images; remote sensing; similarity measure; localization accuracy; localization uncertainty; regression uncertainty; deep learning; convolutional neural networks; two-channel CNN; REGISTRATION; SIMILARITY;
D O I
10.3390/rs12040703
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Detecting similarities between image patches and measuring their mutual displacement are important parts in the registration of multimodal remote sensing (RS) images. Deep learning approaches advance the discriminative power of learned similarity measures (SM). However, their ability to find the best spatial alignment of the compared patches is often ignored. We propose to unify the patch discrimination and localization problems by assuming that the more accurately two patches can be aligned, the more similar they are. The uncertainty or confidence in the localization of a patch pair serves as a similarity measure of these patches. We train a two-channel patch matching convolutional neural network (CNN), called DLSM, to solve a regression problem with uncertainty. This CNN inputs two multimodal patches, and outputs a prediction of the translation vector between the input patches as well as the uncertainty of this prediction in the form of an error covariance matrix of the translation vector. The proposed patch matching CNN predicts a normal two-dimensional distribution of the translation vector rather than a simple value of it. The determinant of the covariance matrix is used as a measure of uncertainty in the matching of patches and also as a measure of similarity between patches. For training, we used the Siamese architecture with three towers. During training, the input of two towers is the same pair of multimodal patches but shifted by a random translation; the last tower is fed by a pair of dissimilar patches. Experiments performed on a large base of real RS images show that the proposed DLSM has both a higher discriminative power and a more precise localization compared to existing hand-crafted SMs and SMs trained with conventional losses. Unlike existing SMs, DLSM correctly predicts translation error distribution ellipse for different modalities, noise level, isotropic, and anisotropic structures.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] FPGA Implementation for CNN-Based Optical Remote Sensing Object Detection
    Zhang, Ning
    Wei, Xin
    Chen, He
    Liu, Wenchao
    ELECTRONICS, 2021, 10 (03) : 1 - 24
  • [32] Integrating Coordinate Features in CNN-Based Remote Sensing Imagery Classification
    Zhang, Fan
    Yan, Minchao
    Hu, Chen
    Ni, Jun
    Zhou, Yongsheng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [33] A Configurable Accelerator for CNN-Based Remote Sensing Object Detection on FPGAs
    Shao, Yingzhao
    Shang, Jincheng
    Li, Yunsong
    Ding, Yueli
    Zhang, Mingming
    Ren, Ke
    Liu, Yang
    IET COMPUTERS AND DIGITAL TECHNIQUES, 2024, 2024
  • [34] The CNN-based Coronary Occlusion Site Localization with Effective Preprocessing Method
    Park, YeongHyeon
    Yun, Il Dong
    Kang, Si-Hyuck
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 15 (10) : 1549 - 1551
  • [35] CNN-BASED SPOKEN TERM DETECTION AND LOCALIZATION WITHOUT DYNAMIC PROGRAMMING
    Fuchs, Tzeviya Sylvia
    Segal, Yael
    Keshet, Joseph
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6853 - 6857
  • [36] CNN-Based Object Detection and Distance Prediction for Autonomous Driving Using Stereo Images
    Jin Gyu Song
    Joon Woong Lee
    International Journal of Automotive Technology, 2023, 24 : 773 - 786
  • [37] CNN-Based Object Detection and Distance Prediction for Autonomous Driving Using Stereo Images
    Song, Jin Gyu
    Lee, Joon Woong
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2023, 24 (03) : 773 - 786
  • [38] A novel CNN-based method for localization of temperature anomalies in RDTS system
    Wang, Honghui
    Zeng, Shangkun
    Wang, Sibo
    Wang, Yuhang
    OPTICS COMMUNICATIONS, 2024, 558
  • [39] CNN-BASED INDOOR OCCUPANT LOCALIZATION VIA ACTIVE SCENE ILLUMINATION
    Zhao, Jinyuan
    Frumkin, Natalia
    Ishwar, Prakash
    Konrad, Janusz
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2636 - 2640
  • [40] A CNN-based Method for Adaptive Landmark Selection in Remote Sensing Image
    Chen Yongzhan
    Yang Weidong
    Cao Yaoxin
    Liu Chenhua
    Yang Dong
    MIPPR 2019: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2020, 11429