Multimodal image matching: A scale-invariant algorithm and an open dataset

Cited by: 19
Authors
Li, Jiayuan [1]
Hu, Qingwu [1]
Zhang, Yongjun [1]
Affiliations
[1] Wuhan University, School of Remote Sensing and Information Engineering, Wuhan 430079, People's Republic of China
Keywords
Image matching; Feature descriptor; Dataset; SAR-optical; Multimodal images; Automatic registration; Framework; SAR; Fusion
DOI
10.1016/j.isprsjprs.2023.08.010
CLC number
P9 [Physical geography]
Subject classification code
0705; 070501
Abstract
Multimodal image matching is a core basis for information fusion, change detection, and image-based navigation. However, multimodal images may simultaneously suffer from severe nonlinear radiation distortion (NRD) and complex geometric differences, which pose great challenges to existing methods. Although deep learning-based methods have shown potential in image matching, they mainly focus on same-source images or on a single type of multimodal image pair, such as optical-synthetic aperture radar (SAR). One of the main obstacles is the lack of public data covering different types of multimodal images. In this paper, we make two major contributions to the multimodal image matching community. First, we collect six typical types of image pairs (optical-optical, optical-infrared, optical-SAR, optical-depth, optical-map, and nighttime) to construct a multimodal image dataset with a total of 1200 pairs. This dataset is diverse in image categories, feature classes, resolutions, geometric variations, etc. Second, we propose a scale and rotation invariant feature transform (SRIF) method, which achieves good matching performance without relying on data characteristics; this is one of its advantages over deep learning methods. SRIF obtains the scales of FAST keypoints by projecting them into a simple pyramid scale space, based on the finding that methods with and without a scale space perform similarly under small scale change factors. This strategy greatly reduces complexity compared with the traditional Gaussian scale space. SRIF also introduces a local intensity binary transform (LIBT) for SIFT-like feature description, which greatly enhances the structure information inside multimodal images. Extensive experiments on these 1200 image pairs show that SRIF outperforms current state-of-the-art methods, including RIFT, CoFSM, LNIFT, and MS-HLMO, by a large margin. Both the dataset and the SRIF code will be made publicly available at https://github.com/LJY-RS/SRIF.
Pages: 77-88
Page count: 12