GRiD: Guided Refinement for Detector-Free Multimodal Image Matching

被引:0
作者
Liu, Yuyan [1 ]
He, Wei [1 ]
Zhang, Hongyan [1 ,2 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430072, Peoples R China
[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Image matching; Transformers; Optical imaging; Detectors; Semantics; Image edge detection; Adaptive optics; Robustness; Remote sensing; detector-free; guided refinement; multimodal images; REGISTRATION; TRANSFORMER; MODEL;
D O I
10.1109/TIP.2024.3472491
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal image matching is essential in image stitching, image fusion, change detection, and land cover mapping. However, the severe nonlinear radiometric distortion (NRD) and geometric distortions in multimodal images severely limit the accuracy of multimodal image matching, posing significant challenges to existing methods. Additionally, detector-based methods are prone to feature point offset issues in regions with substantial modal differences, which also hinder the subsequent fine registration and fusion of images. To address these challenges, we propose a guided refinement for detector-free multimodal image matching (GRiD) method, which weakens feature point offset issues by establishing pixel-level correspondences and utilizes reference points to guide and correct matches affected by NRD and geometric distortions. Specifically, we first introduce a detector-free framework to alleviate the feature point offset problem by directly finding corresponding pixels between images. Subsequently, to tackle NRD and geometric distortion in multimodal images, we design a guided correction module that establishes robust reference points (RPs) to guide the search for corresponding pixels in regions with significant modality differences. Moreover, to enhance RPs reliability, we incorporate a phase congruency module during the RPs confirmation stage to concentrate RPs around image edge structures. Finally, we perform finer localization on highly correlated corresponding pixels to obtain the optimized matches. We conduct extensive experiments on four multimodal image datasets to validate the effectiveness of the proposed approach. Experimental results demonstrate that our method can achieve sufficient and robust matches across various modality images and effectively suppress the feature point offset problem.
引用
收藏
页码:5892 / 5906
页数:15
相关论文
共 50 条
  • [21] NCFT: Automatic Matching of Multimodal Image Based on Nonlinear Consistent Feature Transform
    Yu, Kun
    Zheng, Xiao
    Duan, Yucong
    Fang, Bin
    An, Pei
    Ma, Jie
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [22] K-Means Clustering Guided Generative Adversarial Networks for SAR-Optical Image Matching
    Du, Wen-Liang
    Zhou, Yong
    Zhao, Jiaqi
    Tian, Xiaolin
    IEEE ACCESS, 2020, 8 : 217554 - 217572
  • [23] Variational Methods for Multimodal Image Matching
    Gerardo Hermosillo
    Christophe Chefd'Hotel
    Olivier Faugeras
    International Journal of Computer Vision, 2002, 50 : 329 - 343
  • [24] Variational methods for multimodal image matching
    Hermosillo, G
    Chefd'Hotel, C
    Faugeras, O
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 50 (03) : 329 - 343
  • [25] Photovoltaic Image Registration Based on Feature Matching via Guided Spatial Consensus
    Song, Meiping
    Li, Lan
    Chen, Shuhan
    Dai, Sui
    Li, Fang
    IEEE JOURNAL OF PHOTOVOLTAICS, 2021, 11 (05): : 1118 - 1125
  • [26] Progressive Keypoint Localization and Refinement in Image Matching
    Bellavia, Fabio
    Morelli, Luca
    Colombo, Carlo
    Remondino, Fabio
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT II, 2024, 14366 : 322 - 334
  • [27] A Mutually Textual and Visual Refinement Network for Image-Text Matching
    Pang, Shanmin
    Zeng, Yueyang
    Zhao, Jiawei
    Xue, Jianru
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7555 - 7566
  • [28] Grid-Guided Sparse Laplacian Consensus for Robust Feature Matching
    Xia, Yifan
    Ma, Jiayi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1367 - 1381
  • [29] Attention-based multimodal image matching
    Moreshet, Aviad
    Keller, Yosi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 241
  • [30] Multimodal image matching: A scale-invariant algorithm and an open dataset
    Li, Jiayuan
    Hu, Qingwu
    Zhang, Yongjun
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 204 : 77 - 88