GRiD: Guided Refinement for Detector-Free Multimodal Image Matching

Cited by: 0

Authors:

Liu, Yuyan [1]

He, Wei [1]

Zhang, Hongyan [1,2]

Affiliations:

[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430072, Peoples R China

[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024, Vol. 33

Funding:

National Natural Science Foundation of China;

Keywords:

Feature extraction; Image matching; Transformers; Optical imaging; Detectors; Semantics; Image edge detection; Adaptive optics; Robustness; Remote sensing; detector-free; guided refinement; multimodal images; REGISTRATION; TRANSFORMER; MODEL;

DOI:

10.1109/TIP.2024.3472491

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multimodal image matching is essential in image stitching, image fusion, change detection, and land cover mapping. However, the severe nonlinear radiometric distortion (NRD) and geometric distortions in multimodal images severely limit the accuracy of multimodal image matching, posing significant challenges to existing methods. Additionally, detector-based methods are prone to feature point offset issues in regions with substantial modal differences, which also hinder the subsequent fine registration and fusion of images. To address these challenges, we propose a guided refinement for detector-free multimodal image matching (GRiD) method, which weakens feature point offset issues by establishing pixel-level correspondences and utilizes reference points to guide and correct matches affected by NRD and geometric distortions. Specifically, we first introduce a detector-free framework to alleviate the feature point offset problem by directly finding corresponding pixels between images. Subsequently, to tackle NRD and geometric distortion in multimodal images, we design a guided correction module that establishes robust reference points (RPs) to guide the search for corresponding pixels in regions with significant modality differences. Moreover, to enhance RPs reliability, we incorporate a phase congruency module during the RPs confirmation stage to concentrate RPs around image edge structures. Finally, we perform finer localization on highly correlated corresponding pixels to obtain the optimized matches. We conduct extensive experiments on four multimodal image datasets to validate the effectiveness of the proposed approach. Experimental results demonstrate that our method can achieve sufficient and robust matches across various modality images and effectively suppress the feature point offset problem.

Pages: 5892-5906

Page count: 15
