A Multi-Level Cross-Attention Image Registration Method for Visible and Infrared Small Unmanned Aerial Vehicle Targets via Image Style Transfer

被引:3
作者
Jiang, Wen [1 ]
Pan, Hanxin [1 ]
Wang, Yanping [1 ]
Li, Yang [1 ]
Lin, Yun [1 ]
Bi, Fukun [1 ]
机构
[1] North China Univ Technol, Sch Informat Sci & Technol, Radar Monitoring Technol Lab, Beijing 100144, Peoples R China
基金
中国国家自然科学基金;
关键词
image registration; small UAV targets; cross-modality image; image fusion; deep learning; TRANSLATION;
D O I
10.3390/rs16162880
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Small UAV target detection and tracking based on cross-modality image fusion have gained widespread attention. Due to the limited feature information available from small UAVs in images, where they occupy a minimal number of pixels, the precision required for detection and tracking algorithms is particularly high in complex backgrounds. Image fusion techniques can enrich the detailed information for small UAVs, showing significant advantages under extreme lighting conditions. Image registration is a fundamental step preceding image fusion. It is essential to achieve accurate image alignment before proceeding with image fusion to prevent severe ghosting and artifacts. This paper specifically focused on the alignment of small UAV targets within infrared and visible light imagery. To address this issue, this paper proposed a cross-modality image registration network based on deep learning, which includes a structure preservation and style transformation network (SPSTN) and a multi-level cross-attention residual registration network (MCARN). Firstly, the SPSTN is employed for modality transformation, transferring the cross-modality task into a single-modality task to reduce the information discrepancy between modalities. Then, the MCARN is utilized for single-modality image registration, capable of deeply extracting and fusing features from pseudo infrared and visible images to achieve efficient registration. To validate the effectiveness of the proposed method, comprehensive experimental evaluations were conducted on the Anti-UAV dataset. The extensive evaluation results validate the superiority and universality of the cross-modality image registration framework proposed in this paper, which plays a crucial role in subsequent image fusion tasks for more effective target detection.
引用
收藏
页数:19
相关论文
共 29 条
[1]   Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation [J].
Arar, Moab ;
Ginger, Yiftach ;
Danon, Dov ;
Bermano, Amit H. ;
Cohen-Or, Daniel .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13407-13416
[2]   An Unsupervised Learning Model for Deformable Medical Image Registration [J].
Balakrishnan, Guha ;
Zhao, Amy ;
Sabuncu, Mert R. ;
Guttag, John ;
Dalca, Adrian V. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9252-9260
[3]   Cross-Modality Image Matching Network With Modality-Invariant Feature Representation for Airborne-Ground Thermal Infrared and Visible Datasets [J].
Cui, Song ;
Ma, Ailong ;
Wan, Yuting ;
Zhong, Yanfei ;
Luo, Bin ;
Xu, Miaozhong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4]   ReDFeat: Recoupling Detection and Description for Multimodal Feature Learning [J].
Deng, Yuxin ;
Ma, Jiayi .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 :591-602
[5]   Generative Adversarial Networks [J].
Goodfellow, Ian ;
Pouget-Abadie, Jean ;
Mirza, Mehdi ;
Xu, Bing ;
Warde-Farley, David ;
Ozair, Sherjil ;
Courville, Aaron ;
Bengio, Yoshua .
COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144
[6]   Deep learning in medical image registration: a survey [J].
Haskins, Grant ;
Kruger, Uwe ;
Yan, Pingkun .
MACHINE VISION AND APPLICATIONS, 2020, 31 (01)
[7]   Weakly-supervised convolutional neural networks for multimodal image registration [J].
Hu, Yipeng ;
Modat, Marc ;
Gibson, Eli ;
Li, Wenqi ;
Ghavamia, Nooshin ;
Bonmati, Ester ;
Wang, Guotai ;
Bandula, Steven ;
Moore, Caroline M. ;
Emberton, Mark ;
Ourselin, Sebastien ;
Noble, J. Alison ;
Barratt, Dean C. ;
Vercauteren, Tom .
MEDICAL IMAGE ANALYSIS, 2018, 49 :1-13
[8]   Image-to-Image Translation with Conditional Adversarial Networks [J].
Isola, Phillip ;
Zhu, Jun-Yan ;
Zhou, Tinghui ;
Efros, Alexei A. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5967-5976
[9]  
Katharopoulos A, 2020, PR MACH LEARN RES, V119
[10]   RIFT: Multi-Modal Image Matching Based on Radiation-Variation Insensitive Feature Transform [J].
Li, Jiayuan ;
Hu, Qingwu ;
Ai, Mingyao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :3296-3310