Multi-Spectral Image Stitching via Spatial Graph Reasoning

被引:6
作者
Jiang, Zhiying [1 ]
Zhang, Zengxi [1 ]
Liu, Jinyuan [1 ]
Fan, Xin [1 ]
Liu, Risheng [1 ,2 ]
机构
[1] Dalian Univ Technol, Dalian, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
multi-spectral image stitching; infrared and visible images; graph neural network; image fusion; NETWORK; HOMOGRAPHY; ENSEMBLE; FEATURES; FUSION; MODEL;
D O I
10.1145/3581783.3612005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-spectral image stitching leverages the complementarity between infrared and visible images to generate a robust and reliable wide field-of-view (FOV) scene. The primary challenge of this task is to explore the relations between multi-spectral images for aligning and integrating multi-view scenes. Capitalizing on the strengths of Graph Convolutional Networks (GCNs) in modeling feature relationships, we propose a spatial graph reasoning based multi-spectral image stitching method that effectively distills the deformation and integration of multi-spectral images across different viewpoints. To accomplish this, we embed multi-scale complementary features from the same view position into a set of nodes. The correspondence across different views is learned through powerful dense feature embeddings, where both inter- and intra-correlations are developed to exploit cross-view matching and enhance inner feature disparity. By introducing long-range coherence along spatial and channel dimensions, the complementarity of pixel relations and channel interdependencies aids in the reconstruction of aligned multi-view features, generating informative and reliable wide FOV scenes. Moreover, we release a challenging dataset named ChaMS, comprising both real-world and synthetic sets with significant parallax, providing a new option for comprehensive evaluation. Extensive experiments demonstrate that our method surpasses the state-of-the-arts.
引用
收藏
页码:472 / 480
页数:9
相关论文
共 58 条
[11]  
Gao JH, 2011, PROC CVPR IEEE, P49, DOI 10.1109/CVPR.2011.5995433
[12]   Review on Panoramic Imaging and Its Applications in Scene Understanding [J].
Gao, Shaohua ;
Yang, Kailun ;
Shi, Hao ;
Wang, Kaiwei ;
Bai, Jian .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[13]   Scene Graph Generation with External Knowledge and Image Reconstruction [J].
Gu, Jiuxiang ;
Zhao, Handong ;
Lin, Zhe ;
Li, Sheng ;
Cai, Jianfei ;
Ling, Mingyang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1969-1978
[14]  
Heusel M, 2017, ADV NEUR IN, V30
[15]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[16]   ReCoNet: Recurrent Correction Network for Fast and Efficient Multi-modality Image Fusion [J].
Huang, Zhanbo ;
Liu, Jinyuan ;
Fan, Xin ;
Liu, Risheng ;
Zhong, Wei ;
Luo, Zhongxuan .
COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 :539-555
[17]   Leveraging Line-point Consistence to Preserve Structures forWide Parallax Image Stitching [J].
Jia, Qi ;
Li, ZhengJun ;
Fan, Xin ;
Zhao, Haotian ;
Teng, Shiyu ;
Ye, Xinchen ;
Latecki, Longin Jan .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12181-12190
[18]   Towards All Weather and Unobstructed Multi-Spectral Image Stitching: Algorithm and Benchmark [J].
Jiang, Zhiying ;
Zhang, Zengxi ;
Fan, Xin ;
Liu, Risheng .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :3783-3791
[19]   Bilevel modeling investigated generative adversarial framework for image restoration [J].
Jiang, Zhiying ;
Zhang, Zengxi ;
Yu, Yiyao ;
Liu, Risheng .
VISUAL COMPUTER, 2023, 39 (11) :5563-5575
[20]   Target Oriented Perceptual Adversarial Fusion Network for Underwater Image Enhancement [J].
Jiang, Zhiying ;
Li, Zhuoxiao ;
Yang, Shuzhou ;
Fan, Xin ;
Liu, Risheng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) :6584-6598