Multi-Spectral Image Stitching via Spatial Graph Reasoning

被引：6

作者：

Jiang, Zhiying ^{[1
]}

Zhang, Zengxi ^{[1
]}

Liu, Jinyuan ^{[1
]}

Fan, Xin ^{[1
]}

Liu, Risheng ^{[1
,2
]}

机构：

[1] Dalian Univ Technol, Dalian, Peoples R China

[2] Peng Cheng Lab, Shenzhen, Peoples R China

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

multi-spectral image stitching; infrared and visible images; graph neural network; image fusion; NETWORK; HOMOGRAPHY; ENSEMBLE; FEATURES; FUSION; MODEL;

D O I：

10.1145/3581783.3612005

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-spectral image stitching leverages the complementarity between infrared and visible images to generate a robust and reliable wide field-of-view (FOV) scene. The primary challenge of this task is to explore the relations between multi-spectral images for aligning and integrating multi-view scenes. Capitalizing on the strengths of Graph Convolutional Networks (GCNs) in modeling feature relationships, we propose a spatial graph reasoning based multi-spectral image stitching method that effectively distills the deformation and integration of multi-spectral images across different viewpoints. To accomplish this, we embed multi-scale complementary features from the same view position into a set of nodes. The correspondence across different views is learned through powerful dense feature embeddings, where both inter- and intra-correlations are developed to exploit cross-view matching and enhance inner feature disparity. By introducing long-range coherence along spatial and channel dimensions, the complementarity of pixel relations and channel interdependencies aids in the reconstruction of aligned multi-view features, generating informative and reliable wide FOV scenes. Moreover, we release a challenging dataset named ChaMS, comprising both real-world and synthetic sets with significant parallax, providing a new option for comprehensive evaluation. Extensive experiments demonstrate that our method surpasses the state-of-the-arts.

引用

页码：472 / 480

页数：9

共 58 条

[11]

Gao JH, 2011, PROC CVPR IEEE, P49, DOI 10.1109/CVPR.2011.5995433

[12] Review on Panoramic Imaging and Its Applications in Scene Understanding [J].

Gao, Shaohua ;

Yang, Kailun ;

Shi, Hao ;

Wang, Kaiwei ;

Bai, Jian .

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71

[13] Scene Graph Generation with External Knowledge and Image Reconstruction [J].

Gu, Jiuxiang ;

Zhao, Handong ;

Lin, Zhe ;

Li, Sheng ;

Cai, Jianfei ;

Ling, Mingyang .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1969-1978

[14]

Heusel M, 2017, ADV NEUR IN, V30

[15]

Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]

[16] ReCoNet: Recurrent Correction Network for Fast and Efficient Multi-modality Image Fusion [J].

Huang, Zhanbo ;

Liu, Jinyuan ;

Fan, Xin ;

Liu, Risheng ;

Zhong, Wei ;

Luo, Zhongxuan .

COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 :539-555

[17] Leveraging Line-point Consistence to Preserve Structures forWide Parallax Image Stitching [J].

Jia, Qi ;

Li, ZhengJun ;

Fan, Xin ;

Zhao, Haotian ;

Teng, Shiyu ;

Ye, Xinchen ;

Latecki, Longin Jan .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12181-12190

[18] Towards All Weather and Unobstructed Multi-Spectral Image Stitching: Algorithm and Benchmark [J].

Jiang, Zhiying ;

Zhang, Zengxi ;

Fan, Xin ;

Liu, Risheng .

PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :3783-3791

[19] Bilevel modeling investigated generative adversarial framework for image restoration [J].

Jiang, Zhiying ;

Zhang, Zengxi ;

Yu, Yiyao ;

Liu, Risheng .

VISUAL COMPUTER, 2023, 39 (11) :5563-5575

[20] Target Oriented Perceptual Adversarial Fusion Network for Underwater Image Enhancement [J].

Jiang, Zhiying ;

Li, Zhuoxiao ;

Yang, Shuzhou ;

Fan, Xin ;

Liu, Risheng .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) :6584-6598

← 1 2 3 4 5 6 →