FREQUENCY-SPATIAL DOMAIN INFORMATION FUSION NETWORK FOR PAN-SHARPENING

被引:0
作者
Zhao, Mengjiao [1 ]
Ma, Mengting [2 ]
Gao, Ao [1 ]
Zhang, Wei [1 ,3 ]
机构
[1] Zhejiang Univ, Sch Software Technol, Hangzhou, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[3] Zhejiang Univ, Innovat Ctr Yangtze River Delta, Jiaxing, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2024年
关键词
Pan-sharpening; remote sensing; IMAGE FUSION;
D O I
10.1109/ICIP51287.2024.10647420
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pan-sharpening aims to fuse panchromatic (PAN) images with low-resolution multi-spectral (LR-MS) images to generate high-resolution multi-spectral (HR-MS) images. Despite the impressive performance of existing learning-based methods, they are constrained by coarse fusion strategies in frequency or spatial domain. In this paper, we discover that PAN images can provide all the spatial textures required for HR-MS images, while spectral information must be provided jointly by PAN and LR-MS images. Inspired by this, we propose a noval frequency-spatial domain information fusion network for pan-sharpening, called FSDNet. Specifically, we design a Dual-Domain Information Processing Module (DDPM) to construct FSDNet. It consists of a Frequency Domain Feature Processing Block (FDB), a Spatial Domain Information Processing Block (SDB), and an Information Fusion Block (IFB). The FDB in the frequency domain uses the Adaptive Amplitude Fusion Block (AAFB) and convolution layers to finely modulate amplitude and phase components, exploring global information. The SDB uses cascaded residual blocks to capture and enhance local information in the spatial domain. The IFB based on invertible neural networks (INNs) introduces Multi-Scale Self-Attention Block (MSAB), achieves effective information fusion and reduce information loss. Extensive experiments on the QuickBird and GaoFen-2 datasets demonstrate the effectiveness and superiority of our method.
引用
收藏
页码:1718 / 1724
页数:7
相关论文
共 22 条
[1]   A variational model for P+XS image fusion [J].
Ballester, Coloma ;
Caselles, Vicent ;
Igual, Laura ;
Verdera, Joan ;
Rougé, Bernard .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2006, 69 (01) :43-58
[2]   Super-Resolution-Guided Progressive Pansharpening Based on a Deep Convolutional Neural Network [J].
Cai, Jiajun ;
Huang, Bo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06) :5206-5220
[3]  
CARPER WJ, 1990, PHOTOGRAMM ENG REM S, V56, P459
[4]  
CHAVEZ PS, 1989, PHOTOGRAMM ENG REM S, V55, P339
[5]  
Dinh L, 2017, Arxiv, DOI [arXiv:1605.08803, 10.48550/arXiv.1605.08803 1605.08803]
[6]  
Frigo M, 1998, INT CONF ACOUST SPEE, P1381, DOI 10.1109/ICASSP.1998.681704
[7]   COLOR ENHANCEMENT OF HIGHLY CORRELATED IMAGES .2. CHANNEL RATIO AND CHROMATICITY TRANSFORMATION TECHNIQUES [J].
GILLESPIE, AR ;
KAHLE, AB ;
WALKER, RE .
REMOTE SENSING OF ENVIRONMENT, 1987, 22 (03) :343-365
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]   Bicubic interpolation and extrapolation iteration method for high resolution digital holographic reconstruction [J].
Huang, Zhengzhong ;
Cao, Liangcai .
OPTICS AND LASERS IN ENGINEERING, 2020, 130
[10]  
Li MS, 2023, PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, P1071