Two-stream spatiotemporal image fusion network based on difference transformation

被引：0

作者：

Fang, Shuai ^{[1
,2
]}

Meng, Siyuan ^{[1
]}

Zhang, Jing ^{[1
]}

Cao, Yang ^{[3
,4
]}

机构：

[1] Hefei Univ Technol, Key Lab Knowledge Engn Big Data, Hefei, Peoples R China

[2] Anhui Prov Key Lab Ind Safety & Emergency Technol, Hefei, Peoples R China

[3] Univ Sci & Technol China, Dept Automat, Hefei, Peoples R China

[4] Univ Sci & Technol China, Inst Adv Technol, Hefei, Peoples R China

来源：

JOURNAL OF APPLIED REMOTE SENSING | 2022年 / 16卷 / 03期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

spatiotemporal image fusion; convolutional neural network; deep learning; remote sensing; REFLECTANCE FUSION; LANDSAT; MODIS; DYNAMICS; MODEL; NDVI;

D O I：

10.1117/1.JRS.16.038506

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

For satellite imaging instruments, the tradeoff between spatial and temporal resolution leads to the spatial-temporal contradiction of image sequences. Spatiotemporal image fusion (STIF) provides a solution to generate images with both high-spatial and high-temporal resolutions, thus expanding the applications of existing satellite images. Most deep learning-based STIF methods throw the task to network as a whole and construct an end-to-end model without caring about the intermediate physical process. This leads to high complexity, less interpretability, and low accuracy of the fusion model. To address this problem, we propose a two-stream difference transformation spatiotemporal fusion (TSDTSF), which includes transformation and fusion streams. In the transformation stream, an image difference transformation module reduces the pixel distribution difference of images from different sensors with the same spatial resolution, and a feature difference transformation module improves the feature quality of low-resolution images. The fusion stream focuses on feature fusion and image reconstruction. The TSDTSF shows superior performance in accuracy, vision quality, and robustness. The experimental results show that TSDTSF achieves the effect of the average coefficient of determination (R-2 = 0.7847) and the root mean square error (RMSE = 0.0266), which is better than the suboptimal method average (R-2 = 0.7519) and (RMSE = 0.0289). The quantitative and qualitative experimental results on various datasets demonstrate our superiority over the state-of-the-art methods. (C) 2022 Society of Photo-Optical Instrumentation Engineers (SPIE)

引用

页数：15

共 50 条

[41] Deformation Flow Based Two-Stream Network for Lip Reading
Xiao, Jingyun
Yang, Shuang
Zhang, Yuanhang
Shan, Shiguang
Chen, Xilin
2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 364 - 370
[42] A Novel Deep Learning-Based Spatiotemporal Fusion Method for Combining Satellite Images with Different Resolutions Using a Two-Stream Convolutional Neural Network
Jia, Duo
Song, Changqing
Cheng, Changxiu
Shen, Shi
Ning, Lixin
Hui, Chun
REMOTE SENSING, 2020, 12 (04)
[43] Two-Stream Xception Structure Based on Feature Fusion for DeepFake Detection
Wang, Bin
Huang, Liqing
Huang, Tianqiang
Ye, Feng
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
[44] Two-Stream Deep Fusion Network Based on VAE and CNN for Synthetic Aperture Radar Target Recognition
Du, Lan
Li, Lu
Guo, Yuchen
Wang, Yan
Ren, Ke
Chen, Jian
REMOTE SENSING, 2021, 13 (20)
[45] Multilabel Text Classification Algorithm Based on Fusion of Two-Stream Transformer
Duan, Lihua
You, Qi
Wu, Xinke
Sun, Jun
ELECTRONICS, 2022, 11 (14)
[46] A Two-Stream Hybrid Spatio-Temporal Fusion Network For sEMG-Based Gesture Recognition
Ruiqi Han
Juan Wang
Jia Wang
Instrumentation, 2024, 11 (04) : 53 - 63
[47] Two-Stream Xception Structure Based on Feature Fusion for DeepFake Detection
Bin Wang
Liqing Huang
Tianqiang Huang
Feng Ye
International Journal of Computational Intelligence Systems, 16
[48] Robust Detection of Image Operator Chain with Two-Stream Convolutional Neural Network
Liao, Xin
Li, Kaide
Zhu, Xinshan
Liu, K. J. Ray
IEEE Journal on Selected Topics in Signal Processing, 2020, 5 (955-968): : 955 - 968
[49] Video classification by fusing two-stream image template classification and pretrained network
Zebhi, Saeedeh
AlModarresi, Seyed M. T.
Abootalebi, Vahid
JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (05)
[50] Two-stream Attentive CNNs for Image Retrieval
Yang, Fei
Li, Jia
Wei, Shikui
Zheng, Qinjie
Liu, Ting
Zhao, Yao
PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1513 - 1521

← 1 2 3 4 5 →