Two-stream spatiotemporal image fusion network based on difference transformation

被引:0
|
作者
Fang, Shuai [1 ,2 ]
Meng, Siyuan [1 ]
Zhang, Jing [1 ]
Cao, Yang [3 ,4 ]
机构
[1] Hefei Univ Technol, Key Lab Knowledge Engn Big Data, Hefei, Peoples R China
[2] Anhui Prov Key Lab Ind Safety & Emergency Technol, Hefei, Peoples R China
[3] Univ Sci & Technol China, Dept Automat, Hefei, Peoples R China
[4] Univ Sci & Technol China, Inst Adv Technol, Hefei, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
spatiotemporal image fusion; convolutional neural network; deep learning; remote sensing; REFLECTANCE FUSION; LANDSAT; MODIS; DYNAMICS; MODEL; NDVI;
D O I
10.1117/1.JRS.16.038506
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
For satellite imaging instruments, the tradeoff between spatial and temporal resolution leads to the spatial-temporal contradiction of image sequences. Spatiotemporal image fusion (STIF) provides a solution to generate images with both high-spatial and high-temporal resolutions, thus expanding the applications of existing satellite images. Most deep learning-based STIF methods throw the task to network as a whole and construct an end-to-end model without caring about the intermediate physical process. This leads to high complexity, less interpretability, and low accuracy of the fusion model. To address this problem, we propose a two-stream difference transformation spatiotemporal fusion (TSDTSF), which includes transformation and fusion streams. In the transformation stream, an image difference transformation module reduces the pixel distribution difference of images from different sensors with the same spatial resolution, and a feature difference transformation module improves the feature quality of low-resolution images. The fusion stream focuses on feature fusion and image reconstruction. The TSDTSF shows superior performance in accuracy, vision quality, and robustness. The experimental results show that TSDTSF achieves the effect of the average coefficient of determination (R-2 = 0.7847) and the root mean square error (RMSE = 0.0266), which is better than the suboptimal method average (R-2 = 0.7519) and (RMSE = 0.0289). The quantitative and qualitative experimental results on various datasets demonstrate our superiority over the state-of-the-art methods. (C) 2022 Society of Photo-Optical Instrumentation Engineers (SPIE)
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Deformation Flow Based Two-Stream Network for Lip Reading
    Xiao, Jingyun
    Yang, Shuang
    Zhang, Yuanhang
    Shan, Shiguang
    Chen, Xilin
    2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 364 - 370
  • [42] A Novel Deep Learning-Based Spatiotemporal Fusion Method for Combining Satellite Images with Different Resolutions Using a Two-Stream Convolutional Neural Network
    Jia, Duo
    Song, Changqing
    Cheng, Changxiu
    Shen, Shi
    Ning, Lixin
    Hui, Chun
    REMOTE SENSING, 2020, 12 (04)
  • [43] Two-Stream Xception Structure Based on Feature Fusion for DeepFake Detection
    Wang, Bin
    Huang, Liqing
    Huang, Tianqiang
    Ye, Feng
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [44] Two-Stream Deep Fusion Network Based on VAE and CNN for Synthetic Aperture Radar Target Recognition
    Du, Lan
    Li, Lu
    Guo, Yuchen
    Wang, Yan
    Ren, Ke
    Chen, Jian
    REMOTE SENSING, 2021, 13 (20)
  • [45] Multilabel Text Classification Algorithm Based on Fusion of Two-Stream Transformer
    Duan, Lihua
    You, Qi
    Wu, Xinke
    Sun, Jun
    ELECTRONICS, 2022, 11 (14)
  • [46] A Two-Stream Hybrid Spatio-Temporal Fusion Network For sEMG-Based Gesture Recognition
    Ruiqi Han
    Juan Wang
    Jia Wang
    Instrumentation, 2024, 11 (04) : 53 - 63
  • [47] Two-Stream Xception Structure Based on Feature Fusion for DeepFake Detection
    Bin Wang
    Liqing Huang
    Tianqiang Huang
    Feng Ye
    International Journal of Computational Intelligence Systems, 16
  • [48] Robust Detection of Image Operator Chain with Two-Stream Convolutional Neural Network
    Liao, Xin
    Li, Kaide
    Zhu, Xinshan
    Liu, K. J. Ray
    IEEE Journal on Selected Topics in Signal Processing, 2020, 5 (955-968): : 955 - 968
  • [49] Video classification by fusing two-stream image template classification and pretrained network
    Zebhi, Saeedeh
    AlModarresi, Seyed M. T.
    Abootalebi, Vahid
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (05)
  • [50] Two-stream Attentive CNNs for Image Retrieval
    Yang, Fei
    Li, Jia
    Wei, Shikui
    Zheng, Qinjie
    Liu, Ting
    Zhao, Yao
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1513 - 1521