Spectral-Temporal Fusion of Satellite Images via an End-to-End Two-Stream Attention With an Effective Reconstruction Network

被引:1
作者
Benzenati, Tayeb [1 ,2 ]
Kessentini, Yousri [1 ]
Kallel, Abdelaziz [1 ]
机构
[1] Digital Res Ctr Sfax, Lab Signals Syst Artificial Intelligence & Network, Sfax 3021, Tunisia
[2] Univ Boumerdes, LIMOSE Lab, Boumerdes 35000, Algeria
关键词
Attention mechanism; convolutional neural network (CNN); image fusion; multisensor image fusion; Planetscope; Sentinel-2; spectral-temporal fusion; LANDSAT; REFLECTANCE; RESOLUTION;
D O I
10.1109/JSTARS.2023.3234722
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to technical and budget constraints on current optical satellites, the acquisition of satellite images with the best resolutions is not practicable. In this article, aiming to produce products with high spectral (HS) and temporal resolutions, we introduced a two-stream spectral-temporal fusion technique based on attention mechanism called STA-Net. STA-Net aims to combine high spectral and low temporal (HSLT) resolution images with low spectral and high temporal (LSHT) resolution images to generate products with the best characteristics. The proposed technique involves two stages. In the first one, two fused images are generated by a two-stream architecture based on residual attention blocks. The temporal difference estimator stream estimates the temporal difference between HS images at desired and neighboring dates. The reflectance difference estimator is the second stream. It predicts the reflectance difference between the input images (HS-LS) to map LS images into HS products. In the second stage, a reconstruction network combines the latter two-stream outputs via an effective learnable weighted-sum strategy. The two-stage model is trained in an end-to-end fashion using an effective loss function to ensure the best fusion quality. To the best of our knowledge, this work represents the first attempt to address the spectral-temporal fusion using an end-to-end deep neural network model. Experimental results conducted on two actual datasets of Sentinel-2 (HSLT:10 spectral bands and long revisit period) and Planetscope (LSHT: four spectral bands and daily images) images, which proved the effectiveness of the proposed technique with respect to baseline technique.
引用
收藏
页码:1308 / 1320
页数:13
相关论文
共 61 条
  • [1] [Anonymous], 2017, P IEEE C COMP VIS PA
  • [2] Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
  • [3] Spatiotemporal Image Fusion in Remote Sensing
    Belgiu, Mariana
    Stein, Alfred
    [J]. REMOTE SENSING, 2019, 11 (07)
  • [4] Deforestation detection using multitemporal satellite images
    Candra, Danang Surya
    [J]. FIFTH INTERNATIONAL CONFERENCES OF INDONESIAN SOCIETY FOR REMOTE SENSING: THE REVOLUTION OF EARTH OBSERVATION FOR A BETTER HUMAN LIFE, 2020, 500
  • [5] Chan W, 2016, INT CONF ACOUST SPEE, P4960, DOI 10.1109/ICASSP.2016.7472621
  • [6] Chaudhari S, 2021, Arxiv, DOI [arXiv:1904.02874, DOI 10.48550/ARXIV.1904.02874]
  • [7] Comparison of Spatiotemporal Fusion Models: A Review
    Chen, Bin
    Huang, Bo
    Xu, Bing
    [J]. REMOTE SENSING, 2015, 7 (02) : 1798 - 1835
  • [8] A Two Stream Siamese Convolutional Neural Network For Person Re-Identification
    Chung, Dahjung
    Tahboub, Khalid
    Delp, Edward J.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1992 - 2000
  • [9] Control of goal-directed and stimulus-driven attention in the brain
    Corbetta, M
    Shulman, GL
    [J]. NATURE REVIEWS NEUROSCIENCE, 2002, 3 (03) : 201 - 215
  • [10] Deshmukh Manjusha, 2010, Int. J. Image Process., V4, P484