Supervised and self-supervised learning-based cascade spatiotemporal fusion framework and its application

被引:8
作者
Sun, Weixuan [1 ]
Li, Jie [1 ]
Jiang, Menghui [2 ]
Yuan, Qiangqiang [1 ]
机构
[1] Wuhan Univ, Sch Geodesy & Geomat, Wuhan, Peoples R China
[2] Wuhan Univ, Sch Resource & Environm Sci, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Spatiotemporal fusion; Supervised learning; Self -supervised learning; Dual -stage cascade framework; Land surface temperature; LAND-SURFACE TEMPERATURE; REFLECTANCE FUSION; NEURAL-NETWORK; COVER; MODEL; MODIS; IMAGERY; SERIES;
D O I
10.1016/j.isprsjprs.2023.07.022
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Spatiotemporal fusion (STF) is considered an effective way to address the mutual constraints of the spatiotem-poral resolutions of the remote sensing images from a single satellite sensor. Although the deep learning (DL) -based STF methods have shown great potential so far, there are still some deficiencies. Supervised DL-based methods usually train the network on the original scale data with richer spatial information, but they utilize only the data at auxiliary dates to generate the fusion data at prediction dates, which may perform poorly when significant spatiotemporal changes occur between the auxiliary and prediction dates. For self-supervised DL -based methods, the training can focus more on the data at prediction dates, but the network is generally trained on the down-sampled data with less spatial information, causing insufficient spatial structures of the fusion results. Based on the above, we innovatively combine supervised and self-supervised learning and propose a dual-stage cascade STF framework. In the first stage, a model based on supervised learning is trained on the data at auxiliary dates to extract abundant spatial features and obtain the initial fusion results. In the second stage, we utilize a self-supervised strategy to excavate the spatiotemporal features of prediction dates based on the initial fusion results and the observed data at prediction dates. In addition, to alleviate the insufficient consideration of the temporal correlation in existing STF methods, we design a temporal consistency loss function to fully utilize the temporal correlation information between multiple prediction dates to generate more accurate fusion results. The proposed framework can not only realize the STF of remote sensing images but also be well applied to the STF of remote sensing products such as land surface temperature (LST). Based on the quantitative results of three datasets, the proposed method improves the quantitative indices of root mean square error (RMSE), structural similarity (SSIM), erreur relative global adimensionnelle de synthese (ERGAS), peak signal to noise ratio (PSNR), and spectral angle mapper (SAM) by up to 0.2014, 0.0279, 0.6797, 0.7629, and 0.2006 at most compared with the best comparison method, which fully shows the superiority of our method.
引用
收藏
页码:19 / 36
页数:18
相关论文
共 51 条
[1]   A Spatial and Temporal Nonlocal Filter-Based Data Fusion Method [J].
Cheng, Qing ;
Liu, Huiqing ;
Shen, Huanfeng ;
Wu, Penghai ;
Zhang, Liangpei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4476-4488
[2]   Validation of Collection 6 MODIS land surface temperature product using in situ measurements [J].
Duan, Si-Bo ;
Li, Zhao-Liang ;
Li, Hua ;
Goettsche, Frank-M ;
Wu, Hua ;
Zhao, Wei ;
Leng, Pei ;
Zhang, Xia ;
Coll, Cesar .
REMOTE SENSING OF ENVIRONMENT, 2019, 225 :16-29
[3]   On the blending of the Landsat and MODIS surface reflectance: Predicting daily Landsat surface reflectance [J].
Gao, Feng ;
Masek, Jeff ;
Schwaller, Matt ;
Hall, Forrest .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2006, 44 (08) :2207-2218
[4]   A new data fusion model for high spatial- and temporal-resolution mapping of forest disturbance based on Landsat and MODIS [J].
Hilker, Thomas ;
Wulder, Michael A. ;
Coops, Nicholas C. ;
Linke, Julia ;
McDermid, Greg ;
Masek, Jeffrey G. ;
Gao, Feng ;
White, Joanne C. .
REMOTE SENSING OF ENVIRONMENT, 2009, 113 (08) :1613-1627
[5]   Spatio-temporal reflectance fusion via unmixing: accounting for both phenological and land-cover changes [J].
Huang, Bo ;
Zhang, Hankui .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2014, 35 (16) :6213-6233
[6]   Unified fusion of remote-sensing imagery: generating simultaneously high-resolution synthetic spatialtemporalspectral earth observations [J].
Huang, Bo ;
Zhang, Hankui ;
Song, Huihui ;
Wang, Juan ;
Song, Chunqiao .
REMOTE SENSING LETTERS, 2013, 4 (06) :561-569
[7]   Spatiotemporal Reflectance Fusion via Sparse Representation [J].
Huang, Bo ;
Song, Huihui .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2012, 50 (10) :3707-3716
[8]   Deep-Learning-Based Spatio-Temporal-Spectral Integrated Fusion of Heterogeneous Remote Sensing Images [J].
Jiang, Menghui ;
Shen, Huanfeng ;
Li, Jie .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[9]   Spatiotemporal downscaling approaches for monitoring 8-day 30 m actual evapotranspiration [J].
Ke, Yinghai ;
Im, Jungho ;
Park, Seonyoung ;
Gong, Huili .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 126 :79-93
[10]   Pansharpening Quality Assessment Using the Modulation Transfer Functions of Instruments [J].
Khan, Muhammad Murtaza ;
Alparone, Luciano ;
Chanussot, Jocelyn .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (11) :3880-3891