A spectral-temporal constrained deep learning method for tree species mapping of plantation forests using time series Sentinel-2 imagery

被引:16
作者
Huang, Zehua [1 ]
Zhong, Liheng [2 ]
Zhao, Feng [3 ]
Wu, Jin [4 ,5 ]
Tang, Hao [6 ]
Lv, Zhengang [1 ]
Xu, Binyuan [1 ]
Zhou, Longfei [1 ]
Sun, Rui [1 ]
Meng, Ran [1 ,7 ]
机构
[1] Huazhong Agr Univ, Coll Resources & Environm, Wuhan 430070, Peoples R China
[2] Ant Grp, Hangzhou 311121, Peoples R China
[3] Cent China Normal Univ, Coll Urban & Environm Sci, Wuhan 430079, Peoples R China
[4] Univ Hong Kong, Sch Biol Sci, Hong Kong, Peoples R China
[5] Chinese Univ Hong Kong, State Key Lab Agrobiotechnol, Shatin, Hong Kong, Peoples R China
[6] Natl Univ Singapore, Dept Geog, Singapore, Singapore
[7] HIT Artificial Intelligence Res Inst Co Ltd, Harbin, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Tree species mapping; Key phenological stage; Transformer; Attention mechanism; Deep learning; Plantation forests; RECURRENT NEURAL-NETWORKS; LAND-COVER CLASSIFICATION; CROP CLASSIFICATION; FEATURE-SELECTION; NATIONAL-SCALE; MODIS NDVI; LIDAR; PERFORMANCE; ATTRIBUTES; REGRESSION;
D O I
10.1016/j.isprsjprs.2023.09.009
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Plantation forests provide critical ecosystem services and have experienced worldwide expansion during the past few decades. Accurate mapping of tree species through remote sensing is critical for managing plantation forests. The typical temporal behaviors and traits of tree species in satellite image time series (SITS) generate temporal and spectral features in multiple phenological stages that are critical to improve tree species mapping. However, the diverse input features, sequential relations and complex structures in SITS drastically increase the dimension and difficulty of spectral-temporal feature extraction, which challenges the capacity of many general classifiers not explicitly adapted for spectral-temporal learning. As a result, there is still a lack of a method that could automatically extract spectral-temporal features with high separability and regional adaptability from highdimensional SITS for tree species mapping of plantation forests. Moreover, the effects of varying temporal resolution and feature combination on the plantation tree species mapping are under-explored. Here, we developed a multi-head attention-based method for automatically extracting spectral-temporal features with high separability based on a modified Transformer network (Transformer4SITS) for improved plantation tree species mapping. The end-to-end network model consists of a feature extraction module to learn deep spectral-temporal features from SITS and a fusion module to combine multiple features for improving mapping accuracy. We applied this method to two representative plantation forests in southern and northern China for tree species mapping. The results show that: (1) Transformer4SITS method could self-adaptively extract typical spectraltemporal features of key phenological stages (e.g., greenness rising and falling) from SITS, and achieved significantly improved accuracies by at most 15% in comparison with all four baseline methods (i.e., long shortterm memory, harmonic analysis, time-weighted dynamic time warping, linear discriminant analysis); (2) time series with higher temporal resolution tended to produce more accurate species maps consistently across two sites, with their overall accuracies (OA) respectively increasing from 91.05% and 84.33% (60-day) to 94.88% and 88.72% (5-day), but the effect of high temporal resolution respectively leveled off around 90-day and 50-day resolution across two sites; (3) the mapping results using all available bands and two-band spectral indices outperformed the results using a subset of them, but with only modest increase in the accuracy (i.e., OA increased from 93.63% and 86.01% to 94.88% and 88.72%. This study thus provides a state-of-the-art deep learning-based method for improved tree species mapping, which is critical for sustainable management and biodiversity monitoring of plantation forests across large scales.
引用
收藏
页码:397 / 420
页数:24
相关论文
共 137 条
[1]  
[Anonymous], 2020, GLOBAL FOREST RESOUR, DOI [DOI 10.4060/CA9825EN, 10.4060/ca8753en, DOI 10.4060/CA8753EN, DOI 10.4060/CA9825-N]
[2]   Segmentation-aided classification of hyperspectral data using spatial dependency of spectral bands [J].
Appice, Annalisa ;
Malerba, Donato .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 147 :215-231
[3]  
ArcGIS, 2022, Spatial Autocorrelation (Global Moran's I) (Spatial Statistics)
[4]  
ArcGIS, 2022, What is a z-score? What is a p-value?
[5]   Hyperspectral Image Classification Based on Multibranch Attention Transformer Networks [J].
Bai, Jing ;
Wen, Zheng ;
Xiao, Zhu ;
Ye, Fawang ;
Zhu, Yongdong ;
Alazab, Mamoun ;
Jiao, Licheng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[6]   Classification of Hyperspectral Images With Regularized Linear Discriminant Analysis [J].
Bandos, Tatyana V. ;
Bruzzone, Lorenzo ;
Camps-Valls, Gustavo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (03) :862-873
[7]  
Basu M., 2006, Data Complexity in Pattern Recognition, DOI DOI 10.1007/978-1-84628-172-3
[8]  
Belgiu M., 2020, The International Archives of the Photogrammetry. Rem. Sens. Spatial Inf. Sci., V43, P947, DOI DOI 10.5194/ISPRS-ARCHIVES-XLIII-B3-2020-947-2020
[9]   Sentinel-2 cropland mapping using pixel-based and object-based time-weighted dynamic time warping analysis [J].
Belgiu, Mariana ;
Csillik, Ovidiu .
REMOTE SENSING OF ENVIRONMENT, 2018, 204 :509-523
[10]   ESA's sentinel missions in support of Earth system science [J].
Berger, Michael ;
Moreno, Jose ;
Johannessen, Johnny A. ;
Levelt, Pieternel F. ;
Hanssen, Ramon F. .
REMOTE SENSING OF ENVIRONMENT, 2012, 120 :84-90