Ensemble Machine Learning on the Fusion of Sentinel Time Series Imagery with High-Resolution Orthoimagery for Improved Land Use/Land Cover Mapping

被引:1
作者
Subedi, Mukti Ram [1 ,2 ]
Portillo-Quintero, Carlos [1 ]
McIntyre, Nancy E. [3 ]
Kahl, Samantha S. [4 ]
Cox, Robert D. [1 ]
Perry, Gad [1 ,5 ]
Song, Xiaopeng [6 ]
机构
[1] Texas Tech Univ, Dept Nat Resources Management, Lubbock, TX 79409 USA
[2] Univ Georgia, Warnell Sch Forestry & Nat Resources, Athens, GA 30602 USA
[3] Texas Tech Univ, Dept Biol Sci, Lubbock, TX 79409 USA
[4] Blackburn Coll, Dept Biol, Carlinville, IL 62626 USA
[5] George Mason Univ, Dept Environm Sci & Policy, 4400 Univ Dr, Fairfax, VA 22030 USA
[6] Univ Maryland, Dept Geog Sci, College Pk, MD 20742 USA
关键词
bagging; boosting; stacking; GEOBIA; autocorrelation; target-oriented cross-validation; data fusion; CLASSIFICATION; ACCURACY; AUTOCORRELATION;
D O I
10.3390/rs16152778
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In the United States, several land use and land cover (LULC) data sets are available based on satellite data, but these data sets often fail to accurately represent features on the ground. Alternatively, detailed mapping of heterogeneous landscapes for informed decision-making is possible using high spatial resolution orthoimagery from the National Agricultural Imagery Program (NAIP). However, large-area mapping at this resolution remains challenging due to radiometric differences among scenes, landscape heterogeneity, and computational limitations. Various machine learning (ML) techniques have shown promise in improving LULC maps. The primary purposes of this study were to evaluate bagging (Random Forest, RF), boosting (Gradient Boosting Machines [GBM] and extreme gradient boosting [XGB]), and stacking ensemble ML models. We used these techniques on a time series of Sentinel 2A data and NAIP orthoimagery to create a LULC map of a portion of Irion and Tom Green counties in Texas (USA). We created several spectral indices, structural variables, and geometry-based variables, reducing the dimensionality of features generated on Sentinel and NAIP data. We then compared accuracy based on random cross-validation without accounting for spatial autocorrelation and target-oriented cross-validation accounting for spatial structures of the training data set. Comparison of random and target-oriented cross-validation results showed that autocorrelation in the training data offered overestimation ranging from 2% to 3.5%. The XGB-boosted stacking ensemble on-base learners (RF, XGB, and GBM) improved model performance over individual base learners. We show that meta-learners are just as sensitive to overfitting as base models, as these algorithms are not designed to account for spatial information. Finally, we show that the fusion of Sentinel 2A data with NAIP data improves land use/land cover classification using geographic object-based image analysis.
引用
收藏
页数:17
相关论文
共 53 条
  • [1] Object-based classification of hyperspectral data using Random Forest algorithm
    Amini, Saeid
    Homayouni, Saeid
    Safari, Abdolreza
    Darvishsefat, Ali A.
    [J]. GEO-SPATIAL INFORMATION SCIENCE, 2018, 21 (02) : 127 - 138
  • [2] An object-based approach for mapping forest structural types based on low-density LiDAR and multispectral imagery
    Angel Ruiz, Luis
    Abel Recio, Jorge
    Crespo-Peremarch, Pablo
    Sapena, Marta
    [J]. GEOCARTO INTERNATIONAL, 2018, 33 (05) : 443 - 457
  • [3] [Anonymous], 2020, ECognition Developer.
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [6] CONGALTON RG, 1988, PHOTOGRAMM ENG REM S, V54, P593
  • [7] A REVIEW OF ASSESSING THE ACCURACY OF CLASSIFICATIONS OF REMOTELY SENSED DATA
    CONGALTON, RG
    [J]. REMOTE SENSING OF ENVIRONMENT, 1991, 37 (01) : 35 - 46
  • [8] Comparison of bagging, boosting and stacking algorithms for surface soil moisture mapping using optical-thermal-microwave remote sensing synergies
    Das, Bappa
    Rathore, Pooja
    Roy, Debasish
    Chakraborty, Debashis
    Jatav, Raghuveer Singh
    Sethi, Deepak
    Kumar, Praveen
    [J]. CATENA, 2022, 217
  • [9] Improved landslide assessment using support vector machine with bagging, boosting, and stacking ensemble machine learning framework in a mountainous watershed, Japan
    Dou, Jie
    Yunus, Ali P.
    Dieu Tien Bui
    Merghadi, Abdelaziz
    Sahana, Mehebub
    Zhu, Zhongfan
    Chen, Chi-Wen
    Han, Zheng
    Binh Thai Pham
    [J]. LANDSLIDES, 2020, 17 (03) : 641 - 658
  • [10] Sentinel-2: ESA's Optical High-Resolution Mission for GMES Operational Services
    Drusch, M.
    Del Bello, U.
    Carlier, S.
    Colin, O.
    Fernandez, V.
    Gascon, F.
    Hoersch, B.
    Isola, C.
    Laberinti, P.
    Martimort, P.
    Meygret, A.
    Spoto, F.
    Sy, O.
    Marchese, F.
    Bargellini, P.
    [J]. REMOTE SENSING OF ENVIRONMENT, 2012, 120 : 25 - 36