Completing the machine learning saga in fractional snow cover estimation from MODIS Terra reflectance data: Random forests versus support vector regression

被引:69
作者
Kuter, Semih [1 ]
机构
[1] Cankiri Karatekin Univ, Fac Forestry, Dept Forest Engn, TR-18200 Cankiri, Turkey
关键词
Fractional snow cover mapping; Multivariate adaptive regression splines; Artificial neural networks; Support vector machines; MODIS Terra; Landsat; 8; Machine learning; Remote sensing of snow; Alps; ARTIFICIAL NEURAL-NETWORK; REMOTE-SENSING DATA; LAND-COVER; TEMPERATURE ESTIMATION; TRAINING DATA; MISSING DATA; GRAIN-SIZE; CLASSIFICATION; IMAGE; RESOLUTION;
D O I
10.1016/j.rse.2021.112294
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This study; i) investigates the suitability of two frequently employed machine learning algorithms in remote sensing, namely, random forests (RFs) and support vector regression (SVR) for fractional snow cover (FSC) estimation from MODIS Terra data, and ii) compares them with the previously proposed artificial neural networks (ANNs) and multivariate adaptive regression splines (MARS) methods over an heterogeneous and complex alpine terrain. The dataset comprises 20 Landsat 8 - MODIS image pairs that belong to European Alps acquired from Apr 2013 to Dec 2016. The fifteen image pairs are used to generate the training dataset necessary to build the models, whereas the remaining five are employed as a separate test dataset. The reference FSC maps are derived from the binary classified Landsat 8 snow/no snow maps at 30 m resolution. In order to assess the effect of sampling type and sample size, nine different training datasets are generated. The RF and SVR models are trained accordingly by using various settings of model tuning parameters. During the training of the models, MODIS top-of-atmosphere reflectance values of bands 1-7, NDSI, NDVI and land cover class are input as independent variables (i.e., predictors) to estimate the dependent variable (i.e., response), i.e., FSC value. The resolution of the generated FSC maps is 500 m. The results indicate that the ANN, MARS, RF and SVR models exhibit high consistency with reference FSC values as indicated by low RMSE (similar to 0.14) and high R (similar to 0.93) values. In order to analyze the effect of using three auxiliary variables, i.e., NDSI, NDVI and land cover class, to the predictive ability of the models; ANN, MARS, RF and SVR models are also trained without these predictor variables, i.e., by only using MODIS bands 1-7. The models trained without three auxiliary variables slightly differ from the ones trained with the full set of predictors by only resulting in a mean decrease in R <0.012 and a mean increase in RMSE <0.009, showing that they perform well in solving the complex functional dependencies by only using MODIS reflectance data. In terms of computational efficiencies of the proposed algorithms measured by the CPU times spent during model training, MARS and RF algorithms outperform ANN and SVR methods.
引用
收藏
页数:30
相关论文
共 124 条
[1]   Detecting Sirex noctilio grey-attacked and lightning-struck pine trees using airborne hyperspectral data, random forest and support vector machines classifiers [J].
Abdel-Rahman, Elfatih M. ;
Mutanga, Onisimo ;
Adam, Elhadi ;
Ismail, Riyad .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 88 :48-59
[2]   Change detection using adaptive fuzzy neural networks: Environmental damage assessment after the Gulf War [J].
Abuelgasim, AA ;
Ross, WD ;
Gopal, S ;
Woodcock, CE .
REMOTE SENSING OF ENVIRONMENT, 1999, 70 (02) :208-223
[3]  
Ackerman S., 2010, Cooperative Institute for Meteorological Satellite Studies
[4]   CMARS and GAM & CQP-Modern optimization methods applied to international credit default prediction [J].
Alp, Ozge Sezgin ;
Buyukbebeci, Erkan ;
Cekic, Aysegul Iscanoglu ;
Ozkurt, Fatma Yerlikaya ;
Taylan, Pakize ;
Weber, Gerhard-Wilhelm .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2011, 235 (16) :4639-4651
[5]  
[Anonymous], WIRES DATA MINING KN, V2, P493
[6]  
[Anonymous], 2002, Statistical Methods for the Analysis of Repeated Measurements
[7]  
[Anonymous], 1990, Neurocomputing
[8]  
[Anonymous], 2012, OVERVIEW RANDOM FORE
[9]  
[Anonymous], 2021, IEEE Trans. Broadcast.
[10]  
[Anonymous], REMOTE SENS ENV, V95, P77