Nonparametric machine learning for mapping forest cover and exploring influential factors

被引:15
作者
Liu, Bao [1 ]
Gao, Lei [2 ,3 ]
Li, Baoan [1 ]
Marcos-Martinez, Raymundo [4 ]
Bryan, Brett A. [5 ]
机构
[1] China Univ Petr East China, Coll Informat & Control Engn, Qingdao 266580, Peoples R China
[2] CSIRO, Waite Campus, Urrbrae, SA 5064, Australia
[3] Shandong Normal Univ, Sch Business, Jinan 250014, Peoples R China
[4] CSIRO, Csiro, ACT 2601, Australia
[5] Deakin Univ, Ctr Integrat Ecol, Melbourne Burwood Campus, Burwood, Vic 3125, Australia
关键词
Machine learning; Support vector regression; Artificial neural network; Random forest; Gradient boosted regression tree; Forest cover; SUPPORT VECTOR REGRESSION; LAND-USE; SENSITIVITY-ANALYSIS; LOGISTIC-REGRESSION; CONTINUOUS FIELDS; DEEP UNCERTAINTY; NEURAL-NETWORKS; TREE-COVER; DYNAMICS; CLASSIFICATION;
D O I
10.1007/s10980-020-01046-0
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Context The contribution of forest ecosystem services to human well-being varies over space following the dynamics in forest cover. Use of machine learning models is increasing in projecting forest cover changes and investigating the drivers, yet references are still lacking for selecting machine learning models for spatial projection of forest cover patterns. Objectives We assessed the ability of nonparametric machine learning techniques to project the spatial distribution of forest cover and identify its drivers using a case study of Tasmania, Australia. Methods We developed, evaluated, and compared the performance of four nonparametric machine learning models: support vector regression (SVR), artificial neural networks (ANN), random forest (RF), and gradient boosted regression trees (GBRT). Results The results demonstrated that RF far outperformed the other three models in both fitting and projection accuracy, and required less computional costs. GBRT outperformed SVR and ANN in projection accuracy. However, RF exhibited serious overfitting due to the full growth of its decision trees. The influence rankings of explanatory variables on spatial patterns of forest cover were different under the four models. Land tenure type and rainfall were identified among the top four most influential variables by all four models. The ranking produced by the RF model was significantly different with topographic factors associated with land clearing and production costs (elevation and distance to timber facilities) being the two most influential variables. Conclusions We encourage practitioners to consider nonparametric machine learning methods, especially RF, when facing problems of complex environmental data modelling.
引用
收藏
页码:1683 / 1699
页数:17
相关论文
共 91 条
  • [1] ACLEP, 2014, NAT SOIL DAT PROV AU
  • [2] Measuring the effectiveness of protected area networks in reducing deforestation
    Andam, Kwaw S.
    Ferraro, Paul J.
    Pfaff, Alexander
    Sanchez-Azofeifa, G. Arturo
    Robalino, Juan A.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (42) : 16089 - 16094
  • [3] [Anonymous], 2016, ACC REM IND AUSTR AR
  • [4] [Anonymous], 2015, CLIM DAT ONL
  • [5] [Anonymous], 2014, METAANALYSIS SPATIAL
  • [6] [Anonymous], TEN AUSTR FOR 2013 V
  • [7] [Anonymous], 2014, COLL AUSTR PROT AR D
  • [8] Spatial and temporal dynamics of land use pattern in Turkey: A case study in Inegol
    Baskent, Emin Zeki
    Kadiogullari, Ali Ihsan
    [J]. LANDSCAPE AND URBAN PLANNING, 2007, 81 (04) : 316 - 327
  • [9] A gradient boosting approach to the Kaggle load forecasting competition
    Ben Taieb, Souhaib
    Hyndman, Rob J.
    [J]. INTERNATIONAL JOURNAL OF FORECASTING, 2014, 30 (02) : 382 - 394
  • [10] Bergstra J, 2012, J MACH LEARN RES, V13, P281