Geographical random forests: a spatial extension of the random forest algorithm to address spatial heterogeneity in remote sensing and population modelling

被引:286
作者
Georganos, Stefanos [1 ]
Grippa, Tais [1 ]
Gadiaga, Assane Niang [2 ]
Linard, Catherine [2 ]
Lennert, Moritz [1 ]
Vanhuysse, Sabine [1 ]
Mboga, Nicholus [1 ]
Wolff, Eleonore [1 ]
Kalogirou, Stamatis [3 ]
机构
[1] ULB, Dept Geosci Environm & Soc, Brussels, Belgium
[2] Univ Namur, Inst Life Earth & Environm, Namur, Belgium
[3] Harokopio Univ, Dept Geog, Kallithea, Greece
关键词
Random forest; spatial analysis; population estimation; NON-STATIONARITY; WEIGHTED REGRESSION; NDVI; IMAGERY;
D O I
10.1080/10106049.2019.1595177
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Machine learning algorithms such as Random Forest (RF) are being increasingly applied on traditionally geographical topics such as population estimation. Even though RF is a well performing and generalizable algorithm, the vast majority of its implementations is still 'aspatial' and may not address spatial heterogenous processes. At the same time, remote sensing (RS) data which are commonly used to model population can be highly spatially heterogeneous. From this scope, we present a novel geographical implementation of RF, named Geographical Random Forest (GRF) as both a predictive and exploratory tool to model population as a function of RS covariates. GRF is a disaggregation of RF into geographical space in the form of local sub-models. From the first empirical results, we conclude that GRF can be more predictive when an appropriate spatial scale is selected to model the data, with reduced residual autocorrelation and lower Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE) values. Finally, and of equal importance, GRF can be used as an effective exploratory tool to visualize the relationship between dependent and independent variables, highlighting interesting local variations and allowing for a better understanding of the processes that may be causing the observed spatial heterogeneity.
引用
收藏
页码:121 / 136
页数:16
相关论文
共 36 条
[1]   DMSP/OLS night-time light imagery for urban population estimates in the Brazilian Amazon [J].
Amaral, S ;
Monteiro, AMV ;
Camara, G ;
Quintanilha, JA .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2006, 27 (5-6) :855-870
[2]  
[Anonymous], 2018, R Foundation for Statistical Computing
[3]  
ANSD, 2013, RAPP DEF RGPHAE 2013
[5]  
BORDERON M, 2013, SOCIAL VULNERABILITY, P108
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   Using Local Climate Zones in Sub-Saharan Africa to tackle urban health issues [J].
Brousse, Oscar ;
Georganos, Stefanos ;
Demuzere, Matthias ;
Vanhuysse, Sabine ;
Wouters, Hendrik ;
Wolff, Eleonore ;
Linard, Catherine ;
van Lipzig, Nicole P-M ;
Dujardin, Sebastien .
URBAN CLIMATE, 2019, 27 :227-242
[8]   Geographically weighted regression - modelling spatial non-stationarity [J].
Brunsdon, C ;
Fotheringham, S ;
Charlton, M .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1998, 47 :431-443
[9]   Incorporating spatial non-stationarity to improve dasymetric mapping of population [J].
Cockx, Kasper ;
Canters, Frank .
APPLIED GEOGRAPHY, 2015, 63 :220-230
[10]   Geographical weighting as a further refinement to regression modelling: An example focused on the NDVI-rainfall relationship [J].
Foody, GM .
REMOTE SENSING OF ENVIRONMENT, 2003, 88 (03) :283-293