Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning

被引:36
作者
Arribas-Bel, Daniel [1 ]
Patino, Jorge E. [2 ]
Duque, Juan C. [2 ]
机构
[1] Univ Liverpool, Dept Geog & Planning, Liverpool, Merseyside, England
[2] Univ EAFIT, Dept Econ, Res Spatial Econ RiSE Grp, Medellin, Colombia
关键词
QUALITY-OF-LIFE; URBAN GREEN SPACE; SURFACE-TEMPERATURE; SATELLITE IMAGERY; RESIDENTIAL LAND; GOOGLE EARTH; CENSUS-DATA; POVERTY; NEIGHBORHOOD; VEGETATION;
D O I
10.1371/journal.pone.0176684
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper provides evidence on the usefulness of very high spatial resolution (VHR) imagery in gathering socioeconomic information in urban settlements. We use land cover, spectral, structure and texture features extracted from a Google Earth image of Liverpool (UK) to evaluate their potential to predict Living Environment Deprivation at a small statistical area level. We also contribute to the methodological literature on the estimation of socioeconomic indices with remote-sensing data by introducing elements from modern machine learning. In addition to classical approaches such as Ordinary Least Squares (OLS) regression and a spatial lag model, we explore the potential of the Gradient Boost Regressor and Random Forests to improve predictive performance and accuracy. In addition to novel predicting methods, we also introduce tools for model interpretation and evaluation such as feature importance and partial dependence plots, or cross-validation. Our results show that Random Forest proved to be the best model with an R-2 of around 0.54, followed by Gradient Boost Regressor with 0.5. Both the spatial lag model and the OLS fall behind with significantly lower performances of 0.43 and 0.3, respectively.
引用
收藏
页数:25
相关论文
共 97 条
[1]  
Allen R. G., 1998, FAO Irrigation and Drainage Paper
[2]  
[Anonymous], ANN AM ASS GEOGRAPHE
[3]  
[Anonymous], PREVENTING CHRONIC D
[4]  
[Anonymous], CENS GEOGR OV VAR GE
[5]  
Anselin L., 2014, Modern spatial econometrics in practice: A guide to GeoDa, GeoDaSpace and PySAL
[6]   A SPATIAL CLIFF-ORD-TYPE MODEL WITH HETEROSKEDASTIC INNOVATIONS: SMALL AND LARGE SAMPLE RESULTS [J].
Arraiz, Irani ;
Drukker, David M. ;
Kelejian, Harry H. ;
Prucha, Ingmar R. .
JOURNAL OF REGIONAL SCIENCE, 2010, 50 (02) :592-614
[7]   Recursive partitioning for heterogeneous causal effects [J].
Athey, Susan ;
Imbens, Guido .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) :7353-7360
[8]   Definition of a comprehensive set of texture semivariogram features and their evaluation for object-oriented image classification [J].
Balaguer, A. ;
Ruiz, L. A. ;
Hermosilla, T. ;
Recio, J. A. .
COMPUTERS & GEOSCIENCES, 2010, 36 (02) :231-240
[9]   Using semivariogram indices to analyse heterogeneity in spatial patterns in remotely sensed images [J].
Balaguer-Beser, A. ;
Ruiz, L. A. ;
Hermosilla, T. ;
Recio, J. A. .
COMPUTERS & GEOSCIENCES, 2013, 50 :115-127
[10]  
Barrett M, 2005, STUD DEV PSYCHOL, P1