Methods for Estimating Population Density in Data-Limited Areas: Evaluating Regression and Tree-Based Models in Peru

被引:23
作者
Anderson, Weston [1 ,3 ]
Guikema, Seth [1 ]
Zaitchik, Ben [2 ]
Pan, William [4 ,5 ]
机构
[1] Johns Hopkins Univ, Dept Geog & Environm Engn, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Dept Earth & Planetary Sci, Baltimore, MD 21218 USA
[3] Int Food Policy Res Inst, Washington, DC 20036 USA
[4] Duke Univ, Nicholas Sch Environm, Durham, NC 27708 USA
[5] Duke Univ, Duke Global Hlth Inst, Durham, NC USA
关键词
D O I
10.1371/journal.pone.0100037
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Obtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focuses on model-based methods for estimating population when no direct samples are available in the area of interest. To explore the efficacy of tree-based models for estimating population density, we compare six different model structures including Random Forest and Bayesian Additive Regression Trees. Results demonstrate that without information from prior time periods, non-parametric tree-based models produced more accurate predictions than did conventional regression methods. Improving estimates of population density in non-sampled areas is important for regions with incomplete census data and has implications for economic, health and development policies.
引用
收藏
页数:15
相关论文
共 40 条
[1]   A Contemporary Assessment of Change in Humid Tropical Forests [J].
Asner, Gregory P. ;
Rudel, Thomas K. ;
Aide, T. Mitchell ;
Defries, Ruth ;
Emerson, Ruth .
CONSERVATION BIOLOGY, 2009, 23 (06) :1386-1395
[2]   SimBritain: A spatial microsimulation approach to population dynamics [J].
Ballas, D ;
Clarke, G ;
Dorling, D ;
Eyre, H ;
Thomas, B ;
Rossiter, D .
POPULATION SPACE AND PLACE, 2005, 11 (01) :13-34
[3]   AN ERROR-COMPONENTS MODEL FOR PREDICTION OF COUNTY CROP AREAS USING SURVEY AND SATELLITE DATA [J].
BATTESE, GE ;
HARTER, RM ;
FULLER, WA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (401) :28-36
[4]  
BOUDOT Y, 1993, EARSEL ADV REMOTE SE, V3
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   APPROXIMATE INFERENCE IN GENERALIZED LINEAR MIXED MODELS [J].
BRESLOW, NE ;
CLAYTON, DG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :9-25
[7]  
Cameron A. C., 2005, Microeconometrics: Methods and Applications
[8]   M-quantile models for small area estimation [J].
Chambers, Ray ;
Tzavidis, Nikos .
BIOMETRIKA, 2006, 93 (02) :255-268
[9]   BART: BAYESIAN ADDITIVE REGRESSION TREES [J].
Chipman, Hugh A. ;
George, Edward I. ;
McCulloch, Robert E. .
ANNALS OF APPLIED STATISTICS, 2010, 4 (01) :266-298
[10]  
*CIESIN COL U, 2005, POV MAPP PROJ SMALL