Density estimation and adaptive bandwidths: A primer for public health practitioners

被引:100
作者
Carlos, Heather A. [1 ]
Shi, Xun [2 ]
Sargent, James [1 ]
Tanski, Susanne [1 ]
Berke, Ethan M. [1 ,2 ,3 ,4 ]
机构
[1] Dartmouth Hitchcock Med Ctr, Norris Cotton Canc Ctr, Lebanon, NH 03766 USA
[2] Dartmouth Coll, Dept Geog, Hanover, NH 03755 USA
[3] Dartmouth Coll, Hitchcock Med Ctr, Dartmouth Med Sch, Dept Community & Family Med, Hanover, NH 03756 USA
[4] Dartmouth Inst Hlth Policy & Clin Practice, Prevent Res Ctr Dartmouth, Lebanon, NH USA
基金
美国国家卫生研究院;
关键词
DISEASE; PATTERN;
D O I
10.1186/1476-072X-9-39
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Geographic information systems have advanced the ability to both visualize and analyze point data. While point-based maps can be aggregated to differing areal units and examined at varying resolutions, two problems arise 1) the modifiable areal unit problem and 2) any corresponding data must be available both at the scale of analysis and in the same geographic units. Kernel density estimation (KDE) produces a smooth, continuous surface where each location in the study area is assigned a density value irrespective of arbitrary administrative boundaries. We review KDE, and introduce the technique of utilizing an adaptive bandwidth to address the underlying heterogeneous population distributions common in public health research. Results: The density of occurrences should not be interpreted without knowledge of the underlying population distribution. When the effect of the background population is successfully accounted for, differences in point patterns in similar population areas are more discernible; it is generally these variations that are of most interest. A static bandwidth KDE does not distinguish the spatial extents of interesting areas, nor does it expose patterns above and beyond those due to geographic variations in the density of the underlying population. An adaptive bandwidth method uses background population data to calculate a kernel of varying size for each individual case. This limits the influence of a single case to a small spatial extent where the population density is high as the bandwidth is small. If the primary concern is distance, a static bandwidth is preferable because it may be better to define the "neighborhood" or exposure risk based on distance. If the primary concern is differences in exposure across the population, a bandwidth adapting to the population is preferred. Conclusions: Kernel density estimation is a useful way to consider exposure at any point within a spatial frame, irrespective of administrative boundaries. Utilization of an adaptive bandwidth may be particularly useful in comparing two similarly populated areas when studying health disparities or other issues comparing populations in public health.
引用
收藏
页数:8
相关论文
共 26 条
[1]  
[Anonymous], 2009, The New York Times
[2]  
BERKE EM, 2010, AM J PUBLIC HLTH
[3]  
Bithell JF, 2000, STAT MED, V19, P2203, DOI 10.1002/1097-0258(20000915/30)19:17/18<2203::AID-SIM564>3.0.CO
[4]  
2-U
[5]   AN APPLICATION OF DENSITY-ESTIMATION TO GEOGRAPHICAL EPIDEMIOLOGY [J].
BITHELL, JF .
STATISTICS IN MEDICINE, 1990, 9 (06) :691-701
[6]   ESTIMATING PROBABILITY SURFACES FOR GEOGRAPHICAL POINT DATA - AN ADAPTIVE KERNEL ALGORITHM [J].
BRUNSDON, C .
COMPUTERS & GEOSCIENCES, 1995, 21 (07) :877-894
[7]  
*DMSP DAT COLL US, WORLD STAB LIT
[8]  
Dobson JE, 2000, PHOTOGRAMM ENG REM S, V66, P849
[9]   Spatial point pattern analysis and its application in geographical epidemiology [J].
Gatrell, AC ;
Bailey, TC ;
Diggle, PJ ;
Rowlingson, BS .
TRANSACTIONS OF THE INSTITUTE OF BRITISH GEOGRAPHERS, 1996, 21 (01) :256-274
[10]  
Guagliardo Mark F, 2004, Int J Health Geogr, V3, P3