Processing aggregated data: the location of clusters in health data

被引:6
作者
Buchin, Kevin [1 ]
Buchin, Maike [1 ]
van Kreveld, Marc [2 ]
Loeffler, Maarten [3 ]
Luo, Jun [4 ]
Silveira, Rodrigo I. [5 ]
机构
[1] TU Eindhoven, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] Univ Utrecht, Dept Comp Sci, NL-3508 TB Utrecht, Netherlands
[3] Univ Calif Irvine, Dept Comp Sci, Irvine, CA USA
[4] Chinese Acad Sci, Shenzhen Inst Adv Technol, Beijing, Peoples R China
[5] Univ Politecn Cataluna, Dept Matemat Aplicada 2, Catalunya, Spain
关键词
Cluster; Aggregated data; Algorithm; Public health; DISEASE; INTERPOLATION; OUTBREAK;
D O I
10.1007/s10707-011-0143-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Spatially aggregated data is frequently used in geographical applications. Often spatial data analysis on aggregated data is performed in the same way as on exact data, which ignores the fact that we do not know the actual locations of the data. We here propose models and methods to take aggregation into account. For this we focus on the problem of locating clusters in aggregated data. More specifically, we study the problem of locating clusters in spatially aggregated health data. The data is given as a subdivision into regions with two values per region, the number of cases and the size of the population at risk. We formulate the problem as finding a placement of a cluster window of a given shape such that a cluster function depending on the population at risk and the cases is maximized. We propose area-based models to calculate the cases (and the population at risk) within a cluster window. These models are based on the areas of intersection of the cluster window with the regions of the subdivision. We show how to compute a subdivision such that within each cell of the subdivision the areas of intersection are simple functions. We evaluate experimentally how taking aggregation into account influences the location of the clusters found.
引用
收藏
页码:497 / 521
页数:25
相关论文
共 36 条
[1]  
[Anonymous], 2000, The mathematica book
[2]  
[Anonymous], 1987, INT J GEOGR INF SYST, DOI [DOI 10.1080/02693798708927821, 10.1080/02693798708927821]
[3]  
[Anonymous], 1997, SOLUTION ECOLOGICAL
[4]  
Armstrong MP, 1999, STAT MED, V18, P497, DOI 10.1002/(SICI)1097-0258(19990315)18:5<497::AID-SIM45>3.3.CO
[5]  
2-R
[6]   Map-making and myth-making in Broad Street: the London cholera epidemic, 1854 [J].
Brody, H ;
Rip, MR ;
Vinten-Johansen, P ;
Paneth, N ;
Rachman, S .
LANCET, 2000, 356 (9223) :64-68
[7]   EVALUATION OF METHODS FOR ECOLOGICAL INFERENCE [J].
CLEAVE, N ;
BROWN, PJ ;
PAYNE, CD .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1995, 158 :55-72
[8]  
Cox LH, 1996, STAT MED, V15, P1895, DOI 10.1002/(SICI)1097-0258(19960915)15:17<1895::AID-SIM401>3.0.CO
[9]  
2-W
[10]  
Cromley E.K., 2002, GIS and Public Health