Power to Detect Spatial Disturbances under Different Levels of Geographic Aggregation

Cited by: 15
Authors
Jeffery, Caroline [1]
Ozonoff, Al [1,2]
White, Laura F. [2]
Nuno, Miriam [1]
Pagano, Marcello [1]
Affiliations
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Boston Univ, Sch Publ Hlth, Dept Biostat, Boston, MA USA
Funding
US National Institutes of Health (NIH);
Keywords
CLUSTER DETECTION; CRYPTOSPORIDIUM INFECTION; HEALTH DATA; OUTBREAK; CONFIDENTIALITY; MILWAUKEE; TESTS;
DOI
10.1197/jamia.M2788
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Objective: Spatial and/or temporal surveillance systems are designed to monitor the ongoing appearance of disease cases in space and time, and to detect potential disturbances in either dimension. Patient addresses are sometimes reported at some level of geographic aggregation, for example by ZIP code or census tract. While this aggregation has the advantage of protecting patient privacy, it also risks compromising statistical efficiency. This paper investigated how the power to detect a change in the spatial distribution varies in the presence of spatial aggregation. Methods: The authors generated 400,000 spatial datasets with varying location and spread of simulated spatial disturbances, both on a purely synthetic uniform population and on a heterogeneous population representing hospital admissions to three community hospitals in Cape Cod, Massachusetts. The authors evaluated the power of the M-statistic, a comparison of two distributions of interpoint distances between locations, to detect spatial disturbances, comparing the use of exact spatial locations with twelve different levels of aggregation. Results: When the spread of simulated spatial disturbances was confined to a small portion of the study region or affected a large proportion of the population at risk, power was highest when exact locations were reported. If the spatial disturbance was a more modest signal, the best power was attained at an aggregated level. Conclusions: The precision at which patients' locations are reported has the potential to affect the power of detection significantly. J Am Med Inform Assoc. 2009;16:847-854. DOI 10.1197/jamia.M2788.
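The abstract describes the M-statistic only as a comparison of two distributions of interpoint distances, and aggregation only as reporting by ZIP code or census tract. The following is a minimal sketch of that idea, not the authors' implementation: it assumes binned distance proportions compared through a Mahalanobis-type quadratic form, with the expected proportions and their covariance estimated by resampling from a reference population. The function names, the number of bins, the resampling scheme, and the square-grid stand-in for ZIP-code centroids are all illustrative assumptions.

```python
import numpy as np

def interpoint_distances(points):
    """Pairwise Euclidean distances (each pair once) for an (n, 2) array."""
    diff = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    upper = np.triu_indices(len(points), k=1)
    return dist[upper]

def bin_proportions(points, interior_edges):
    """Proportion of interpoint distances falling in each distance bin."""
    d = interpoint_distances(points)
    idx = np.searchsorted(interior_edges, d, side="right")
    counts = np.bincount(idx, minlength=len(interior_edges) + 1)
    return counts / d.size

def m_statistic(observed, reference, n_bins=10, n_resamples=200, seed=None):
    """Quadratic-form distance between observed and expected bin proportions.

    Assumption: the null distribution is approximated by drawing samples of
    the same size as `observed` from `reference` without replacement, so
    `reference` must contain at least as many points as `observed`.
    """
    rng = np.random.default_rng(seed)
    # Bins chosen to be roughly equiprobable under the reference population.
    ref_d = interpoint_distances(reference)
    interior_edges = np.quantile(ref_d, np.linspace(0, 1, n_bins + 1)[1:-1])
    n = len(observed)
    samples = np.array([
        bin_proportions(
            reference[rng.choice(len(reference), size=n, replace=False)],
            interior_edges,
        )
        for _ in range(n_resamples)
    ])
    expected = samples.mean(axis=0)
    cov = np.cov(samples, rowvar=False)
    delta = bin_proportions(observed, interior_edges) - expected
    # Proportions sum to one, so the covariance is singular: use a pseudo-inverse.
    return float(delta @ np.linalg.pinv(cov) @ delta)

def aggregate_to_grid(points, cell_size):
    """Replace each point by the centre of its square grid cell
    (a hypothetical stand-in for ZIP-code or census-tract centroids)."""
    return (np.floor(points / cell_size) + 0.5) * cell_size
```

Under these assumptions, power at a given aggregation level could be estimated by applying `aggregate_to_grid` to both the reference and the simulated disturbed samples before calling `m_statistic`, and counting how often the statistic exceeds a threshold calibrated on undisturbed samples.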
Pages: 847-854
Page count: 8
Related papers
40 in total
  • [1] Armstrong MP, 1999, STAT MED, V18, P497, DOI 10.1002/(SICI)1097-0258(19990315)18:5<497::AID-SIM45>3.3.CO;2-R
  • [3] The interpoint distance distribution as a descriptor of point patterns, with an application to spatial disease clustering
    Bonetti, M
    Pagano, M
    [J]. STATISTICS IN MEDICINE, 2005, 24 (05) : 753 - 773
  • [4] Using software agents to preserve individual health data confidentiality in micro-scale geographical analyses
    Boulos, MNK
    Cai, Q
    Padget, JA
    Rushton, G
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2006, 39 (02) : 160 - 170
  • [5] CASSA C, 2006, ADV DIS SURVEILLANCE, V1, P10
  • [6] Cox LH, 1996, STAT MED, V15, P1895, DOI 10.1002/(SICI)1097-0258(19960915)15:17<1895::AID-SIM401>3.0.CO;2-W
  • [8] CUZICK J, 1990, J ROY STAT SOC B MET, V52, P73
  • [9] Arbitrarily shaped multiple spatial cluster detection for case event data
    Demattei, Christophe
    Molinari, Nicolas
    Daures, Jean-Pierre
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (08) : 3931 - 3945
  • [10] A simulated annealing strategy for the detection of arbitrarily shaped spatial clusters
    Duczmal, L
    Assunção, R
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 45 (02) : 269 - 286