Constrained spanning tree algorithms for irregularly-shaped spatial clustering

被引:37
作者
Costa, Marcelo Azevedo [1 ]
Assuncao, Renato Martins [1 ]
Kulldorff, Martin [2 ,3 ]
机构
[1] Univ Fed Minas Gerais, Dept Stat, BR-31270901 Belo Horizonte, MG, Brazil
[2] Harvard Univ, Sch Med, Dept Populat Med, Cambridge, MA 02138 USA
[3] Harvard Pilgrim Hlth Care Inst, Wellesley, MA USA
关键词
Spatial statistics; Scan statistics; Cluster detection; SCAN; TESTS;
D O I
10.1016/j.csda.2011.11.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Spatial clustering methodologies that are capable of detecting and delineating irregular clusters can provide information about the geographical spread of various diseases under surveillance. This paper proposes and compares three spatial scan statistics designed to detect clusters with irregular shapes. The proposed methods use geographical boundary information to construct a graph in which a cluster growing process is performed based on likelihood function maximization. Constraints on cluster shape are imposed through early stopping, a double connection requirement and a maximum linkage criteria. The methods are evaluated using simulated data sets with either circular or irregular clusters, and compared to the circular and elliptic scan statistics. Results show that for circular clusters, the standard circular scan statistic is optimal, as expected. The double connection, elliptic maximum linkage scan statistics also achieve good results. For irregularly-shaped clusters, the elliptic, maximum linkage and double connected scan statistics are optimal for different cluster models and by different evaluation criteria, but the circular scan statistic also performs well. If the emphasis is on statistical power for cluster detection, the simple circular scan statistic is attractive across the board choice. If the emphasis is on the accurate determination of cluster size, shape and boundaries, the double connected, maximum linkage and elliptical scan statistics are often more suitable choices. All methods perform well though, with the exception of the unrestricted dynamic minimum spanning tree scan statistic and the early stopping scan statistic, which we do not recommend. (c) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1771 / 1783
页数:13
相关论文
共 13 条
[1]   Fast detection of arbitrarily shaped disease clusters [J].
Assunçao, R ;
Costa, M ;
Tavares, A ;
Ferreira, S .
STATISTICS IN MEDICINE, 2006, 25 (05) :723-742
[2]   THE DETECTION OF CLUSTERS IN RARE DISEASES [J].
BESAG, J ;
NEWELL, J .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1991, 154 :143-155
[3]   A fair comparison between the spatial scan and the Besag-Newell Disease clustering tests [J].
Costa, MA ;
Assunçao, RM .
ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2005, 12 (03) :301-319
[4]   Evaluation of spatial scan statistics for irregularly shaped clusters [J].
Duczmal, L ;
Kulldorff, M ;
Huang, L .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2006, 15 (02) :428-442
[5]   A simulated annealing strategy for the detection of arbitrarily shaped spatial clusters [J].
Duczmal, L ;
Assunçao, R .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 45 (02) :269-286
[6]  
Duczmal L, 2006, P 8 BRAZ S GEOINF
[7]   MODIFIED RANDOMIZATION TESTS FOR NONPARAMETRIC HYPOTHESES [J].
DWASS, M .
ANNALS OF MATHEMATICAL STATISTICS, 1957, 28 (01) :181-187
[8]   A spatial scan statistic [J].
Kulldorff, M .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1997, 26 (06) :1481-1496
[9]   Power comparisons for disease clustering tests [J].
Kulldorff, M ;
Tango, T ;
Park, PJ .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 42 (04) :665-684
[10]   An elliptic spatial scan statistic [J].
Kulldorff, Martin ;
Huang, Lan ;
Pickle, Linda ;
Duczmal, Luiz .
STATISTICS IN MEDICINE, 2006, 25 (22) :3929-3943