Detecting influential observations in a model-based cluster analysis

被引:2
作者
Bruckers, Liesbeth [1 ]
Molenberghs, Geert [1 ,2 ]
Verbeke, Geert [1 ,2 ]
Geys, Helena [3 ]
机构
[1] Univ Hasselt, I BioStat, Hasselt, Belgium
[2] Univ Leuven, I Biostat, Leuven, Belgium
[3] Janssen Pharmaceut, Beerse, Belgium
关键词
Local influence; model-based clustering; finite mixture model; LOCAL INFLUENCE; DIAGNOSTICS; POINTS;
D O I
10.1177/0962280216634112
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Finite mixture models have been used to model population heterogeneity and to relax distributional assumptions. These models are also convenient tools for clustering and classification of complex data such as, for example, repeated-measurements data. The performance of model-based clustering algorithms is sensitive to influential and outlying observations. Methods for identifying outliers in a finite mixture model have been described in the literature. Approaches to identify influential observations are less common. In this paper, we apply local-influence diagnostics to a finite mixture model with known number of components. The methodology is illustrated on real-life data.
引用
收藏
页码:521 / 540
页数:20
相关论文
共 32 条
[1]  
Agresti A., 2003, CATEGORICAL DATA ANA
[2]  
[Anonymous], 1993, An introduction to the bootstrap
[3]   DIAGNOSTICS FOR MIXED-MODEL ANALYSIS OF VARIANCE [J].
BECKMAN, RJ ;
NACHTSHEIM, CJ ;
COOK, RD .
TECHNOMETRICS, 1987, 29 (04) :413-426
[4]  
Cerioli A, 1998, STUDIES CLASSIFICATI, P15
[5]  
Cheng R, 1996, J CLASSIF, V13, P315
[6]  
Cook R., 1982, Residuals and Influence in Regression
[7]  
COOK RD, 1986, J ROY STAT SOC B MET, V48, P133
[8]  
Cuesta-Albertos JA, 1997, ANN STAT, V25, P553
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]  
Hosmer W., 2000, Applied Logistic Regression, VSecond