The BACON-EEM algorithm for multivariate outlier detection in incomplete survey data

被引:0
|
作者
Beguin, Cedric [1 ]
Hulliger, Beat [2 ]
机构
[1] Univ Neuchatel, CH-2010 Neuchatel, Switzerland
[2] Univ Appl Sci NW Switzerland, CH-4600 Olten, Switzerland
关键词
forward search method; outlier detection; multivariate data; missing value; sampling; robustness; E-M algorithm;
D O I
暂无
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
With complete multivariate data the BACON algorithm (Billor, Hadi and Vellemann 2000) yields a robust estimate of the covariance matrix. The corresponding Mahalanobis distance may be used for multivariate outlier detection. When items are missing the EM algorithm is a convenient way to estimate the covariance matrix at each iteration step of the BACON algorithm. In finite population sampling the EM algorithm must be enhanced to estimate the covariance matrix of the population rather than of the sample. A version of the EM algorithm for survey data following a multivariate normal model, the EEM algorithm (Estimated Expectation Maximization), is proposed. The combination of the two algorithms, the BACON-EEM algorithm, is applied to two datasets and compared with alternative methods.
引用
收藏
页码:91 / 103
页数:13
相关论文
共 50 条