Multivariate outlier detection in incomplete survey data:: the epidemic algorithm and transformed rank correlations

被引:8
|
作者
Béguin, C
Hulliger, B [1 ]
机构
[1] Swiss Fed Stat Off, Stat Methods Unit, CH-2010 Neuchatel, Switzerland
[2] Univ Neuchatel, CH-2000 Neuchatel, Switzerland
关键词
data depth; missing value; multivariate data; outlier detection; robustness; sampling weight;
D O I
10.1046/j.1467-985X.2003.00753.x
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
As a part of the EUREDIT project new methods to detect multivariate outliers in incomplete survey data have been developed. These methods are the first to work with sampling weights and to be able to cope with missing values. Two of these methods are presented here. The epidemic algorithm simulates the propagation of a disease through a population and uses extreme infection times to find outlying observations. Transformed rank correlations are robust estimates of the centre and the scatter of the data. They use a geometric transformation that is based on the rank correlation matrix. The estimates are used to define a Mahalanobis distance that reveals outliers. The two methods are applied to a small data set and to one of the evaluation data sets of the EUREDIT project.
引用
收藏
页码:275 / 294
页数:20
相关论文
共 50 条
  • [21] Robust Outlier Detection Method For Multivariate Spatial Data
    Sweta Shukla
    S. Lalitha
    National Academy Science Letters, 2021, 44 : 551 - 554
  • [22] Treatment of Multivariate Outliers in Incomplete Business Survey Data
    Bill, Marc
    Hulliger, Beat
    AUSTRIAN JOURNAL OF STATISTICS, 2016, 45 (01) : 3 - 23
  • [23] A Survey of Outlier Detection Algorithms for Data Streams
    Tamboli, Jinita
    Shukla, Madhu
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3535 - 3540
  • [24] Outlier detection over data streams: Survey
    Brahmi Z.
    Souiden I.
    International Journal of Business Intelligence and Data Mining, 2021, 19 (04) : 481 - 507
  • [25] Outlier detection for multivariate time series: A functional data approach
    Lopez-Oriona, Angel
    Vilar, Jose A.
    KNOWLEDGE-BASED SYSTEMS, 2021, 233
  • [26] Shape-based outlier detection in multivariate functional data
    Lejeune, Clement
    Mothe, Josiane
    Soubki, Adil
    Teste, Olivier
    KNOWLEDGE-BASED SYSTEMS, 2020, 198
  • [27] Comparison of local outlier detection techniques in spatial multivariate data
    Ernst, Marie
    Haesbroeck, Gentiane
    DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (02) : 371 - 399
  • [28] Multivariate Outlier Detection for Forest Fire Data Aggregation Accuracy
    Alkhatib, Ahmad A. A.
    Abed-Al, Qusai
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (02): : 1071 - 1087
  • [29] Pointwise data depth for univariate and multivariate functional outlier detection
    Jimenez-Varon, Cristian F.
    Harrou, Fouzi
    Sun, Ying
    ENVIRONMETRICS, 2024, 35 (05)
  • [30] Spatial Functional Outlier Detection in Multivariate Spatial Functional Data
    Ali, Nur Ftihah Mohd
    Yunus, Rossita Mohamad
    Mohamed, Ibrahim
    Othman, Faridah
    SAINS MALAYSIANA, 2024, 53 (06): : 1463 - 1476