A Mixture Model-Based Combination Approach for Outlier Detection

被引:4
|
作者
Bouguessa, Mohamed [1 ]
机构
[1] Univ Quebec, Dept Informat, Montreal, PQ H3C 3P8, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Outlier detection; ensemble construction; unsupervised learning; mixture model; multivariate beta;
D O I
10.1142/S0218213014600215
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an approach that combines different outlier detection algorithms in order to gain an improved effectiveness. To this end, we first estimate an outlier score vector for each data object. Each element of the estimated vectors corresponds to an outlier score produced by a specific outlier detection algorithm. We then use the multivariate beta mixture model to cluster the outlier score vectors into several components so that the component that corresponds to the outliers can be identified. A notable feature of the proposed approach is the automatic identification of outliers, while most existing methods return only a ranked list of points, expecting the outliers to come first; or require empirical threshold estimation to identify outliers. Experimental results, on both synthetic and real data sets, show that our approach substantially enhances the accuracy of outlier base detectors considered in the combination and overcome their drawbacks.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] A mixture model-based nonparametric approach to estimating a count distribution
    Chee, Chew-Seng
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 109 : 34 - 44
  • [22] Robust mixture model-based clustering with genetic algorithm approach
    Nguyen Duc Thang
    Chen, Lihui
    Chan, Chee Keong
    INTELLIGENT DATA ANALYSIS, 2011, 15 (03) : 357 - 373
  • [23] Outlier Identification in Model-Based Cluster Analysis
    Evans, Katie
    Love, Tanzy
    Thurston, Sally W.
    JOURNAL OF CLASSIFICATION, 2015, 32 (01) : 63 - 84
  • [24] Outlier Identification in Model-Based Cluster Analysis
    Katie Evans
    Tanzy Love
    Sally W. Thurston
    Journal of Classification, 2015, 32 : 63 - 84
  • [25] A Model-based Approach to Hyperspectral Change Detection
    Meola, Joseph
    Eismann, Michael T.
    Moses, Randolph L.
    Ash, Joshua N.
    ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY XVI, 2010, 7695
  • [26] Mixture Model-Based Classification
    Zhao, Yunpeng
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 881 - 882
  • [27] An Abnormal Login Detection Method Based on Local Outlier Factor and Gaussian Mixture Model
    Guo, Wei
    He, Yue
    Chen, He-Xiong
    Hang, Fei-Lu
    Li, Yun-Jie
    International Journal of Network Security, 2023, 25 (02) : 297 - 305
  • [28] A model-based approach for assessing in vivo combination therapy interactions
    Lopez, AM
    Pegram, MD
    Slamon, DJ
    Landaw, EM
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (23) : 13023 - 13028
  • [29] A Dirichlet Multinomial Mixture Model-based Approach for Short Text Clustering
    Yin, Jianhua
    Wang, Jianyong
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 233 - 242
  • [30] Flexible and efficient model-based congestion detection approach
    Abdelhafid, Zeroual
    Harrou, Fouzi
    Sun, Ying
    2018 6TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING & INFORMATION TECHNOLOGY (CEIT), 2018,