A Mixture Model-Based Combination Approach for Outlier Detection

Cited by: 4
Author
Bouguessa, Mohamed [1]
Affiliation
[1] Univ Quebec, Dept Informat, Montreal, PQ H3C 3P8, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Outlier detection; ensemble construction; unsupervised learning; mixture model; multivariate beta;
DOI
10.1142/S0218213014600215
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose an approach that combines different outlier detection algorithms in order to improve detection effectiveness. To this end, we first estimate an outlier score vector for each data object. Each element of an estimated vector is the outlier score produced by a specific outlier detection algorithm. We then use the multivariate beta mixture model to cluster the outlier score vectors into several components so that the component that corresponds to the outliers can be identified. A notable feature of the proposed approach is the automatic identification of outliers, whereas most existing methods either return only a ranked list of points, expecting the outliers to come first, or require empirical threshold estimation to identify outliers. Experimental results on both synthetic and real data sets show that our approach substantially enhances the accuracy of the base outlier detectors considered in the combination and overcomes their drawbacks.
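The pipeline described in the abstract (one score per detector, score vectors clustered by a beta mixture, outlier component read off automatically) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two toy detectors (`knn_score`, `centroid_score`), the min-max rescaling, the restriction to two components, the use of independent beta densities per score dimension, and the weighted method-of-moments update that stands in for whatever estimation procedure the paper actually uses.

```python
import math
import random

def normalize(scores, eps=1e-3):
    """Rescale raw scores into the open interval (0, 1), as a beta density needs."""
    lo, hi = min(scores), max(scores)
    span = (hi - lo) or 1.0
    return [min(1 - eps, max(eps, (s - lo) / span)) for s in scores]

def knn_score(points, k=3):
    """Toy detector 1 (assumption): distance to the k-th nearest neighbour."""
    return [sorted(math.dist(p, q) for q in points)[k] for p in points]

def centroid_score(points):
    """Toy detector 2 (assumption): distance to the data centroid."""
    dim = len(points[0])
    c = [sum(p[j] for p in points) / len(points) for j in range(dim)]
    return [math.dist(p, c) for p in points]

def log_beta_pdf(x, a, b):
    return (math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
            + (a - 1) * math.log(x) + (b - 1) * math.log(1 - x))

def fit_beta(xs, ws):
    """Weighted method-of-moments fit; shape parameters are floored at 1."""
    total = sum(ws)
    m = sum(w * x for w, x in zip(ws, xs)) / total
    v = sum(w * (x - m) ** 2 for w, x in zip(ws, xs)) / total
    v = min(max(v, 1e-4), 0.99 * m * (1 - m))  # keep the moment equations solvable
    c = m * (1 - m) / v - 1
    return max(1.0, m * c), max(1.0, (1 - m) * c)

def beta_mixture(vectors, n_iter=50):
    """Two-component mixture of per-dimension independent betas, EM-style loop."""
    n, d = len(vectors), len(vectors[0])
    mean_score = [sum(v) / d for v in vectors]
    cut = sorted(mean_score)[int(0.9 * n)]
    # Soft initialisation: the top 10% of mean scores lean towards component 1.
    resp = [[0.1, 0.9] if s >= cut else [0.9, 0.1] for s in mean_score]
    for _ in range(n_iter):
        weights, params = [], []
        for k in range(2):  # M-step (moment matching, not maximum likelihood)
            ws = [max(r[k], 1e-12) for r in resp]
            weights.append(max(sum(ws) / n, 1e-9))
            params.append([fit_beta([v[j] for v in vectors], ws) for j in range(d)])
        for i, v in enumerate(vectors):  # E-step, computed in log space
            logp = [math.log(weights[k])
                    + sum(log_beta_pdf(v[j], *params[k][j]) for j in range(d))
                    for k in range(2)]
            top = max(logp)
            ex = [math.exp(lp - top) for lp in logp]
            resp[i] = [e / sum(ex) for e in ex]
    return resp, params

# Synthetic data: 60 Gaussian inliers plus 3 planted outliers (indices 60, 61, 62).
rng = random.Random(0)
points = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(60)]
points += [(8.0, 8.0), (-9.0, 7.0), (10.0, -8.0)]

# One outlier score vector per object: one element per detector.
columns = [normalize(knn_score(points)), normalize(centroid_score(points))]
vectors = list(zip(*columns))

resp, params = beta_mixture(vectors)

# The outlier component is the one whose fitted betas have the higher mean score.
comp_mean = [sum(a / (a + b) for a, b in params[k]) for k in range(2)]
outlier_comp = comp_mean.index(max(comp_mean))
flagged = {i for i, r in enumerate(resp) if r[outlier_comp] > 0.5}
print(sorted(flagged))
```

In this sketch no threshold on the scores is chosen by hand: an object is flagged when its posterior probability under the higher-mean component exceeds 0.5, which mirrors the automatic identification the abstract emphasizes. Flooring the shape parameters at 1 is a simplification chosen here to keep the fitted densities bounded near 0 and 1; it is not a claim about the paper's model.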
Pages: 21
Related papers
50 in total
  • [1] A Model-Based Approach for Outlier Detection in Sensor Networks
    Ding, Min
    Liang, Qilian
    Cheng, Xiuzhen
    Al-Rodhaan, Mznah
    Al-Dhelaan, Abdullah
    Huang, Scott C. -H.
    Chen, Dechang
    AD HOC & SENSOR WIRELESS NETWORKS, 2011, 12 (3-4) : 275 - 293
  • [2] A Model-based Approach for Text Clustering with Outlier Detection
    Yin, Jianhua
    Wang, Jianyong
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 625 - 636
  • [3] Model-based clustering and outlier detection with missing data
    Tong, Hung
    Tortora, Cristina
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (01) : 5 - 30
  • [4] Model-based clustering and outlier detection with missing data
    Hung Tong
    Cristina Tortora
    Advances in Data Analysis and Classification, 2022, 16 : 5 - 30
  • [5] Model-Based Outlier Detection System with Statistical Preprocessing
    Singh, Asir Antony Gnana
    Leavline, Jebalamar
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2016, 15 (01) : 789 - 801
  • [6] Cloud model-based outlier detection algorithm for categorical data
    Lei, Dajiang
    Zhang, Liping
    Zhang, Lisheng
    Lei, D. (leidj@cqupt.edu.cn), 1600, Science and Engineering Research Support Society, 20 Virginia Court, Sandy Bay, Tasmania, Australia (06): : 199 - 214
  • [7] Missing Values and Directional Outlier Detection in Model-Based Clustering
    Tong, Hung
    Tortora, Cristina
    JOURNAL OF CLASSIFICATION, 2024, 41 (03) : 480 - 513
  • [8] Model-based Outlier Detection for Object-Relational Data
    Riahi, Fatemeh
    Schulte, Oliver
    2015 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2015, : 1590 - 1598
  • [9] Outlier detection in traffic data based on the Dirichlet process mixture model
    Ngan, Henry Y. T.
    Yung, Nelson H. C.
    Yeh, Anthony G. O.
    IET INTELLIGENT TRANSPORT SYSTEMS, 2015, 9 (07) : 773 - 781
  • [10] Model-based outlier detection method for time series of process industry
    Su, Weixing
    Zhu, Yunlong
    Hu, Kunyuan
    Liu, Fang
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2012, 33 (09): : 2080 - 2087