Unsupervised outlier detection using random subspace and subsampling ensembles of Dirichlet process mixtures

被引:2
作者
Kim, Dongwook [1 ]
Park, Juyeon [2 ]
Chung, Hee Cheol [3 ,4 ]
Jeong, Seonghyun [5 ,6 ]
机构
[1] Nexon, Seongnam Si 13487, Gyeonggi Do, South Korea
[2] Danggeun Market, Seoul 06611, South Korea
[3] Univ N Carolina, Dept Math & Stat, Charlotte, NC 28223 USA
[4] Univ N Carolina, Sch Data Sci, Charlotte, NC 28223 USA
[5] Yonsei Univ, Dept Stat & Data Sci, Seoul 03722, South Korea
[6] Yonsei Univ, Dept Appl Stat, Seoul 03722, South Korea
基金
新加坡国家研究基金会;
关键词
Anomaly detection; Gaussian mixture models; Outlier ensembles; Random projection; Variational inference; ANOMALY DETECTION; PROJECTION; MODEL;
D O I
10.1016/j.patcog.2024.110846
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Probabilistic mixture models are recognized as effective tools for unsupervised outlier detection owing their interpretability and global characteristics. Among these, Dirichlet process mixture models stand strong alternative to conventional finite mixture models for both clustering and outlier detection tasks. finite mixture models, Dirichlet process mixtures are infinite mixture models that automatically determine number of mixture components based on the data. Despite their advantages, the adoption of Dirichlet mixture models for unsupervised outlier detection has been limited by challenges related to computational inefficiency and sensitivity to outliers in the construction of outlier detectors. Additionally, Dirichlet Gaussian mixtures struggle to effectively model non-Gaussian data with discrete or binary features. To these challenges, we propose a novel outlier detection method that utilizes ensembles of Dirichlet Gaussian mixtures. This unsupervised algorithm employs random subspace and subsampling ensembles ensure efficient computation and improve the robustness of the outlier detector. The ensemble approach improves the suitability of the proposed method for detecting outliers in non-Gaussian data. Furthermore, our method uses variational inference for Dirichlet process mixtures, which ensures both efficient and computation. Empirical analyses using benchmark datasets demonstrate that our method outperforms approaches in unsupervised outlier detection.
引用
收藏
页数:14
相关论文
共 29 条
  • [1] Bayesian Outlier Detection with Dirichlet Process Mixtures
    Shotwell, Matthew S.
    Slate, Elizabeth H.
    BAYESIAN ANALYSIS, 2011, 6 (04): : 665 - 690
  • [2] On-line unsupervised outlier detection using finite mixtures with discounting learning algorithms
    Yamanishi, K
    Takeuchi, JI
    Williams, G
    Milne, P
    DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (03) : 275 - 300
  • [3] On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms
    Kenji Yamanishi
    Jun-ichi Takeuchi
    Graham Williams
    Peter Milne
    Data Mining and Knowledge Discovery, 2004, 8 : 275 - 300
  • [4] Unsupervised Time Series Outlier Detection with Diversity-Driven Convolutional Ensembles
    Campos, David
    Kieu, Tung
    Guo, Chenjuan
    Huang, Feiteng
    Zheng, Kai
    Yang, Bin
    Jensen, Christian S.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 15 (03): : 611 - 623
  • [5] Automatic optimization of outlier detection ensembles using a limited number of outlier examples
    Reunanen, Niko
    Raty, Tomi
    Lintonen, Timo
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 10 (04) : 377 - 394
  • [6] Unsupervised Outlier Detection Using Memory and Contrastive Learning
    Huyan, Ning
    Quan, Dou
    Zhang, Xiangrong
    Liang, Xuefeng
    Chanussot, Jocelyn
    Jiao, Licheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6440 - 6454
  • [7] Anomaly Detection for Insider Threats Using Unsupervised Ensembles
    Le, Duc C.
    Zincir-Heywood, Nur
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (02): : 1152 - 1164
  • [8] ECOD: Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions
    Li, Zheng
    Zhao, Yue
    Hu, Xiyang
    Botta, Nicola
    Ionescu, Cezar
    Chen, George H.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12181 - 12193
  • [9] Unsupervised Outlier Detection in IOT Using Deep VAE
    Gouda, Walaa
    Tahir, Sidra
    Alanazi, Saad
    Almufareh, Maram
    Alwakid, Ghadah
    SENSORS, 2022, 22 (17)
  • [10] Unsupervised Outlier Detection based on Random Projection Outlyingness with Local Score Weighting
    Tamamori, Akira
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (07) : 1244 - 1248