Unsupervised Outlier Detection: A Meta-Learning Algorithm Based on Feature Selection

被引:7
|
作者
Papastefanopoulos, Vasilis [1 ]
Linardatos, Pantelis [1 ]
Kotsiantis, Sotiris [1 ]
机构
[1] Univ Patras, Dept Math, Patras 26504, Greece
关键词
machine learning; data science; unsupervised outlier; detection; meta-learning; feature selection; ensemble-learning;
D O I
10.3390/electronics10182236
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outlier detection refers to the problem of the identification and, where appropriate, the elimination of anomalous observations from data. Such anomalous observations can emerge due to a variety of reasons, including human or mechanical errors, fraudulent behaviour as well as environmental or systematic changes, occurring either naturally or purposefully. The accurate and timely detection of deviant observations allows for the early identification of potentially extensive problems, such as fraud or system failures, before they escalate. Several unsupervised outlier detection methods have been developed; however, there is no single best algorithm or family of algorithms, as typically each relies on a measure of 'outlierness' such as density or distance, ignoring other measures. To add to that, in an unsupervised setting, the absence of ground-truth labels makes finding a single best algorithm an impossible feat even for a single given dataset. In this study, a new meta-learning algorithm for unsupervised outlier detection is introduced in order to mitigate this problem. The proposed algorithm, in a fully unsupervised manner, attempts not only to combine the best of many worlds from the existing techniques through ensemble voting but also mitigate any undesired shortcomings by employing an unsupervised feature selection strategy in order to identify the most informative algorithms for a given dataset. The proposed methodology was evaluated extensively through experimentation, where it was benchmarked and compared against a wide range of commonly-used techniques for outlier detection. Results obtained using a variety of widely accepted datasets demonstrated its usefulness and its state-of-the-art results as it topped the Friedman ranking test for both the area under receiver operating characteristic (ROC) curve and precision metrics when averaged over five independent trials.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Feature Selection Algorithm Ensembling Based on Meta-Learning
    Tanfilev, Igor
    Filchenkov, Andrey
    Smetannikov, Ivan
    2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,
  • [2] Recommendation method for avionics feature selection algorithm based on meta-learning
    Li R.
    Xu A.
    Sun W.
    Wang S.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (07): : 2011 - 2020
  • [3] Summary of algorithm selection problem based on meta-learning
    Zeng, Z.-L. (zzljxnu@163.com), 1600, Northeast University (29): : 961 - 968
  • [4] Meta-learning based industrial intelligence of feature nearest algorithm selection framework for classification problems
    Li, Li
    Wang, Yong
    Xu, Ying
    Lin, Kuo-Yi
    JOURNAL OF MANUFACTURING SYSTEMS, 2022, 62 : 767 - 776
  • [5] Unsupervised Feature Selection for Outlier Detection on Streaming Data to Enhance Network Security
    Heigl, Michael
    Weigelt, Enrico
    Fiala, Dalibor
    Schramm, Martin
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [6] A Survey of Personalized Recommendation Algorithm Selection Based on Meta-learning
    Ren, Yi
    Chi, Cuirong
    Zhang Jintao
    CYBER SECURITY INTELLIGENCE AND ANALYTICS, 2020, 928 : 1383 - 1388
  • [7] A review on preprocessing algorithm selection with meta-learning
    Pedro B. Pio
    Adriano Rivolli
    André C. P. L. F. de Carvalho
    Luís P. F. Garcia
    Knowledge and Information Systems, 2024, 66 (1) : 1 - 28
  • [8] A review on preprocessing algorithm selection with meta-learning
    Pio, Pedro B.
    Rivolli, Adriano
    de Carvalho, Andre C. P. L. F.
    Garcia, Luis P. F.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (01) : 1 - 28
  • [9] Deepfake detection using deep feature stacking and meta-learning
    Naskar, Gourab
    Mohiuddin, Sk
    Malakar, Samir
    Cuevas, Erik
    Sarkar, Ram
    HELIYON, 2024, 10 (04)
  • [10] AutoFe-Sel: A Meta-learning based methodology for Recommending Feature Subset Selection Algorithms
    Khan, Irfan
    Zhang, Xianchao
    Ayyasamy, Ramesh Kumar
    Ali, Rahman
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (07): : 1773 - 1793