Internal Evaluation of Unsupervised Outlier Detection

被引:21
|
作者
Marques, Henrique O. [1 ]
Campello, Ricardo J. G. B. [2 ]
Sander, Jorg [3 ]
Zimek, Arthur [4 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci ICMC, BR-13566590 Sao Carlos, SP, Brazil
[2] Univ Newcastle, Sch Math & Phys Sci MAPS, Univ Dr, Callaghan, NSW 2308, Australia
[3] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
[4] Univ Southern Denmark, Dept Math & Comp Sci IMADA, Campusvej 55, DK-5230 Odense, Denmark
基金
瑞典研究理事会; 加拿大自然科学与工程研究理事会; 巴西圣保罗研究基金会;
关键词
Outlier detection; unsupervised evaluation; validation; DISTANCE-BASED OUTLIERS;
D O I
10.1145/3394053
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although there is a large and growing literature that tackles the unsupervised outlier detection problem, the unsupervised evaluation of outlier detection results is still virtually untouched in the literature. The so-called internal evaluation, based solely on the data and the assessed solutions themselves, is required if one wants to statistically validate (in absolute terms) or just compare (in relative terms) the solutions provided by different algorithms or by different parameterizations of a given algorithm in the absence of labeled data. However, in contrast to unsupervised cluster analysis, where indexes for internal evaluation and validation of clustering solutions have been conceived and shown to be very useful, in the outlier detection domain, this problem has been notably overlooked. Here we discuss this problem and provide a solution for the internal evaluation of outlier detection results. Specifically, we describe an index called Internal, Relative Evaluation of Outlier Solutions (IREOS) that can evaluate and compare different candidate outlier detection solutions. Initially, the index is designed to evaluate binary solutions only, referred to as top-n outlier detection results. We then extend IREOS to the general case of non-binary solutions, consisting of outlier detection scorings. We also statistically adjust IREOS for chance and extensively evaluate it in several experiments involving different collections of synthetic and real datasets.
引用
收藏
页数:42
相关论文
共 50 条
  • [21] Unsupervised outlier detection in heavy-ion collisions
    Thaprasop, P.
    Zhou, K.
    Steinheimer, J.
    Herold, C.
    PHYSICA SCRIPTA, 2021, 96 (06)
  • [22] Graph autoencoder-based unsupervised outlier detection
    Du, Xusheng
    Yu, Jiong
    Chu, Zheng
    Jin, Lina
    Chen, Jiaying
    INFORMATION SCIENCES, 2022, 608 : 532 - 550
  • [23] Unsupervised Outlier Detection Using Memory and Contrastive Learning
    Huyan, Ning
    Quan, Dou
    Zhang, Xiangrong
    Liang, Xuefeng
    Chanussot, Jocelyn
    Jiao, Licheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6440 - 6454
  • [24] Unsupervised Outlier Detection Mechanism for Tea Traceability Data
    Yang, Honggang
    Li, Shaowen
    Tu, Lijing
    Ma, Rongrong
    Chen, Yin
    IEEE ACCESS, 2022, 10 : 94818 - 94831
  • [25] An outlier ensemble for unsupervised anomaly detection in honeypots data
    Boukela, Lynda
    Zhang, Gongxuan
    Bouzefrane, Samia
    Zhou, Junlong
    INTELLIGENT DATA ANALYSIS, 2020, 24 (04) : 743 - 758
  • [26] Unsupervised Outlier Detection via Transformation Invariant Autoencoder
    Cheng, Zhen
    Zhu, En
    Wang, Siqi
    Zhang, Pei
    Li, Wang
    IEEE ACCESS, 2021, 9 : 43991 - 44002
  • [27] Benchmarking Unsupervised Outlier Detection with Realistic Synthetic Data
    Steinbuss, Georg
    Boehm, Klemens
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (04)
  • [28] UNSUPERVISED ANOMALY DETECTION FOR TIME SERIES WITH OUTLIER EXPOSURE
    Feng, Jiaming
    Huang, Zheng
    Guo, Jie
    Qiu, Weidong
    33RD INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2021), 2020, : 1 - 12
  • [29] Unsupervised Outlier Detection in IOT Using Deep VAE
    Gouda, Walaa
    Tahir, Sidra
    Alanazi, Saad
    Almufareh, Maram
    Alwakid, Ghadah
    SENSORS, 2022, 22 (17)
  • [30] Generative Adversarial Active Learning for Unsupervised Outlier Detection
    Liu, Yezheng
    Li, Zhe
    Zhou, Chong
    Jiang, Yuanchun
    Sun, Jianshan
    Wang, Meng
    He, Xiangnan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (08) : 1517 - 1528