Cluster-based outlier detection

被引:144
|
作者
Duan, Lian [1 ]
Xu, Lida [2 ,3 ]
Liu, Ying [4 ]
Lee, Jun [5 ]
机构
[1] Univ Iowa, Dept Management Sci, Iowa City, IA 52242 USA
[2] Beijing Jiaotong Univ, Coll Econ & Management, Beijing 100044, Peoples R China
[3] Old Dominion Univ, Dept Informat Technol & Decis Sci, Norfolk, VA 23529 USA
[4] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing, Peoples R China
[5] Chinese Acad Sci, China Sci & Technol Network, Beijing, Peoples R China
关键词
Outlier detection; Cluster-based outlier; LDBSCAN; Local outlier factor; FEATURE SPACE THEORY;
D O I
10.1007/s10479-008-0371-9
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Outlier detection has important applications in the field of data mining, such as fraud detection, customer behavior analysis, and intrusion detection. Outlier detection is the process of detecting the data objects which are grossly different from or inconsistent with the remaining set of data. Outliers are traditionally considered as single points; however, there is a key observation that many abnormal events have both temporal and spatial locality, which might form small clusters that also need to be deemed as outliers. In other words, not only a single point but also a small cluster can probably be an outlier. In this paper, we present a new definition for outliers: cluster-based outlier, which is meaningful and provides importance to the local data behavior, and how to detect outliers by the clustering algorithm LDBSCAN (Duan et al. in Inf. Syst. 32(7):978-986, 2007) which is capable of finding clusters and assigning LOF (Breunig et al. in Proceedings of the 2000 ACM SIG MOD International Conference on Manegement of Data, ACM Press, pp. 93-104, 2000) to single points.
引用
收藏
页码:151 / 168
页数:18
相关论文
共 50 条
  • [41] Intrusion Detection Framework of Cluster-based Wireless Sensor Network
    Sedjelmaci, Hichem
    Senouci, Sidi Mohammed
    Feham, Mohammed
    2012 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2012, : 857 - 861
  • [42] Efficient density and cluster based incremental outlier detection in data streams
    Degirmenci, Ali
    Karal, Omer
    INFORMATION SCIENCES, 2022, 607 : 901 - 920
  • [43] A method for outlier detection based on cluster analysis and visual expert criteria
    Lara, Juan A.
    Lizcano, David
    Ramperez, Victor
    Soriano, Javier
    EXPERT SYSTEMS, 2020, 37 (05)
  • [44] Centralized IDS Based on Misuse Detection for Cluster-Based Wireless Sensors Networks
    Faouzi Hidoussi
    Homero Toral-Cruz
    Djallel Eddine Boubiche
    Kamaljit Lakhtaria
    Albena Mihovska
    Miroslav Voznak
    Wireless Personal Communications, 2015, 85 : 207 - 224
  • [45] Centralized IDS Based on Misuse Detection for Cluster-Based Wireless Sensors Networks
    Hidoussi, Faouzi
    Toral-Cruz, Homero
    Boubiche, Djallel Eddine
    Lakhtaria, Kamaljit
    Mihovska, Albena
    Voznak, Miroslav
    WIRELESS PERSONAL COMMUNICATIONS, 2015, 85 (01) : 207 - 224
  • [46] Cluster-Based Query Expansion
    Kalmanovich, Inna Gelfer
    Kurland, Oren
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 646 - 647
  • [47] Cluster Integration for the Cluster-Based Instance Selection
    Czarnowski, Ireneusz
    Jedrzejowicz, Piotr
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT I, 2010, 6421 : 353 - 362
  • [48] A cluster-based approach to fault detection and recovery in wireless sensor networks
    Venkataraman, Gayathri
    Emnianuel, Sabu
    Thambipilla, Srikanthan
    2007 FOURTH INTERNATIONAL SYMPOSIUM ON WIRELESS COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2007, : 782 - 786
  • [49] Geometrical Cluster-based Scatterer Detection Method with the Movement of Mobile Terminal
    Luan, Fengyu
    Molisch, Andreas F.
    Xiao, Limin
    Tufvesson, Fredrik
    Zhou, Shidong
    2015 IEEE 81ST VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2015,
  • [50] Cluster-based cumulative ensembles
    Ayad, HG
    Kamel, MS
    MULTIPLE CLASSIFIER SYSTEMS, 2005, 3541 : 236 - 245