Cluster-based outlier detection

被引:144
|
作者
Duan, Lian [1 ]
Xu, Lida [2 ,3 ]
Liu, Ying [4 ]
Lee, Jun [5 ]
机构
[1] Univ Iowa, Dept Management Sci, Iowa City, IA 52242 USA
[2] Beijing Jiaotong Univ, Coll Econ & Management, Beijing 100044, Peoples R China
[3] Old Dominion Univ, Dept Informat Technol & Decis Sci, Norfolk, VA 23529 USA
[4] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing, Peoples R China
[5] Chinese Acad Sci, China Sci & Technol Network, Beijing, Peoples R China
关键词
Outlier detection; Cluster-based outlier; LDBSCAN; Local outlier factor; FEATURE SPACE THEORY;
D O I
10.1007/s10479-008-0371-9
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Outlier detection has important applications in the field of data mining, such as fraud detection, customer behavior analysis, and intrusion detection. Outlier detection is the process of detecting the data objects which are grossly different from or inconsistent with the remaining set of data. Outliers are traditionally considered as single points; however, there is a key observation that many abnormal events have both temporal and spatial locality, which might form small clusters that also need to be deemed as outliers. In other words, not only a single point but also a small cluster can probably be an outlier. In this paper, we present a new definition for outliers: cluster-based outlier, which is meaningful and provides importance to the local data behavior, and how to detect outliers by the clustering algorithm LDBSCAN (Duan et al. in Inf. Syst. 32(7):978-986, 2007) which is capable of finding clusters and assigning LOF (Breunig et al. in Proceedings of the 2000 ACM SIG MOD International Conference on Manegement of Data, ACM Press, pp. 93-104, 2000) to single points.
引用
收藏
页码:151 / 168
页数:18
相关论文
共 50 条
  • [31] Cluster-based selection
    Dunbar, JB
    PERSPECTIVES IN DRUG DISCOVERY AND DESIGN, 1997, 7-8 : 51 - 63
  • [32] On complementarity of cluster and outlier detection schemes
    Chen, ZX
    Fu, AWC
    Tang, J
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2003, 2737 : 234 - 243
  • [33] Joint detection and estimation for cooperative communications in cluster-based networks
    Wang, Tsang-Yi
    Pu, Jyun-Wei
    Li, Chih-Peng
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2013, 13 (17): : 1511 - 1519
  • [34] PERFORMANCE EVALUATION OF CLUSTER-BASED HYPERSPECTRAL TARGET DETECTION ALGORITHMS
    Pieper, M.
    Manolakis, D.
    Truslow, E.
    Cooley, T.
    Lipson, S.
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 2669 - 2672
  • [35] Cluster-Based Landmark and Event Detection for Tagged Photo Collections
    Papadopoulos, Symeon
    Zigkolis, Christos
    Kompatsiaris, Yiannis
    Vakali, Athena
    IEEE MULTIMEDIA, 2011, 18 (01) : 52 - 62
  • [36] Data Randomization and Cluster-Based Partitioning for Botnet Intrusion Detection
    Al-Jarrah, Omar Y.
    Alhussein, Omar
    Yoo, Paul D.
    Muhaidat, Sami
    Taha, Kamal
    Kim, Kwangjo
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (08) : 1796 - 1806
  • [37] Fuzzy Cluster-Based Method of Hotspot Detection with Limited Information
    Bandyopadhyaya, Ranja
    Mitra, Sudeshna
    JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2015, 7 (04) : 307 - 323
  • [38] Cluster-based resilient distributed estimation through adversary detection
    Gao, Fengyue
    Yu, Quan
    Bai, Lin
    Wang, Jingchao
    Choi, Jinho
    IET COMMUNICATIONS, 2020, 14 (03) : 451 - 457
  • [39] Intrusion Detection Framework of Cluster-based Wireless Sensor Network
    Sedjelmaci, Hichem
    Senouci, Sidi Mohammed
    Feham, Mohammed
    2012 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2012, : 893 - 897
  • [40] Joint Detection and Estimation for Cooperative Communications in Cluster-Based Networks
    Wang, Tsang-Yi
    Pu, Jyun-Wei
    Li, Chih-Peng
    2009 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-8, 2009, : 4401 - 4405