Multigranulation Relative Entropy-Based Mixed Attribute Outlier Detection in Neighborhood Systems

被引:28
作者
Yuan, Zhong [1 ]
Chen, Hongmei [1 ]
Li, Tianrui [1 ]
Zhang, Xianyong [2 ]
Sang, Binbin [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
[2] Sichuan Normal Univ, Sch Math Sci, Chengdu 610066, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2022年 / 52卷 / 08期
基金
中国国家自然科学基金;
关键词
Anomaly detection; Rough sets; Entropy; Uncertainty; Clustering algorithms; Numerical models; Information entropy; Mixed attribute; multigranulation; neighborhood rough set theory; outlier detection; relative entropy; KNOWLEDGE GRANULATION; INFORMATION ENTROPY; SET; UNCERTAINTY; ALGORITHMS; REDUCTION; NETWORK;
D O I
10.1109/TSMC.2021.3119119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outlier detection is widely used in many fields, such as intrusion detection, credit card fraud detection, medical diagnosis, and so on. Existing outlier detection algorithms are mostly designed for dealing with numeric or categorical attributes. However, data usually exist in the form of mixed attributes in real-world applications. In this article, we propose a novel mixed attribute outlier detection method based on multigranulation relative entropy by employing the neighborhood rough set. First, the neighborhood system is constructed by optimizing the mixed distance metric and the radius of the statistical value. Second, the neighborhood entropy is introduced as an uncertainty measure of data. Furthermore, the three kinds of multigranulation relative entropy-based matrices are defined by three kinds of attribute sequences, and the multigranulation relative entropy-based outlier factor is integrated to indicate the outlier degree of every object. Based on the proposed outlier detection model, the corresponding algorithm is designed. Finally, the proposed algorithm is compared with other nine algorithms through experiments on public data. The experimental results show that the proposed technique is adaptive and effective.
引用
收藏
页码:5175 / 5187
页数:13
相关论文
共 52 条
[1]  
Aggarwal C., 2016, Outlier Analysis, V2nd
[2]  
Aggarwal CC, 2001, SIGMOD RECORD, V30, P37
[3]   Rough Sets, Kernel Set, and Spatiotemporal Outlier Detection [J].
Albanese, Alessia ;
Pal, Sankar K. ;
Petrosino, Alfredo .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) :194-207
[4]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[5]   On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study [J].
Campos, Guilherme O. ;
Zimek, Arthur ;
Sander, Jorg ;
Campello, Ricardo J. G. B. ;
Micenkova, Barbora ;
Schubert, Erich ;
Assent, Ira ;
Houle, Michael E. .
DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (04) :891-927
[6]   Attribute Reduction for Heterogeneous Data Based on the Combination of Classical and Fuzzy Rough Set Models [J].
Chen, Degang ;
Yang, Yanyan .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (05) :1325-1334
[7]   Parallel attribute reduction in dominance-based neighborhood rough set [J].
Chen, Hongmei ;
Li, Tianrui ;
Cai, Yong ;
Luo, Chuan ;
Fujita, Hamido .
INFORMATION SCIENCES, 2016, 373 :351-368
[8]   A Decision-Theoretic Rough Set Approach for Dynamic Data Mining [J].
Chen, Hongmei ;
Li, Tianrui ;
Luo, Chuan ;
Horng, Shi-Jinn ;
Wang, Guoyin .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2015, 23 (06) :1958-1970
[9]   Neighborhood outlier detection [J].
Chen, Yumin ;
Miao, Duoqian ;
Zhang, Hongyun .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (12) :8745-8749
[10]  
Chen YM, 2008, LECT NOTES ARTIF INT, V5306, P283, DOI 10.1007/978-3-540-88425-5_29