Outlier detection for set-valued data based on rough set theory and granular computing

被引:6
|
作者
Lin, Hai [1 ]
Li, Zhaowen [2 ]
机构
[1] Guangxi Univ, Coll Math & Informat Sci, Nanning, Guangxi, Peoples R China
[2] Yulin Normal Univ, Key Lab Complex Syst Optimizat & Big Data Proc, Dept Guangxi Educ, Yulin, Guangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
RST; GrC; SVIS; outlier detection; outlier factor; INFORMATION GRANULATION; ATTRIBUTE REDUCTION; FUZZY; ALGORITHMS;
D O I
10.1080/03081079.2022.2132491
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Outlier detection has been broadly used in industrial practices such as public security and fraud detection, etc. Outlier detection from various perspectives against different backgrounds has been proposed. However, most of outlier detection consider categorical or numerical data. There are few researches on outlier detection for set-valued data, and a set-valued information system (SVIS) is a proper way of tackling the problem of missing values in data sets. This paper investigates outlier detection for set-valued data based on rough set theory (RST) and granular computing (GrC). First, the similarity between two information values in an SVIS is introduced and a variable parameter to control the similarity is given. Then, the tolerance relations on the object set are defined, and based on this tolerance relation, theta-lower and theta-upper approximations in an SVIS are put forward. Next, the outlier factor in an SVIS is presented and applied to various data sets. Finally, outlier detection method for set-valued data based on RST and GrC is proposed, and the corresponding algorithms are designed. Through numerical experiments based on UCI, the designed algorithm is compared with six other detection algorithms. The experimental results show the designed algorithm is arguably the best choice under the context of an SVIS. It is worth mentioning that for a comprehensive comparison, we use two criteria: AUC value and F-1 measure, to show the superiority of the designed algorithm.
引用
收藏
页码:385 / 413
页数:29
相关论文
共 50 条
  • [21] Uncertainty measurement for incomplete set-valued data with application to attribute reduction
    Song, Yan
    Luo, Damei
    Xie, Ningxin
    Li, Zhaowen
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (10) : 3031 - 3069
  • [22] Dynamic variable precision rough set approach for probabilistic set-valued information systems
    Huang, Yanyong
    Li, Tianrui
    Luo, Chuan
    Fujita, Hamido
    Horng, Shi-jinn
    KNOWLEDGE-BASED SYSTEMS, 2017, 122 : 131 - 147
  • [23] Attribute reduction for set-valued data based on prediction label
    Yang, Taoli
    Li, Zhaowen
    Li, Jinjin
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2023, 52 (06) : 745 - 775
  • [24] Feature selection for incomplete set-valued data
    Li, Lulu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (01) : 1217 - 1235
  • [25] Rough Sets, Kernel Set, and Spatiotemporal Outlier Detection
    Albanese, Alessia
    Pal, Sankar K.
    Petrosino, Alfredo
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) : 194 - 207
  • [26] Outlier detection algorithm for categortical data using a granular computing theory
    Li, Min
    2014 IEEE WORKSHOP ON ELECTRONICS, COMPUTER AND APPLICATIONS, 2014, : 457 - 459
  • [27] Fast algorithms for computing rough approximations in set-valued decision systems while updating criteria values
    Luo, Chuan
    Li, Tianrui
    Chen, Hongmei
    Lu, Lixia
    INFORMATION SCIENCES, 2015, 299 : 221 - 242
  • [28] Outlier Detection Based on Granular Computing
    Chen, Yuming
    Miao, Duoqian
    Wang, Ruizhi
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2008, 5306 : 283 - 292
  • [29] A New Weighting Approach Based on Rough Set Theory and Granular Computing for Road Safety Indicator Analysis
    Li, Tianrui
    Ruan, Da
    Shen, Yongjun
    Hermans, Elke
    Wets, Geert
    COMPUTATIONAL INTELLIGENCE, 2016, 32 (04) : 517 - 534
  • [30] A survey on granular computing and its uncertainty measure from the perspective of rough set theory
    Cheng, Yunlong
    Zhao, Fan
    Zhang, Qinghua
    Wang, Guoyin
    GRANULAR COMPUTING, 2021, 6 (01) : 3 - 17