Neighborhood attribute reduction for imbalanced data

被引:5
|
作者
Zhang, Wendong [1 ]
Wang, Xun [1 ]
Yang, Xibei [1 ]
Chen, Xiangjian [1 ]
Wang, Pingxin [1 ,2 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212003, Jiangsu, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Sci, Zhenjiang 212003, Jiangsu, Peoples R China
关键词
Attribute reduction; Granular computing; K-means; Neighborhood decision error rate; Neighborhood classifier; SMOTE; ROUGH SET; FEATURE-SELECTION;
D O I
10.1007/s41066-018-0105-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
From the viewpoint of rough granular computing, neighborhood decision error rate-based attribute reduction aims to improve the classification performance of the neighborhood classifier. Nevertheless, for imbalanced data which can be seen everywhere in real-world applications, such reduction does not pay much attention to the classification results of samples in minority class. Therefore, a new strategy to attribute reduction is proposed, which is embedded with preprocessing of the imbalanced data. First, the widely accepted SMOTE algorithm and K-means algorithm are used for oversampling and undersampling, respectively. Second, the neighborhood decision error rate-based attribute reduction is designed for those updated data. Finally, the neighborhood classifier can be tested with the attributes in reducts. The experimental results on some UCI and PROMISE data sets show that our approach is superior to the traditional attribute reduction based on the evaluations of F-measure and G-mean. Therefore, the contribution of this paper is to construct the attribute reduction strategy for imbalanced data, which can select useful attributes for improving the classification performance in such data.
引用
收藏
页码:301 / 311
页数:11
相关论文
共 50 条
  • [41] Attribute reduction with fuzzy rough set based on multiobjective neighborhood difference algorithm
    Li B.-Y.
    Xiao J.-M.
    Wang X.-H.
    Kongzhi yu Juece/Control and Decision, 2019, 34 (05): : 947 - 955
  • [42] Attribute reduction based on max-decision neighborhood rough set model
    Fan, Xiaodong
    Zhao, Weida
    Wang, Changzhong
    Huang, Yang
    KNOWLEDGE-BASED SYSTEMS, 2018, 151 : 16 - 23
  • [43] Supervised information granulation strategy for attribute reduction
    Liu, Keyu
    Yang, Xibei
    Yu, Hualong
    Fujita, Hamido
    Chen, Xiangjian
    Liu, Dun
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (09) : 2149 - 2163
  • [44] Attribute Reduction for Heterogeneous Data by Hybrid Neighborhood Graph Structure and Neighbor Inconsistent Pair Selection
    Dai, Jianhua
    Liu, Jie
    Ding, Weiping
    Zhang, Chucai
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [45] RETRACTED: A composite entropy-based uncertainty measure guided attribute reduction for imbalanced mixed-type data (Retracted Article)
    Shu, Wenhao
    Li, Shipeng
    Qian, Wenbin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) : 7307 - 7325
  • [46] Attribute Selection and Imbalanced Data: Problems in Software Defect Prediction
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    Seliya, Naeem
    22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 1, 2010,
  • [47] Attribute Reduction of Boolean Matrix in Neighborhood Rough Set Model
    Gao, Yan
    Lv, Changwei
    Wu, Zhengjiang
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 1473 - 1482
  • [48] Attribute Reduction of Boolean Matrix in Neighborhood Rough Set Model
    Yan Gao
    Changwei Lv
    Zhengjiang Wu
    International Journal of Computational Intelligence Systems, 2020, 13 : 1473 - 1482
  • [49] Spectral Clustering with Neighborhood Attribute Reduction Based on Information Entropy
    Jia, Hongjie
    Ding, Shifei
    Ma, Heng
    Xing, Wanqiu
    JOURNAL OF COMPUTERS, 2014, 9 (06) : 1316 - 1324
  • [50] Attribute reduction using self-information uncertainty measures in optimistic neighborhood extreme-granulation rough set
    Qu, Kanglin
    Gao, Pan
    Dai, Qun
    Sun, Yuanhao
    Hua, Xu
    INFORMATION SCIENCES, 2025, 686