Neighborhood attribute reduction for imbalanced data

Cited by: 5
Authors
Zhang, Wendong [1]
Wang, Xun [1]
Yang, Xibei [1]
Chen, Xiangjian [1]
Wang, Pingxin [1, 2]
Affiliations
[1] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212003, Jiangsu, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Sci, Zhenjiang 212003, Jiangsu, Peoples R China
Keywords
Attribute reduction; Granular computing; K-means; Neighborhood decision error rate; Neighborhood classifier; SMOTE; Rough set; Feature selection
DOI
10.1007/s41066-018-0105-6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
From the viewpoint of rough granular computing, attribute reduction based on the neighborhood decision error rate aims to improve the classification performance of the neighborhood classifier. For imbalanced data, however, which are common in real-world applications, such reduction pays little attention to how samples in the minority class are classified. A new attribute reduction strategy is therefore proposed that embeds preprocessing of the imbalanced data. First, the widely used SMOTE and K-means algorithms are applied for oversampling and undersampling, respectively. Second, neighborhood decision error rate-based attribute reduction is performed on the resampled data. Finally, the neighborhood classifier is tested with the attributes in the resulting reducts. Experimental results on several UCI and PROMISE data sets show that the approach outperforms traditional attribute reduction in terms of F-measure and G-mean. The contribution of this paper is thus an attribute reduction strategy for imbalanced data that selects attributes useful for improving classification performance on such data.
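The abstract names the ingredients but gives no formulas, so the following is only a rough sketch of how the reduction and evaluation steps might look. It assumes a Euclidean neighborhood of radius `delta`, a majority vote with a nearest-sample fallback, a leave-one-out error estimate, and a greedy forward search that stops once the selected attributes do no worse than the full attribute set. All function names and the `delta` parameter are hypothetical and are not taken from the paper; the SMOTE and K-means resampling step is left out and would be applied to `X, y` beforehand.

```python
import numpy as np

def neighborhood_error_rate(X, y, attrs, delta=0.2):
    """Leave-one-out error of a majority-vote neighborhood classifier
    restricted to the columns in attrs (assumed Euclidean radius delta)."""
    Z = X[:, attrs]
    diff = Z[:, None, :] - Z[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    np.fill_diagonal(dist, np.inf)  # a sample never votes for itself
    errors = 0
    for i in range(len(y)):
        neighbors = np.flatnonzero(dist[i] <= delta)
        if neighbors.size == 0:
            # Assumed fallback: use the single nearest sample.
            neighbors = np.array([np.argmin(dist[i])])
        labels, counts = np.unique(y[neighbors], return_counts=True)
        errors += int(labels[np.argmax(counts)] != y[i])
    return errors / len(y)

def greedy_reduct(X, y, delta=0.2):
    """Greedy forward selection: add the attribute that lowers the error
    most until the subset is no worse than the full attribute set."""
    remaining = list(range(X.shape[1]))
    full_error = neighborhood_error_rate(X, y, remaining, delta)
    selected = []
    while remaining:
        err, attr = min(
            (neighborhood_error_rate(X, y, selected + [a], delta), a)
            for a in remaining
        )
        selected.append(attr)
        remaining.remove(attr)
        if err <= full_error:
            break
    return selected

def f_measure_and_g_mean(y_true, y_pred, minority=1):
    """F-measure and G-mean with the minority class treated as positive."""
    pos_true, pos_pred = (y_true == minority), (y_pred == minority)
    tp = np.sum(pos_true & pos_pred)
    fn = np.sum(pos_true & ~pos_pred)
    fp = np.sum(~pos_true & pos_pred)
    tn = np.sum(~pos_true & ~pos_pred)
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return float(f_measure), float(np.sqrt(recall * specificity))
```

Both evaluation measures depend on minority-class recall, which is why they are more informative than plain accuracy when one class dominates the data.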
Pages: 301-311
Number of pages: 11