A fuzzy rough set-based undersampling approach for imbalanced data

被引:1
|
作者
Zhang, Xiao [1 ]
He, Zhaoqian [1 ]
Yang, Yanyan [2 ]
机构
[1] Xian Univ Technol, Dept Appl Math, 58 Yanxiang Rd, Xian 710054, Shanxi, Peoples R China
[2] Beijing Jiaotong Univ, Sch Software Engn, Beixiaguan Rd, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Imbalanced data; Fuzzy rough sets; Undersampling; Instance selection; CLASSIFIERS; REDUCTION;
D O I
10.1007/s13042-023-02064-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
How to effectively handle imbalanced data is one of the hot issues in the fields of machine learning and data mining. Undersampling is a popular technique of dealing with imbalanced data. The aim of undersampling is to select an instance subset from the majority class of an imbalanced dataset and then make the dataset balanced. However, the traditional undersampling approaches may lead to the information loss of majority class instances. Therefore, on the basis of the concept of the importance degree of a fuzzy granule, a measure criterion of selecting representative instances from the majority class is presented in this paper by considering the fuzzy relations between the k-nearest neighbors of a majority class instance and the minority class instances. Then, we put forward an undersampling approach based on fuzzy rough sets (USFRS). With the proposed USFRS, the representativeness of the selected majority class instances can be guaranteed and the information loss due to undersampling can be reduced to the utmost extent. Furthermore, USFRS is compared with the relative undersampling methods, and the difference of the experimental results is analyzed by the statistic test. The experimental results demonstrate that USFRS performs well in classification for imbalanced data.
引用
收藏
页码:2799 / 2810
页数:12
相关论文
共 50 条
  • [31] Development of a rough set-based fuzzy neural network for online monitoring of microdrilling
    Yang, ZhaoJun
    Li, Xue
    Jia, QingXiang
    Sun, YanHong
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2009, 41 (3-4): : 219 - 225
  • [32] Intuitionistic Fuzzy Rough Set-Based Granular Structures and Attribute Subset Selection
    Tan, Anhui
    Wu, Wei-Zhi
    Qian, Yuhua
    Liang, Jiye
    Chen, Jinkun
    Li, Jinjin
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (03) : 527 - 539
  • [33] A fuzzy rough set-based feature selection method using representative instances
    Zhang, Xiao
    Mei, Changlin
    Chen, Degang
    Yang, Yanyan
    KNOWLEDGE-BASED SYSTEMS, 2018, 151 : 216 - 229
  • [34] IFSDA: An Intuitionistic Fuzzy Set-based Data Aggregation Approach for Software Maintainability Evaluation
    Nan, Yan
    Zhang, Hengshan
    Zheng, Qinghua
    Wang, Di
    Liu, Ting
    Feng, Boqin
    2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 237 - 242
  • [35] Clustering Based Undersampling for Effective Learning from Imbalanced Data: An Iterative Approach
    Bhattacharya R.
    De R.
    Chakraborty A.
    Sarkar R.
    SN Computer Science, 5 (4)
  • [36] A rough set-based approach to handling spatial uncertainty in binary images
    Sinha, D
    Laplante, P
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, 17 (01) : 97 - 110
  • [37] On the Definability of a Set and Rough Set-Based Rule Generation
    Sakai, Hiroshi
    Wu, Mao
    Yamaguchi, Naoto
    2014 IIAI 3RD INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2014), 2014, : 122 - 125
  • [38] Optimization of Sensor Array in Electronic Nose: A Rough Set-Based Approach
    Bag, Anil Kumar
    Tudu, Bipan
    Roy, Jayashri
    Bhattacharyya, Nabarun
    Bandyopadhyay, Rajib
    IEEE SENSORS JOURNAL, 2011, 11 (11) : 3001 - 3008
  • [39] Evolutionary computation and rough set-based hybrid approach to rule generation
    Shang, L
    Wan, Q
    Zhao, ZH
    Chen, SF
    ADVANCES IN NATURAL COMPUTATION, PT 3, PROCEEDINGS, 2005, 3612 : 855 - 862
  • [40] Rough set-based approach for modeling relationship measures in product planning
    Li, Yan-Lai
    Tang, Jia-Fu
    Chin, Kwai-Sang
    Luo, Xing-Gang
    Han, Yi
    INFORMATION SCIENCES, 2012, 193 : 199 - 217