NEC: A nested equivalence class-based dependency calculation approach for fast feature selection using rough set theory

Cited by: 13
Authors
Zhao, Jie [1 ,2 ]
Liang, Jia-Ming [1 ]
Dong, Zhen-Ning [1 ]
Tang, De-Yu [3 ]
Liu, Zhen [3 ]
Affiliations
[1] Guangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
[2] Cornell Univ, Sch Elect & Comp Engn, New York, NY 14850 USA
[3] Guangdong Pharmaceut Univ, Sch Med Informat & Engn, Guangzhou 510006, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature selection; Rough set theory; Attribute reduction; Positive region; Heuristic algorithm; Swarm intelligence; ATTRIBUTE REDUCTION; DECISION SYSTEMS; OPTIMIZATION; ALGORITHM; CLASSIFICATION; APPROXIMATION; TABLES; PSO;
DOI
10.1016/j.ins.2020.03.092
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Feature selection plays an important role in data mining and machine learning tasks. As one of the most effective methods for feature selection, rough set theory provides a systematic theoretical framework for consistency-based feature selection, in which positive region-based dependency calculation is the most important step. However, it is time-consuming, and although many improved algorithms have been proposed, they are still computationally time-consuming. Therefore, to overcome this shortcoming, in this study, a nested equivalence class (NEC) approach is introduced to calculate dependency. The proposed method starts from the finest partition of the universe, and then extracts and uses the known knowledge of reducts in a decision table to construct an NEC. The proposed method not only simplifies dependency calculation but also reduces the universe correspondingly, in most cases. Using the proposed NEC-based approach, a number of representative heuristic- and swarm intelligence-based feature selection algorithms that apply rough set theory were enhanced. Note that the feature subset selected by each modified algorithm and that selected by the original algorithm were the same. Experiments conducted using 33 datasets from the UCI repository and KDD Cup competition, which included large-scale and high-dimensional datasets, demonstrated the efficiency and effectiveness of the proposed method. © 2020 Elsevier Inc. All rights reserved.
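For readers unfamiliar with the quantity the abstract refers to, the following is a minimal sketch of the textbook positive region-based dependency, gamma_B(D) = |POS_B(D)| / |U|. It is not the NEC method of the paper, whose construction is not given in this record. The function names `partition` and `dependency`, the dict-based row layout, and the toy table are all assumptions made for illustration.

```python
from collections import defaultdict


def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    classes = defaultdict(list)
    for i, row in enumerate(rows):
        classes[tuple(row[a] for a in attrs)].append(i)
    return list(classes.values())


def dependency(rows, cond_attrs, dec_attr):
    """Classical rough set dependency: |POS_B(D)| / |U|.

    An equivalence class belongs to the positive region when all of its
    rows share a single decision value.
    """
    if not rows:
        return 0.0
    pos = 0
    for block in partition(rows, cond_attrs):
        decisions = {rows[i][dec_attr] for i in block}
        if len(decisions) == 1:
            pos += len(block)
    return pos / len(rows)


# Hypothetical toy decision table: condition attributes "a", "b"; decision "d".
table = [
    {"a": 0, "b": 0, "d": "no"},
    {"a": 0, "b": 1, "d": "yes"},
    {"a": 1, "b": 0, "d": "no"},
    {"a": 1, "b": 1, "d": "no"},
    {"a": 0, "b": 1, "d": "no"},
]

gamma_ab = dependency(table, ["a", "b"], "d")
gamma_a = dependency(table, ["a"], "d")
```

Here rows 1 and 4 agree on both condition attributes but differ on the decision, so they fall outside the positive region and `gamma_ab` is 3/5. Recomputing the partition from scratch for every candidate subset, as this sketch does, is the repeated cost that the abstract describes as time-consuming.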
Pages: 431-453
Number of pages: 23
Related papers
50 in total
  • [31] A Multi-objective Feature Selection Approach Based on Binary PSO and Rough Set Theory
    Cervante, Liam
    Xue, Bing
    Shang, Lin
    Zhang, Mengjie
    EVOLUTIONARY COMPUTATION IN COMBINATORIAL OPTIMIZATION (EVOCOP 2013), 2013, 7832 : 25 - +
  • [32] Unsupervised Feature Selection Based on the Measures of Degree of Dependency using Rough Set Theory in Digital Mammogram Image Classification
    Velayutham, C.
    Thangavel, K.
    2011 THIRD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2011, : 163 - 168
  • [33] Feature selection of dominance-based neighborhood rough set approach for processing hybrid ordered data
    Chen, Jiayue
    Zhu, Ping
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2024, 167
  • [34] Feature selection based on rough set approach, wrapper approach, and binary whale optimization algorithm
    Mohamed A. Tawhid
    Abdelmonem M. Ibrahim
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 573 - 602
  • [35] A rough set approach to feature selection based on scatter search metaheuristic
    Jue Wang
    Qi Zhang
    Hedar Abdel-Rahman
    M. Ibrahim Abdel-Monem
    Journal of Systems Science and Complexity, 2014, 27 : 157 - 168
  • [36] A ROUGH SET APPROACH TO FEATURE SELECTION BASED ON SCATTER SEARCH METAHEURISTIC
    Wang Jue
    Zhang Qi
    Abdel-Rahman, Hedar
    Abdel-Monem, M. Ibrahim
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2014, 27 (01) : 157 - 168
  • [37] A rough set approach to feature selection based on power set tree
    Chen, Yumin
    Miao, Duoqian
    Wang, Ruizhi
    Wu, Keshou
    KNOWLEDGE-BASED SYSTEMS, 2011, 24 (02) : 275 - 281
  • [38] BINARY PSO AND ROUGH SET THEORY FOR FEATURE SELECTION: A MULTI-OBJECTIVE FILTER BASED APPROACH
    Xue, Bing
    Cervante, Liam
    Shang, Lin
    Browne, Will
    Zhang, Mengjie
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2014, 13 (02)
  • [39] Feature Selection Based on Neighborhood Systems and Rough Set Theory
    He, Ming
    WKDD: 2009 SECOND INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, : 3 - 5
  • [40] New filter approaches for feature selection using differential evolution and fuzzy rough set theory
    Hancer, Emrah
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (07) : 2929 - 2944