Systematic Feature Selection Based on Three-Level Improvements of Fuzzy Dominance Three-Way Neighborhood Rough Sets

被引:11
作者
Zhang, Xianyong [1 ]
Chen, Benwei [1 ]
Miao, Duoqian [2 ]
机构
[1] Sichuan Normal Univ, Sch Math Sci, Chengdu 610066, Peoples R China
[2] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; fuzzy dominance three-way neighborhood rough set (FD3NRS); granular computing (GrC); ordered decision system; three-way decision (3WD); uncertainty measurement; ATTRIBUTE REDUCTION; ENTROPY;
D O I
10.1109/TFUZZ.2024.3437367
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection facilitates system processing, and it relies on knowledge granulation and uncertainty measurement. Focusing on ordered decision systems, the fuzzy dominance neighborhood (FDN) granulation and corresponding condition entropy have recently yielded an outstanding algorithm for feature selection, FDNCE-FS (fuzzy dominance neighborhood condition entropy-based feature selection). However, there is room for improvement. Accordingly, three-level improvements of knowledge granulation, information enrichment, and heterogeneity fusion are proposed here, and $2\times 2\times 2=8$ heuristic algorithms of feature selection are systematically established. First, FDN granulation is improved to fuzzy dominance three-way neighborhood (FD3N) granulation through three-way decision on fuzzy dominance degrees, and FD3N rough sets are modeled to offer better dependency. Second, the FDN condition entropy is improved to FD3N condition entropy by reinforcing the interaction factor and class information, and corresponding measure systems are constructed. Third, FD3N dependency is fused with four types of condition entropy to produce four combined measures, and eight uncertainty measures hierarchically emerge due to the three-level improvements. Fourth, these systematic measures have granulation nonmonotonicity, and they enable heuristic algorithms for feature selection; thus, the current FDNCE-FS method is improved to seven new selection algorithms: FHN-FS, RHN-FS, RFHN-FS, HTWN-FS, FHTWN-FS, RHTWN-FS, and RFHTWN-FS. Finally, the relevant FD3N granulation, uncertainty measurement, and feature selection are validated by data-based experiments, and the seven novel algorithms are shown to outperform FDNCE-FS in terms of classification performance. This study provides new insights into uncertainty modeling, information fusion, and feature selection through granular computing and three-way decision.
引用
收藏
页码:5060 / 5072
页数:13
相关论文
共 37 条
[1]   Improved evolutionary-based feature selection technique using extension of knowledge based on the rough approximations [J].
Abd Elaziz, Mohamed ;
Abu-Donia, Hassan M. ;
Hosny, Rodyna A. ;
Hazae, Saeed L. ;
Ibrahim, Rehab Ali .
INFORMATION SCIENCES, 2022, 594 (76-94) :76-94
[2]   A survey on swarm intelligence approaches to feature selection in data mining [J].
Bach Hoai Nguyen ;
Xue, Bing ;
Zhang, Mengjie .
SWARM AND EVOLUTIONARY COMPUTATION, 2020, 54
[3]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[4]   A novel approach for learning label correlation with application to feature selection of multi-label data [J].
Che, Xiaoya ;
Chen, Degang ;
Mi, Jusheng .
INFORMATION SCIENCES, 2020, 512 :795-812
[5]   Attribute Reduction for Heterogeneous Data Based on the Combination of Classical and Fuzzy Rough Set Models [J].
Chen, Degang ;
Yang, Yanyan .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (05) :1325-1334
[6]   Parallel attribute reduction in dominance-based neighborhood rough set [J].
Chen, Hongmei ;
Li, Tianrui ;
Cai, Yong ;
Luo, Chuan ;
Fujita, Hamido .
INFORMATION SCIENCES, 2016, 373 :351-368
[7]   An entropy-based uncertainty measurement approach in neighborhood systems [J].
Chen, Yumin ;
Wu, Keshou ;
Chen, Xuhui ;
Tang, Chaohui ;
Zhu, Qingxin .
INFORMATION SCIENCES, 2014, 279 :239-250
[8]   Dominance-based fuzzy rough set approach for incomplete interval-valued data [J].
Dai, Jianhua ;
Yan, Yuejun ;
Li, Zhaowen ;
Liao, Beishui .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (01) :423-436
[9]   Rough set approach to multiple criteria classification with imprecise evaluations and assignments [J].
Dembczynski, Krzysztof ;
Greco, Salvatore ;
Slowinski, Roman .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2009, 198 (02) :626-636
[10]   A comprehensive survey on feature selection in the various fields of machine learning [J].
Dhal, Pradip ;
Azad, Chandrashekhar .
APPLIED INTELLIGENCE, 2022, 52 (04) :4543-4581