Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set

被引:1
作者
Xu, Weihua [1 ]
Yang, Yifei [1 ]
Ding, Yi [1 ]
Chen, Xiyang [2 ]
Lv, Xiaofang [3 ]
机构
[1] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China
[2] Xian Univ Sci & Technol, Coll Comp Sci & Technol, Xian 710600, Peoples R China
[3] Southwest Univ, Coll Life Sci, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Conditional entropy; Dominance matrix; Feature selection; Ordered data set; Rough set; ATTRIBUTE REDUCTION; DYNAMIC DATA; LEARNING ALGORITHM; ROUGH SETS;
D O I
10.1007/s10489-024-05411-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rough set theory is a mathematical tool widely employed in various fields to handle uncertainty. Feature selection, as an essential and independent research area within rough set theory, aims to identify a small subset of important features by eliminating irrelevant, redundant, or noisy ones. In human life, data characteristics constantly change over time and other factors, resulting in ordered datasets with varying features. However, existing feature extraction methods are not suitable for handling such datasets since they do not consider previous reduction results when features change and need to be recomputed, leading to significant time consumption. To address this issue, the incremental attribute reduction algorithm utilizes prior reduction results effectively reducing computation time. Motivated by this approach, this paper investigates incremental feature selection algorithms for ordered datasets with changing features. Firstly, we discuss the dominant matrix and the dominance conditional entropy while introducing update principles for the new dominant matrix and dominance diagonal matrix when features change. Subsequently, we propose two incremental feature selection algorithms for adding (IFS-A) or deleting (IFS-D) features in ordered data set. Additionally, nine UCI datasets are utilized to evaluate the performance of our proposed algorithm. The experimental results validate that the average classification accuracy of IFS-A and IFS-D under four classifiers on twelve datasets is 82.05% and 80.75%, which increases by 5.48% and 3.68% respectively compared with the original data.
引用
收藏
页码:4890 / 4910
页数:21
相关论文
共 50 条
  • [11] A Novel Online Multi-label Feature Selection Approach for Multi-dimensional Streaming Data
    Zhang, Zhanyun
    Luo, Chuan
    Li, Tianrui
    Chen, Hongmei
    Liu, Dun
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 159 - 171
  • [12] BSSReduce an O(|U|) Incremental Feature Selection Approach for Large-Scale and High-Dimensional Data
    Gong, Ke
    Wang, Yong
    Xu, Maozeng
    Xiao, Zhi
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (06) : 3356 - 3367
  • [13] A Rough Set Approach to Feature Selection Based on Relative Decision Entropy
    Zhou, Lin
    Jiang, Feng
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2011, 6954 : 110 - 119
  • [14] Rough set Theory-Based group incremental approach to feature selection
    Zhao, Jie
    Wu, Dai-yang
    Zhou, Yong-xin
    Liang, Jia-ming
    Wei, WenHong
    Li, Yun
    INFORMATION SCIENCES, 2024, 675
  • [15] Efficient updating rough approximations with multi-dimensional variation of ordered data
    Wang, Shu
    Li, Tianrui
    Luo, Chuan
    Fujita, Hamido
    INFORMATION SCIENCES, 2016, 372 : 690 - 708
  • [16] Feature Subset Selection Approach Based on Fuzzy Rough Set for High-dimensional Data
    Guo, Changyou
    Zheng, Xuefeng
    2014 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC), 2014, : 72 - 75
  • [17] An incremental approach to feature selection using the weighted dominance-based neighborhood rough sets
    Pan, Yanzhou
    Xu, Weihua
    Ran, Qinwen
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (04) : 1217 - 1233
  • [18] Feature selection in a discrete feature space based on fuzzy conditional information entropy iterative model and matrix operation
    Li, Zhaowen
    Chen, Yiying
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2023, 52 (05) : 597 - 635
  • [19] Feature selection for multiset-valued data based on fuzzy conditional information entropy using iterative model and matrix operation
    Huang, Dan
    Chen, Yiying
    Liu, Fang
    Li, Zhaowen
    APPLIED SOFT COMPUTING, 2023, 142
  • [20] A Dynamic Dominance-Based Rough Set Approach for Processing Ordered Data
    Li, Shaoyong
    Hong, Zhiyong
    ROUGH SETS, IJCRS 2019, 2019, 11499 : 312 - 320