Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set

被引:1
作者
Xu, Weihua [1 ]
Yang, Yifei [1 ]
Ding, Yi [1 ]
Chen, Xiyang [2 ]
Lv, Xiaofang [3 ]
机构
[1] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China
[2] Xian Univ Sci & Technol, Coll Comp Sci & Technol, Xian 710600, Peoples R China
[3] Southwest Univ, Coll Life Sci, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Conditional entropy; Dominance matrix; Feature selection; Ordered data set; Rough set; ATTRIBUTE REDUCTION; DYNAMIC DATA; LEARNING ALGORITHM; ROUGH SETS;
D O I
10.1007/s10489-024-05411-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rough set theory is a mathematical tool widely employed in various fields to handle uncertainty. Feature selection, as an essential and independent research area within rough set theory, aims to identify a small subset of important features by eliminating irrelevant, redundant, or noisy ones. In human life, data characteristics constantly change over time and other factors, resulting in ordered datasets with varying features. However, existing feature extraction methods are not suitable for handling such datasets since they do not consider previous reduction results when features change and need to be recomputed, leading to significant time consumption. To address this issue, the incremental attribute reduction algorithm utilizes prior reduction results effectively reducing computation time. Motivated by this approach, this paper investigates incremental feature selection algorithms for ordered datasets with changing features. Firstly, we discuss the dominant matrix and the dominance conditional entropy while introducing update principles for the new dominant matrix and dominance diagonal matrix when features change. Subsequently, we propose two incremental feature selection algorithms for adding (IFS-A) or deleting (IFS-D) features in ordered data set. Additionally, nine UCI datasets are utilized to evaluate the performance of our proposed algorithm. The experimental results validate that the average classification accuracy of IFS-A and IFS-D under four classifiers on twelve datasets is 82.05% and 80.75%, which increases by 5.48% and 3.68% respectively compared with the original data.
引用
收藏
页码:4890 / 4910
页数:21
相关论文
共 50 条
  • [31] Matrix-based incremental feature selection method using weight-partitioned multigranulation rough set
    Xu, Weihua
    Bu, Qinyuan
    INFORMATION SCIENCES, 2024, 681
  • [32] A Novel Data-Driven Tropical Cyclone Track Prediction Model Based on CNN and GRU With Multi-Dimensional Feature Selection
    Lian, Jie
    Dong, Pingping
    Zhang, Yuping
    Pan, Jianguo
    Liu, Kehao
    IEEE ACCESS, 2020, 8 : 97114 - 97128
  • [33] Feature Selection Using Generalized Multi-Granulation Dominance Neighborhood Rough Set Based on Weight Partition
    Xu, Weihua
    Bu, Qinyuan
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 213 - 227
  • [34] A Cooperative Coevolutionary Approach to Discretization-Based Feature Selection for High-Dimensional Data
    Zhou, Yu
    Kang, Junhao
    Zhang, Xiao
    ENTROPY, 2020, 22 (06)
  • [35] Dominance relation-based feature selection for interval-valued multi-label ordered information system
    Qin, Yujie
    Lin, Guoping
    Lin, Yidong
    Kou, Yi
    Hu, Wenyue
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
  • [36] A Multi-objective Feature Selection Approach Based on Binary PSO and Rough Set Theory
    Cervante, Liam
    Xue, Bing
    Shang, Lin
    Zhang, Mengjie
    EVOLUTIONARY COMPUTATION IN COMBINATORIAL OPTIMIZATION (EVOCOP 2013), 2013, 7832 : 25 - +
  • [37] Clustering-based Sequential Feature Selection Approach for High Dimensional Data Classification
    Alimoussa, M.
    Porebski, A.
    Vandenbroucke, N.
    Thami, R. Oulad Haj
    El Fkihi, S.
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, : 122 - 132
  • [38] A hybrid feature selection approach based on ensemble method for high-dimensional data
    Rouhi, Amirreza
    Nezamabadi-pour, Hossein
    2017 2ND CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC), 2017, : 16 - 20
  • [39] Semi-supervised feature selection for partially labeled mixed-type data based on multi-criteria measure approach
    Shu, Wenhao
    Yu, Jianhui
    Yan, Zhenchao
    Qian, Wenbin
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 153 : 258 - 279
  • [40] A Label Correlation Based Weighting Feature Selection Approach for Multi-label Data
    Liu, Lu
    Zhang, Jing
    Li, Peipei
    Zhang, Yuhong
    Hu, Xuegang
    WEB-AGE INFORMATION MANAGEMENT, PT II, 2016, 9659 : 369 - 379