Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set

被引:1
作者
Xu, Weihua [1 ]
Yang, Yifei [1 ]
Ding, Yi [1 ]
Chen, Xiyang [2 ]
Lv, Xiaofang [3 ]
机构
[1] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China
[2] Xian Univ Sci & Technol, Coll Comp Sci & Technol, Xian 710600, Peoples R China
[3] Southwest Univ, Coll Life Sci, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Conditional entropy; Dominance matrix; Feature selection; Ordered data set; Rough set; ATTRIBUTE REDUCTION; DYNAMIC DATA; LEARNING ALGORITHM; ROUGH SETS;
D O I
10.1007/s10489-024-05411-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rough set theory is a mathematical tool widely employed in various fields to handle uncertainty. Feature selection, as an essential and independent research area within rough set theory, aims to identify a small subset of important features by eliminating irrelevant, redundant, or noisy ones. In human life, data characteristics constantly change over time and other factors, resulting in ordered datasets with varying features. However, existing feature extraction methods are not suitable for handling such datasets since they do not consider previous reduction results when features change and need to be recomputed, leading to significant time consumption. To address this issue, the incremental attribute reduction algorithm utilizes prior reduction results effectively reducing computation time. Motivated by this approach, this paper investigates incremental feature selection algorithms for ordered datasets with changing features. Firstly, we discuss the dominant matrix and the dominance conditional entropy while introducing update principles for the new dominant matrix and dominance diagonal matrix when features change. Subsequently, we propose two incremental feature selection algorithms for adding (IFS-A) or deleting (IFS-D) features in ordered data set. Additionally, nine UCI datasets are utilized to evaluate the performance of our proposed algorithm. The experimental results validate that the average classification accuracy of IFS-A and IFS-D under four classifiers on twelve datasets is 82.05% and 80.75%, which increases by 5.48% and 3.68% respectively compared with the original data.
引用
收藏
页码:4890 / 4910
页数:21
相关论文
共 50 条
  • [21] Active Incremental Feature Selection Using a Fuzzy-Rough-Set-Based Information Entropy
    Zhang, Xiao
    Mei, Changlin
    Chen, Degang
    Yang, Yanyan
    Li, Jinhai
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (05) : 901 - 915
  • [22] Granular-ball-matrix-based incremental semi-supervised feature selection approach to high-dimensional variation using neighbourhood discernibility degree for ordered partially labelled dataset
    Xu, Weihua
    Li, Jinlong
    APPLIED INTELLIGENCE, 2025, 55 (04)
  • [23] Parallel Rough Set: Dimensionality Reduction and Feature Discovery of Multi-dimensional Data in Visualization
    Huang, Tze-Haw
    Huang, Mao Lin
    Jin, Jesse S.
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 99 - +
  • [24] An incremental approach to feature selection using the weighted dominance-based neighborhood rough sets
    Yanzhou Pan
    Weihua Xu
    Qinwen Ran
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 1217 - 1233
  • [25] A matrix-based incremental attribute reduction approach under knowledge granularity on the variation of attribute set
    Jing, Yunge
    Li, Tianrui
    2015 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE), 2015, : 34 - 39
  • [26] Feature Selection Using Approximate Conditional Entropy Based on Fuzzy Information Granule for Gene Expression Data Classification
    Zhang, Hengyi
    FRONTIERS IN GENETICS, 2021, 12
  • [27] Incremental updating approximations in dominance-based rough sets approach under the variation of the attribute set
    Li, Shaoyong
    Li, Tianrui
    Liu, Dun
    KNOWLEDGE-BASED SYSTEMS, 2013, 40 : 17 - 26
  • [28] Feature selection based on multi-perspective dynamic neighbourhood entropy measures in a dynamic neighbourhood rough set
    Xu, Jiucheng
    Ma, Miaoxian
    Zhang, Shan
    Niu, Wulin
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [29] Feature selection in mixed data: A method using a novel fuzzy rough set-based information entropy
    Zhang, Xiao
    Mei, Changlin
    Chen, Degang
    Li, Jinhai
    PATTERN RECOGNITION, 2016, 56 : 1 - 15
  • [30] An incremental feature selection approach based on scatter matrices for classification of cancer microarray data
    Sardana, Manju
    Agrawal, R. K.
    Kaur, Baljeet
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2015, 92 (02) : 277 - 295