Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set

被引：1

作者：

Xu, Weihua ^{[1
]}

Yang, Yifei ^{[1
]}

Ding, Yi ^{[1
]}

Chen, Xiyang ^{[2
]}

Lv, Xiaofang ^{[3
]}

机构：

[1] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China

[2] Xian Univ Sci & Technol, Coll Comp Sci & Technol, Xian 710600, Peoples R China

[3] Southwest Univ, Coll Life Sci, Chongqing 400715, Peoples R China

来源：

APPLIED INTELLIGENCE | 2024年 / 54卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Conditional entropy; Dominance matrix; Feature selection; Ordered data set; Rough set; ATTRIBUTE REDUCTION; DYNAMIC DATA; LEARNING ALGORITHM; ROUGH SETS;

D O I：

10.1007/s10489-024-05411-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Rough set theory is a mathematical tool widely employed in various fields to handle uncertainty. Feature selection, as an essential and independent research area within rough set theory, aims to identify a small subset of important features by eliminating irrelevant, redundant, or noisy ones. In human life, data characteristics constantly change over time and other factors, resulting in ordered datasets with varying features. However, existing feature extraction methods are not suitable for handling such datasets since they do not consider previous reduction results when features change and need to be recomputed, leading to significant time consumption. To address this issue, the incremental attribute reduction algorithm utilizes prior reduction results effectively reducing computation time. Motivated by this approach, this paper investigates incremental feature selection algorithms for ordered datasets with changing features. Firstly, we discuss the dominant matrix and the dominance conditional entropy while introducing update principles for the new dominant matrix and dominance diagonal matrix when features change. Subsequently, we propose two incremental feature selection algorithms for adding (IFS-A) or deleting (IFS-D) features in ordered data set. Additionally, nine UCI datasets are utilized to evaluate the performance of our proposed algorithm. The experimental results validate that the average classification accuracy of IFS-A and IFS-D under four classifiers on twelve datasets is 82.05% and 80.75%, which increases by 5.48% and 3.68% respectively compared with the original data.

引用

页码：4890 / 4910

页数：21

共 50 条

[31] Matrix-based incremental feature selection method using weight-partitioned multigranulation rough set
Xu, Weihua
Bu, Qinyuan
INFORMATION SCIENCES, 2024, 681
[32] A Novel Data-Driven Tropical Cyclone Track Prediction Model Based on CNN and GRU With Multi-Dimensional Feature Selection
Lian, Jie
Dong, Pingping
Zhang, Yuping
Pan, Jianguo
Liu, Kehao
IEEE ACCESS, 2020, 8 : 97114 - 97128
[33] Feature Selection Using Generalized Multi-Granulation Dominance Neighborhood Rough Set Based on Weight Partition
Xu, Weihua
Bu, Qinyuan
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 213 - 227
[34] A Cooperative Coevolutionary Approach to Discretization-Based Feature Selection for High-Dimensional Data
Zhou, Yu
Kang, Junhao
Zhang, Xiao
ENTROPY, 2020, 22 (06)
[35] Dominance relation-based feature selection for interval-valued multi-label ordered information system
Qin, Yujie
Lin, Guoping
Lin, Yidong
Kou, Yi
Hu, Wenyue
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
[36] A Multi-objective Feature Selection Approach Based on Binary PSO and Rough Set Theory
Cervante, Liam
Xue, Bing
Shang, Lin
Zhang, Mengjie
EVOLUTIONARY COMPUTATION IN COMBINATORIAL OPTIMIZATION (EVOCOP 2013), 2013, 7832 : 25 - +
[37] Clustering-based Sequential Feature Selection Approach for High Dimensional Data Classification
Alimoussa, M.
Porebski, A.
Vandenbroucke, N.
Thami, R. Oulad Haj
El Fkihi, S.
VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, : 122 - 132
[38] A hybrid feature selection approach based on ensemble method for high-dimensional data
Rouhi, Amirreza
Nezamabadi-pour, Hossein
2017 2ND CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC), 2017, : 16 - 20
[39] Semi-supervised feature selection for partially labeled mixed-type data based on multi-criteria measure approach
Shu, Wenhao
Yu, Jianhui
Yan, Zhenchao
Qian, Wenbin
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 153 : 258 - 279
[40] A Label Correlation Based Weighting Feature Selection Approach for Multi-label Data
Liu, Lu
Zhang, Jing
Li, Peipei
Zhang, Yuhong
Hu, Xuegang
WEB-AGE INFORMATION MANAGEMENT, PT II, 2016, 9659 : 369 - 379

← 1 2 3 4 5 →