Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set

被引：1

作者：

Xu, Weihua ^{[1
]}

Yang, Yifei ^{[1
]}

Ding, Yi ^{[1
]}

Chen, Xiyang ^{[2
]}

Lv, Xiaofang ^{[3
]}

机构：

[1] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China

[2] Xian Univ Sci & Technol, Coll Comp Sci & Technol, Xian 710600, Peoples R China

[3] Southwest Univ, Coll Life Sci, Chongqing 400715, Peoples R China

来源：

APPLIED INTELLIGENCE | 2024年 / 54卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Conditional entropy; Dominance matrix; Feature selection; Ordered data set; Rough set; ATTRIBUTE REDUCTION; DYNAMIC DATA; LEARNING ALGORITHM; ROUGH SETS;

D O I：

10.1007/s10489-024-05411-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Rough set theory is a mathematical tool widely employed in various fields to handle uncertainty. Feature selection, as an essential and independent research area within rough set theory, aims to identify a small subset of important features by eliminating irrelevant, redundant, or noisy ones. In human life, data characteristics constantly change over time and other factors, resulting in ordered datasets with varying features. However, existing feature extraction methods are not suitable for handling such datasets since they do not consider previous reduction results when features change and need to be recomputed, leading to significant time consumption. To address this issue, the incremental attribute reduction algorithm utilizes prior reduction results effectively reducing computation time. Motivated by this approach, this paper investigates incremental feature selection algorithms for ordered datasets with changing features. Firstly, we discuss the dominant matrix and the dominance conditional entropy while introducing update principles for the new dominant matrix and dominance diagonal matrix when features change. Subsequently, we propose two incremental feature selection algorithms for adding (IFS-A) or deleting (IFS-D) features in ordered data set. Additionally, nine UCI datasets are utilized to evaluate the performance of our proposed algorithm. The experimental results validate that the average classification accuracy of IFS-A and IFS-D under four classifiers on twelve datasets is 82.05% and 80.75%, which increases by 5.48% and 3.68% respectively compared with the original data.

引用

页码：4890 / 4910

页数：21

共 50 条

[21] Active Incremental Feature Selection Using a Fuzzy-Rough-Set-Based Information Entropy
Zhang, Xiao
Mei, Changlin
Chen, Degang
Yang, Yanyan
Li, Jinhai
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (05) : 901 - 915
[22] Granular-ball-matrix-based incremental semi-supervised feature selection approach to high-dimensional variation using neighbourhood discernibility degree for ordered partially labelled dataset
Xu, Weihua
Li, Jinlong
APPLIED INTELLIGENCE, 2025, 55 (04)
[23] Parallel Rough Set: Dimensionality Reduction and Feature Discovery of Multi-dimensional Data in Visualization
Huang, Tze-Haw
Huang, Mao Lin
Jin, Jesse S.
NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 99 - +
[24] An incremental approach to feature selection using the weighted dominance-based neighborhood rough sets
Yanzhou Pan
Weihua Xu
Qinwen Ran
International Journal of Machine Learning and Cybernetics, 2023, 14 : 1217 - 1233
[25] A matrix-based incremental attribute reduction approach under knowledge granularity on the variation of attribute set
Jing, Yunge
Li, Tianrui
2015 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE), 2015, : 34 - 39
[26] Feature Selection Using Approximate Conditional Entropy Based on Fuzzy Information Granule for Gene Expression Data Classification
Zhang, Hengyi
FRONTIERS IN GENETICS, 2021, 12
[27] Incremental updating approximations in dominance-based rough sets approach under the variation of the attribute set
Li, Shaoyong
Li, Tianrui
Liu, Dun
KNOWLEDGE-BASED SYSTEMS, 2013, 40 : 17 - 26
[28] Feature selection based on multi-perspective dynamic neighbourhood entropy measures in a dynamic neighbourhood rough set
Xu, Jiucheng
Ma, Miaoxian
Zhang, Shan
Niu, Wulin
APPLIED INTELLIGENCE, 2025, 55 (06)
[29] Feature selection in mixed data: A method using a novel fuzzy rough set-based information entropy
Zhang, Xiao
Mei, Changlin
Chen, Degang
Li, Jinhai
PATTERN RECOGNITION, 2016, 56 : 1 - 15
[30] An incremental feature selection approach based on scatter matrices for classification of cancer microarray data
Sardana, Manju
Agrawal, R. K.
Kaur, Baljeet
INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2015, 92 (02) : 277 - 295

← 1 2 3 4 5 →