Feature Selection Based on Difference and Similitude in Data Mining

Cited by: 0

Authors:

WU Ming

Source:

Wuhan University Journal of Natural Sciences | 2007, Issue 03

Funding:

National Natural Science Foundation of China;

Keywords:

knowledge reduction; feature selection; rough set; difference set; similitude set; attribute rank function;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP311.13;

Discipline classification code:

1201;

Abstract:

Feature selection is a preprocessing step in data mining, and heuristic search algorithms are often used for it. Many of these algorithms are based on discernibility matrices, which consider only the differences in an information system. Because a discernibility matrix does not reveal similar characteristics, the resulting rules may not be the simplest. Difference-similitude (DS) methods take both difference and similitude into account, but the existing search strategy can cause some important features to be ignored. This paper proposes an improved DS-based algorithm to solve this problem. The improved algorithm defines an attribute rank function that considers both difference and similitude in feature selection. Experiments show that the algorithm is effective, especially for large-scale databases. Its time complexity is O(|C|^2 |U|^2).
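
The abstract does not give the definition of the attribute rank function, so the following Python sketch is only an illustration of the general idea, not the paper's algorithm. It assumes that a difference set holds the attributes on which two objects with different decisions disagree, and that a similitude set holds the attributes on which two objects with the same decision agree. The function names (`pairwise_sets`, `rank`, `select_features`) and the ranking rule (difference count first, similitude count as a tie-breaker) are hypothetical.

```python
from itertools import combinations


def pairwise_sets(rows, decisions):
    """Build difference and similitude sets over all object pairs.

    Assumed definitions (not taken from the paper):
    - difference set: attributes that differ between two objects
      whose decisions differ;
    - similitude set: attributes that agree between two objects
      whose decisions are equal.
    """
    n_attrs = len(rows[0])
    all_attrs = frozenset(range(n_attrs))
    diff_sets, sim_sets = [], []
    for i, j in combinations(range(len(rows)), 2):
        differing = frozenset(a for a in all_attrs if rows[i][a] != rows[j][a])
        if decisions[i] != decisions[j]:
            if differing:  # skip inconsistent pairs with identical attributes
                diff_sets.append(differing)
        else:
            sim_sets.append(all_attrs - differing)
    return diff_sets, sim_sets


def rank(attr, uncovered_diff, sim_sets):
    """Hypothetical rank: (difference count, similitude count)."""
    d = sum(attr in s for s in uncovered_diff)
    s = sum(attr in s for s in sim_sets)
    return (d, s)


def select_features(rows, decisions):
    """Greedily pick attributes until every difference set is covered."""
    uncovered, sim_sets = pairwise_sets(rows, decisions)
    remaining = set(range(len(rows[0])))
    selected = []
    while uncovered:
        # Ties are resolved in favour of the lowest attribute index.
        best = max(sorted(remaining), key=lambda a: rank(a, uncovered, sim_sets))
        selected.append(best)
        remaining.remove(best)
        uncovered = [s for s in uncovered if best not in s]
    return sorted(selected)


# Toy decision table: attributes 0 and 2 each determine the decision,
# attribute 1 is irrelevant.
rows = [(1, 0, 0), (1, 1, 0), (0, 0, 1), (0, 1, 1)]
decisions = [0, 0, 1, 1]
print(select_features(rows, decisions))  # prints [0]
```

In this toy table a single attribute is enough to separate the two decision classes, so the greedy loop stops after one pick. Each round scans every remaining attribute against every pairwise set, which is why pair-based methods of this kind grow quadratically with the number of objects.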

Pages: 467-470

Page count: 4
