Feature Selection Based on Difference and Similitude in Data Mining

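Cited by: 0
Author
WU Ming
Funding
National Natural Science Foundation of China
Keywords
knowledge reduction; feature selection; rough set; difference set; similitude set; attribute rank function
DOI
Not available
Chinese Library Classification number
TP311.13
Discipline classification code
1201
Abstract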
Feature selection is a preprocessing step in data mining, and heuristic search algorithms are often used for it. Many of these algorithms are based on discernibility matrices, which consider only the differences in an information system. Because a discernibility matrix does not reveal similar characteristics, the resulting rules may not be the simplest. Difference-similitude (DS) methods take both difference and similitude into account, but the existing search strategy can cause some important features to be ignored. This paper proposes an improved DS-based algorithm to solve this problem. The improved algorithm defines an attribute rank function that considers both difference and similitude in feature selection. Experiments show that the algorithm is effective, especially for large-scale databases. The time complexity of the algorithm is O(|C|^2 |U|^2).
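The abstract does not define the difference and similitude sets or the attribute rank function, so the following is only a minimal sketch under stated assumptions. It assumes the common rough-set reading: for two objects with different decisions, the difference set holds the condition attributes on which they disagree (one discernibility-matrix entry), and for two objects with the same decision, the similitude set holds the condition attributes on which they agree. The function name, the toy decision table, and the labels are hypothetical, and the paper's rank function is not reproduced.

```python
from itertools import combinations


def difference_and_similitude(rows, decisions):
    """Collect per-pair difference and similitude sets.

    rows      -- list of dicts mapping condition attribute -> value
    decisions -- list of decision labels, one per row
    Assumed definitions (not taken from the paper's text):
      difference set: attributes that differ, for pairs with different decisions
      similitude set: attributes that agree, for pairs with the same decision
    """
    diff, sim = {}, {}
    for i, j in combinations(range(len(rows)), 2):
        attrs = rows[i].keys()
        if decisions[i] != decisions[j]:
            diff[(i, j)] = {a for a in attrs if rows[i][a] != rows[j][a]}
        else:
            sim[(i, j)] = {a for a in attrs if rows[i][a] == rows[j][a]}
    return diff, sim


# Hypothetical toy decision table with condition attributes a, b, c.
table = [
    {"a": 0, "b": 1, "c": 0},
    {"a": 0, "b": 0, "c": 1},
    {"a": 1, "b": 1, "c": 0},
]
labels = ["yes", "no", "yes"]

diff, sim = difference_and_similitude(table, labels)
print(diff)
print(sim)
```

In this toy table, objects 0 and 2 share a decision and agree on b and c, while objects 0 and 1 have different decisions and disagree on exactly b and c. A method that looks only at the difference sets would not see the first fact, which is the kind of information the abstract says discernibility matrices leave out.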
Pages: 467-470
Number of pages: 4