A novel feature selection approach based on clustering algorithm

被引:8
|
作者
Moslehi, Fateme [1 ]
Haeri, Abdorrahman [2 ]
机构
[1] Iran Univ Sci & Technol, Informat Technol Engn, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Ind Engn, Tehran, Iran
关键词
Data mining; clustering; K-means algorithm; feature selection; FEATURE SUBSET-SELECTION; GRAVITATIONAL SEARCH ALGORITHM; PARTICLE SWARM OPTIMIZATION; MUTUAL INFORMATION; CLASSIFICATION; HYBRID; REDUCTION;
D O I
10.1080/00949655.2020.1822358
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Clustering is one of the main methods of data mining. K-means algorithm is one of the most common clustering algorithms due to its efficiency and ease of use. In many data mining issues, the dataset contains a large number of fields and, therefore, the identification of the effective fields is an important issue. Appling the proposed algorithm, the important variables of the dataset would be identified. In the proposed method, the dataset is clustered in several stages and in each step the characteristics of the created clusters are examined and the features that transform the structure of clusters are introduced as effective features of the dataset. The proposed method was examined on 4 datasets and the results of this method were compared with other similar work and demonstrated that using this algorithm would eliminate redundant and unrelated features of the dataset and improve classification accuracy.
引用
收藏
页码:581 / 604
页数:24
相关论文
共 50 条
  • [41] An Improved Fuzzy Feature Clustering and Selection based on Chi-Squared-Test
    Chitsaz, Elham
    Taheri, Mohammad
    Katebi, Seraj D.
    Jahromi, Mansour Zolghadri
    IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 35 - 40
  • [42] Feature selection based on partition clustering
    Liu, Shuang
    Zhao, Qiang
    Wu, Xiang
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2014, 18 (02) : 135 - 142
  • [43] CBFS: A Clustering-Based Feature Selection Mechanism for Network Anomaly Detection
    Mao, Jiewen
    Hu, Yongquan
    Jiang, Dong
    Wei, Tongquan
    Shen, Fuke
    IEEE ACCESS, 2020, 8 : 116216 - 116225
  • [44] A novel quantum grasshopper optimization algorithm for feature selection
    Wang, Dong
    Chen, Hongmei
    Li, Tianrui
    Wan, Jihong
    Huang, Yanyong
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 127 : 33 - 53
  • [45] Novel optimized crow search algorithm for feature selection
    Samieiyan, Behrouz
    MohammadiNasab, Poorya
    Mollaei, Mostafa Abbas
    Hajizadeh, Fahimeh
    Kangavari, Mohammadreza
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 204
  • [46] New Feature Selection Algorithm Based on Feature Stability and Correlation
    Al-Shalabi, Luai
    IEEE ACCESS, 2022, 10 : 4699 - 4713
  • [47] A niching memetic algorithm for simultaneous clustering and feature selection
    Sheng, Weiguo
    Liu, Xiaohui
    Fairhurst, Michael
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (07) : 868 - 879
  • [48] A novel feature selection algorithm based on LVQ hypothesis margin
    Hu, Yaomin
    Liu, Weiming
    NEURAL COMPUTING & APPLICATIONS, 2014, 24 (06) : 1431 - 1439
  • [49] A novel hybrid algorithm for feature selection
    Yuefeng Zheng
    Ying Li
    Gang Wang
    Yupeng Chen
    Qian Xu
    Jiahao Fan
    Xueting Cui
    Personal and Ubiquitous Computing, 2018, 22 : 971 - 985
  • [50] A novel hybrid algorithm for feature selection
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    PERSONAL AND UBIQUITOUS COMPUTING, 2018, 22 (5-6) : 971 - 985