A novel feature selection approach based on clustering algorithm

Cited by: 8
Authors
Moslehi, Fateme [1]
Haeri, Abdorrahman [2]
Affiliations
[1] Iran Univ Sci & Technol, Informat Technol Engn, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Ind Engn, Tehran, Iran
Keywords
Data mining; clustering; K-means algorithm; feature selection; FEATURE SUBSET-SELECTION; GRAVITATIONAL SEARCH ALGORITHM; PARTICLE SWARM OPTIMIZATION; MUTUAL INFORMATION; CLASSIFICATION; HYBRID; REDUCTION
DOI
10.1080/00949655.2020.1822358
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Clustering is one of the main methods of data mining, and the K-means algorithm is among the most common clustering algorithms because of its efficiency and ease of use. In many data mining problems the dataset contains a large number of fields, so identifying the effective fields is an important task. The proposed algorithm identifies the important variables of a dataset. The dataset is clustered in several stages; at each stage the characteristics of the resulting clusters are examined, and the features that transform the structure of the clusters are introduced as the effective features of the dataset. The method was evaluated on 4 datasets and compared with other similar work. The results show that it eliminates redundant and unrelated features of the dataset and improves classification accuracy.
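
The abstract does not give the details of the staged clustering procedure, so the following is only a minimal sketch of one way the idea could be illustrated, not the authors' algorithm. It assumes a leave-one-feature-out scheme: cluster on all features with K-means, re-cluster with each feature removed, and score a feature by how much the cluster assignments change, measured here as one minus the Rand index. The function names (`kmeans`, `rand_index`, `structure_change_scores`), the farthest-point initialisation, and the toy data are all hypothetical choices made for this example.

```python
from itertools import combinations


def kmeans(points, k, iters=50):
    """Plain K-means on a list of equal-length numeric tuples; returns labels."""
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    # Deterministic farthest-point initialisation. This is an assumption made
    # so the sketch is reproducible; the abstract does not specify one.
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(dist2(p, c) for c in centers)))

    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda c: dist2(p, centers[c])) for p in points]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old center if a cluster becomes empty
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels


def rand_index(a, b):
    """Fraction of point pairs on which two labelings agree (same/different cluster)."""
    pairs = list(combinations(range(len(a)), 2))
    agree = sum((a[i] == a[j]) == (b[i] == b[j]) for i, j in pairs)
    return agree / len(pairs)


def structure_change_scores(data, k):
    """Score each feature by how much removing it changes the clustering.

    Hypothetical scoring rule: 1 - Rand index between the clustering on all
    features and the clustering without feature j. Higher means the feature
    has more influence on the cluster structure.
    """
    baseline = kmeans(data, k)
    scores = []
    for j in range(len(data[0])):
        reduced = [row[:j] + row[j + 1:] for row in data]
        scores.append(1.0 - rand_index(baseline, kmeans(reduced, k)))
    return scores


# Toy data: feature 0 separates two groups, feature 1 is small-scale noise.
data = [(0.0 if i < 10 else 10.0, (i * 7 % 10) / 10) for i in range(20)]
scores = structure_change_scores(data, k=2)
print(scores)
```

On this toy data, removing the noise feature leaves the clusters unchanged while removing the separating feature reshuffles them, so feature 0 receives the higher score. Real datasets would normally need feature scaling and several K-means restarts before such scores are meaningful.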
Pages: 581-604
Number of pages: 24
Related papers
50 in total
  • [21] Feature Selection Approach based on Moth-Flame Optimization Algorithm
    Zawbaa, Hossam M.
    Emary, E.
    Parv, B.
    Sharawi, Marwa
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 4612 - 4617
  • [22] A survey on feature selection approaches for clustering
    Hancer, Emrah
    Xue, Bing
    Zhang, Mengjie
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (06) : 4519 - 4545
  • [23] Parasitism - Predation algorithm (PPA): A novel approach for feature selection
    Mohamed, Al-Attar A.
    Hassan, S. A.
    Hemeida, A. M.
    Alkhalaf, Salem
    Mahmoud, M. M. M.
    Eldin, Ayman M. Baha
    AIN SHAMS ENGINEERING JOURNAL, 2020, 11 (02) : 293 - 308
  • [24] Binary Peacock Algorithm: A Novel Metaheuristic Approach for Feature Selection
    Banati, Hema
    Sharma, Richa
    Yadav, Asha
    JOURNAL OF CLASSIFICATION, 2024, 41 (02) : 216 - 244
  • [26] A novel filter feature selection algorithm based on relief
    Cui, Xueting
    Li, Ying
    Fan, Jiahao
    Wang, Tan
    APPLIED INTELLIGENCE, 2022, 52 (05) : 5063 - 5081
  • [27] A Novel Automatic Grouping Algorithm for Feature Selection
    Yuan, Qiulong
    Fang, Yuchun
    COMPUTER VISION, PT III, 2017, 773 : 592 - 603
  • [28] A Novel Crowding Clustering Algorithm for Unsupervised and Supervised Filter Feature Selection Problem
    Ghanem, Khadoudja
    Layeb, Abdesslem
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [29] A harmony search algorithm for clustering with feature selection
    Cobos, Carlos
    Leon, Elizabeth
    Mendoza, Martha
    REVISTA FACULTAD DE INGENIERIA-UNIVERSIDAD DE ANTIOQUIA, 2010, (55) : 153 - 164
  • [30] Balanced Spectral Clustering Algorithm Based on Feature Selection
    Luo, Qimin
    Lu, Guangquan
    Wen, Guoqiu
    Su, Zidong
    Liu, Xingyi
    Wei, Jian
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT II, 2022, 13088 : 356 - 367