A novel feature selection approach based on clustering algorithm

被引:7
|
作者
Moslehi, Fateme [1 ]
Haeri, Abdorrahman [2 ]
机构
[1] Iran Univ Sci & Technol, Informat Technol Engn, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Ind Engn, Tehran, Iran
关键词
Data mining; clustering; K-means algorithm; feature selection; FEATURE SUBSET-SELECTION; GRAVITATIONAL SEARCH ALGORITHM; PARTICLE SWARM OPTIMIZATION; MUTUAL INFORMATION; CLASSIFICATION; HYBRID; REDUCTION;
D O I
10.1080/00949655.2020.1822358
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Clustering is one of the main methods of data mining. K-means algorithm is one of the most common clustering algorithms due to its efficiency and ease of use. In many data mining issues, the dataset contains a large number of fields and, therefore, the identification of the effective fields is an important issue. Appling the proposed algorithm, the important variables of the dataset would be identified. In the proposed method, the dataset is clustered in several stages and in each step the characteristics of the created clusters are examined and the features that transform the structure of clusters are introduced as effective features of the dataset. The proposed method was examined on 4 datasets and the results of this method were compared with other similar work and demonstrated that using this algorithm would eliminate redundant and unrelated features of the dataset and improve classification accuracy.
引用
收藏
页码:581 / 604
页数:24
相关论文
共 50 条
  • [41] A Novel Feature Selection by Clustering Coefficients of Variations
    Fong, Simon
    Liang, Justin
    Wong, Raymond
    Ghanavati, Mojgan
    2014 NINTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2014, : 205 - 210
  • [42] A novel feature-selection approach based on the cuttlefish optimization algorithm for intrusion detection systems
    Eesa, Adel Sabry
    Orman, Zeynep
    Brifcani, Adnan Mohsin Abdulazeez
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (05) : 2670 - 2679
  • [43] A Novel Approach for Feature Selection
    Swapna, Ch. Swetha
    Kumar, V. Vijaya
    Murthy, J. V. R.
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, 2015, 339 : 877 - 885
  • [44] A Hybrid Approach for Feature Selection Based on Genetic Algorithm and Recursive Feature Elimination
    Rani, Pooja
    Kumar, Rajneesh
    Jain, Anurag
    Chawla, Sunil Kumar
    INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2021, 12 (02) : 17 - 38
  • [45] Feature selection based on partition clustering
    Liu, Shuang
    Zhao, Qiang
    Wu, Xiang
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2014, 18 (02) : 135 - 142
  • [46] Clustering-based feature selection
    School of Informatics, Guangdong University of Foreign Studies, Guangzhou 510006, China
    Tien Tzu Hsueh Pao, 2008, SUPPL. (157-160):
  • [47] Unsupervised Feature Selection Technique Based on Genetic Algorithm for Improving the Text Clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [48] A novel feature selection algorithm based on LVQ hypothesis margin
    Hu, Yaomin
    Liu, Weiming
    NEURAL COMPUTING & APPLICATIONS, 2014, 24 (06): : 1431 - 1439
  • [49] Introducing clustering based population in Binary Gravitational Search Algorithm for Feature Selection
    Guha, Ritam
    Ghosh, Manosij
    Chakrabarti, Akash
    Sarkar, Ram
    Mirjalili, Seyedali
    APPLIED SOFT COMPUTING, 2020, 93
  • [50] A human body physiological feature selection algorithm based on filtering and improved clustering
    Chen, Bo
    Yu, Jie
    Gao, Xiu-e
    Zheng, Qing-Guo
    PLOS ONE, 2018, 13 (10):