Improving k-means Clustering with Genetic Programming for Feature Construction

被引:3
|
作者
Lensen, Andrew [1 ]
Xue, Bing [1 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
来源
PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION) | 2017年
关键词
Cluster Analysis; Feature Construction; Genetic Programming; k-means; Evolutionary Computation;
D O I
10.1145/3067695.3075962
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
k-means is one of the most commonly used clustering algorithms in data mining. Despite this, it has a number of fundamental limitations which prevent it from performing effectively on large or otherwise difficult datasets. A common technique to improve performance of data mining algorithms is feature construction, a technique which combines features together to produce more powerful constructed features that can improve the performance of a given algorithm. Genetic Programming (GP) has been used for feature construction very successfully, due to its program-like structure. This paper proposes two representations for using GP to perform feature construction to improve the performance of k-means, using a wrapper approach. Our results show significant improvements in performance compared to k-means using all original features across six difficult datasets.
引用
收藏
页码:237 / 238
页数:2
相关论文
共 50 条
  • [31] A Coloured Image Watermarking Based on Genetic K-Means Clustering Methodology
    Hassan, Zainab Falah
    Al-Shareefi, Farah
    Gheni, Hadeel Qasem
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (02) : 242 - 249
  • [32] Enhancing Stock Prediction Clustering Using K-Means with Genetic Algorithm
    Desokey, Eslam Nader
    Badr, Amr
    Hegazy, Abdel Fatah
    2017 13TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2017, : 256 - 261
  • [33] Minkowski metric, feature weighting and anomalous cluster initializing in K-Means clustering
    de Amorim, Renato Cordeiro
    Mirkin, Boris
    PATTERN RECOGNITION, 2012, 45 (03) : 1061 - 1075
  • [34] Broad Learning System: Feature extraction based on K-means clustering algorithm
    Liu, Zhulin
    Zhou, Jin
    Chen, C. L. Philip
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION, CYBERNETICS AND COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2017, : 683 - 687
  • [35] Entropy K-Means Clustering With Feature Reduction Under Unknown Number of Clusters
    Sinaga, Kristina P.
    Hussain, Ishtiaq
    Yang, Miin-Shen
    IEEE ACCESS, 2021, 9 : 67736 - 67751
  • [36] A hybrid clustering technique combining a novel genetic algorithm with K-Means
    Rahman, Md Anisur
    Islam, Md Zahidul
    KNOWLEDGE-BASED SYSTEMS, 2014, 71 : 345 - 365
  • [37] Improved K-means clustering algorithm
    Zhang, Zhe
    Zhang, Junxi
    Xue, Huifeng
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 169 - 172
  • [38] Random Projection for k-means Clustering
    Sieranoja, Sami
    Franti, Pasi
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2018, PT I, 2018, 10841 : 680 - 689
  • [39] A notion of stability for k-means clustering
    Le Gouic, T.
    Paris, Q.
    ELECTRONIC JOURNAL OF STATISTICS, 2018, 12 (02): : 4239 - 4263
  • [40] The MinMax k-Means clustering algorithm
    Tzortzis, Grigorios
    Likas, Aristidis
    PATTERN RECOGNITION, 2014, 47 (07) : 2505 - 2516