Modified global k-means algorithm for minimum sum-of-squares clustering problems

被引:143
作者
Bagirov, Adil M. [1 ]
机构
[1] Univ Ballarat, Ctr Informat & Appl Optimizat, Sch Informat Technol & Math Sci, Ballarat, Vic 3353, Australia
关键词
minimum sum-of-squares clustering; nonsmooth optimization; k-means algorithm; global k-means algorithm;
D O I
10.1016/j.patcog.2008.04.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
k-Means algorithm and its variations are known to be fast clustering algorithms. However, they are sensitive to the choice of starting points and inefficient for solving clustering problems in large data sets. Recently, a new version of the k-means algorithm, the global k-means algorithm has been developed. It is an incremental algorithm that dynamically adds one cluster center at a time and uses each data point as a candidate for the k-th cluster center. Results of numerical experiments show that the global k-means algorithm considerably outperforms the k-means algorithms. In this paper, a new version of the global k-means algorithm is proposed. A starting point for the k-th cluster center in this algorithm is computed by minimizing an auxiliary cluster function. Results of numerical experiments on 14 data sets demonstrate the superiority of the new algorithm, however, it requires more computational time than the global k-means algorithm. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3192 / 3199
页数:8
相关论文
共 23 条
  • [1] A TABU SEARCH APPROACH TO THE CLUSTERING PROBLEM
    ALSULTAN, KS
    [J]. PATTERN RECOGNITION, 1995, 28 (09) : 1443 - 1451
  • [2] [Anonymous], 1991, ORSA J COMPUT
  • [3] A Global Optimization Approach to Classification
    Bagirov, Adil M.
    Rubinov, Alexander M.
    Yearwood, John
    [J]. OPTIMIZATION AND ENGINEERING, 2002, 3 (02) : 129 - 155
  • [4] A new nonsmooth optimization algorithm for minimum sum-of-squares clustering problems
    Bagirov, AM
    Yearwood, J
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2006, 170 (02) : 578 - 596
  • [5] BAGIROV AM, 2003, SPANISH OPER RES J, V11, P1
  • [6] Bock H. H., 1998, ADV DATA SCI CLASSIF, P265
  • [7] A PRACTICAL APPLICATION OF SIMULATED ANNEALING TO CLUSTERING
    BROWN, DE
    HUNTLEY, CL
    [J]. PATTERN RECOGNITION, 1992, 25 (04) : 401 - 412
  • [8] EVALUATION OF A BRANCH AND BOUND ALGORITHM FOR CLUSTERING
    DIEHR, G
    [J]. SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1985, 6 (02): : 268 - 284
  • [9] An interior point algorithm for minimum sum-of-squares clustering
    Du Merle, O
    Hansen, P
    Jaumard, B
    Mladenovic, N
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2000, 21 (04) : 1485 - 1505
  • [10] CLUSTERING TECHNIQUES - USERS DILEMMA
    DUBES, R
    JAIN, AK
    [J]. PATTERN RECOGNITION, 1976, 8 (04) : 247 - 260