Global optimization for cardinality-constrained minimum sum-of-squares clustering via semidefinite programming

被引:3
|
作者
Piccialli, Veronica [1 ]
Sudoso, Antonio M. [1 ]
机构
[1] Sapienza Univ Rome, Dept Comp Control & Management Engn, Via Ariosto 25, I-00185 Rome, Italy
关键词
Global optimization; Constrained clustering; Semidefinite programming; Branch-and-cut; Distance geometry; K-MEANS; INITIALIZATION; RELAXATIONS; BOUNDS;
D O I
10.1007/s10107-023-02021-8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The minimum sum-of-squares clustering (MSSC), or k-means type clustering, has been recently extended to exploit prior knowledge on the cardinality of each cluster. Such knowledge is used to increase performance as well as solution quality. In this paper, we propose a global optimization approach based on the branch-and-cut technique to solve the cardinality-constrained MSSC. For the lower bound routine, we use the semidefinite programming (SDP) relaxation recently proposed by Rujeerapaiboon et al. (SIAM J Optim 29(2):1211-1239, 2019). However, this relaxation can be used in a branch-and-cut method only for small-size instances. Therefore, we derive a new SDP relaxation that scales better with the instance size and the number of clusters. In both cases, we strengthen the bound by adding polyhedral cuts. Benefiting from a tailored branching strategy which enforces pairwise constraints, we reduce the complexity of the problems arising in the children nodes. For the upper bound, instead, we present a local search procedure that exploits the solution of the SDP relaxation solved at each node. Computational results show that the proposed algorithm globally solves, for the first time, real-world instances of size 10 times larger than those solved by state-of-the-art exact methods.
引用
收藏
页数:35
相关论文
共 50 条
  • [21] Analysis of global k-means, an incremental heuristic for minimum sum-of-squares clustering
    Hansen, P
    Ngai, E
    Cheung, BK
    Mladenovic, N
    JOURNAL OF CLASSIFICATION, 2005, 22 (02) : 287 - 310
  • [22] Analysis of Global k-Means, an Incremental Heuristic for Minimum Sum-of-Squares Clustering
    Pierre Hansen
    Eric Ngai
    Bernard K. Cheung
    Nenad Mladenovic
    Journal of Classification, 2005, 22 : 287 - 310
  • [23] Modified global k-means algorithm for minimum sum-of-squares clustering problems
    Bagirov, Adil M.
    PATTERN RECOGNITION, 2008, 41 (10) : 3192 - 3199
  • [24] A tabu search approach for the minimum sum-of-squares clustering problem
    Liu, Yongguo
    Yi, Zhang
    Wu, Hong
    Ye, Mao
    Chen, Kefei
    INFORMATION SCIENCES, 2008, 178 (12) : 2680 - 2704
  • [25] Strategic oscillation for the balanced minimum sum-of-squares clustering problem
    Martin-Santamaria, R.
    Sanchez-Oro, J.
    Perez-Pelo, S.
    Duarte, A.
    INFORMATION SCIENCES, 2022, 585 : 529 - 542
  • [26] An improved column generation algorithm for minimum sum-of-squares clustering
    Daniel Aloise
    Pierre Hansen
    Leo Liberti
    Mathematical Programming, 2012, 131 : 195 - 220
  • [27] An improved column generation algorithm for minimum sum-of-squares clustering
    Aloise, Daniel
    Hansen, Pierre
    Liberti, Leo
    MATHEMATICAL PROGRAMMING, 2012, 131 (1-2) : 195 - 220
  • [28] NP-Hardness of balanced minimum sum-of-squares clustering
    Pyatkin, Artem
    Aloise, Daniel
    Mladenovic, Nenad
    PATTERN RECOGNITION LETTERS, 2017, 97 : 44 - 45
  • [29] Continuous optimization approaches for clustering via minimum sum of squares
    Akteke-Ozturk, Basak
    Weber, Gerhard-Wilhelm
    Kropat, Erik
    20TH INTERNATIONAL CONFERENCE, EURO MINI CONFERENCE CONTINUOUS OPTIMIZATION AND KNOWLEDGE-BASED TECHNOLOGIES, EUROPT'2008, 2008, : 253 - +
  • [30] Variable neighborhood search for minimum sum-of-squares clustering on networks
    Carrizosa, Emilio
    Mladenovic, Nenad
    Todosijevic, Raca
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2013, 230 (02) : 356 - 363