When do birds of a feather flock together? k-Means, proximity, and conic programming

被引:18
|
作者
Li, Xiaodong [1 ]
Li, Yang [2 ]
Ling, Shuyang [3 ,4 ]
Strohmer, Thomas [2 ]
Wei, Ke [5 ]
机构
[1] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
[2] Univ Calif Davis, Dept Math, Davis, CA 95616 USA
[3] Courant Inst Math Sci, New York, NY 10012 USA
[4] Ctr Data Sci, New York, NY 10012 USA
[5] Fudan Univ, Sch Data Sci, Shanghai 200433, Peoples R China
关键词
Convex relaxation; k-Means; Clustering; Gaussian mixture model; AUGMENTED LAGRANGIAN METHOD; SEMIDEFINITE; MIXTURES;
D O I
10.1007/s10107-018-1333-x
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given a set of data, one central goal is to group them into clusters based on some notion of similarity between the individual objects. One of the most popular and widely-used approaches is k-means despite the computational hardness to find its global minimum. We study and compare the properties of different convex relaxations by relating them to corresponding proximity conditions, an idea originally introduced by Kumar and Kannan. Using conic duality theory, we present an improved proximity condition under which the Peng-Wei relaxation of k-means recovers the underlying clusters exactly. Our proximity condition improves upon Kumar and Kannan and is comparable to that of Awashti and Sheffet, where proximity conditions are established for projective k-means. In addition, we provide a necessary proximity condition for the exactness of the Peng-Wei relaxation. For the special case of equal cluster sizes, we establish a different and completely localized proximity condition under which the Amini-Levina relaxation yields exact clustering, thereby having addressed an open problem by Awasthi and Sheffet in the balanced case. Our framework is not only deterministic and model-free but also comes with a clear geometric meaning which allows for further analysis and generalization. Moreover, it can be conveniently applied to analyzing various data generative models such as the stochastic ball models and Gaussian mixture models. With this method, we improve the current minimum separation bound for the stochastic ball models and achieve the state-of-the-art results of learning Gaussian mixture models.
引用
收藏
页码:295 / 341
页数:47
相关论文
共 50 条
  • [1] When do birds of a feather flock together? k-Means, proximity, and conic programming
    Xiaodong Li
    Yang Li
    Shuyang Ling
    Thomas Strohmer
    Ke Wei
    Mathematical Programming, 2020, 179 : 295 - 341
  • [2] Do birds of a feather flock together?: Economic linkage and geographic proximity
    Jungyul Sohn
    The Annals of Regional Science, 2004, 38 : 47 - 73
  • [3] Do birds of a feather flock together?: Economic linkage and geographic proximity
    Sohn, J
    ANNALS OF REGIONAL SCIENCE, 2004, 38 (01): : 47 - 73
  • [4] Birds of a feather - Do they flock together?
    Schaffner, W
    INFECTION CONTROL AND HOSPITAL EPIDEMIOLOGY, 1997, 18 (03): : 162 - 164
  • [5] Do Birds of a Feather Flock Together?
    Curry, Oliver
    Dunbar, Robin I. M.
    HUMAN NATURE-AN INTERDISCIPLINARY BIOSOCIAL PERSPECTIVE, 2013, 24 (03): : 336 - 347
  • [6] Do birds of a feather flock together in China?
    Chen, Hao
    Luo, Shanhong
    Yue, Guoan
    Xu, Dan
    Zhaoyang, Ruixue
    PERSONAL RELATIONSHIPS, 2009, 16 (02) : 167 - 186
  • [7] Depression and Diabetic Peripheral Neuropathy: Birds of a Feather, But When do They Flock Together?
    Vas, Prashanth R. J.
    Papanas, Nikolaos
    EXPERIMENTAL AND CLINICAL ENDOCRINOLOGY & DIABETES, 2020, 128 (05) : 347 - 349
  • [8] Birds of a Feather, Do Sanctioned States Flock Together?
    Early, Bryan R.
    FOREIGN POLICY ANALYSIS, 2021, 17 (03)
  • [9] Birds of a feather flock together
    Mark Peifer
    Nature, 1998, 395 : 324 - 325
  • [10] Birds of a feather flock together?
    Eloire, Fabien
    ACTES DE LA RECHERCHE EN SCIENCES SOCIALES, 2014, (205) : 104 - +