SCALING UP DISCRETE DISTRIBUTION CLUSTERING USING ADMM

被引:0
作者
Ye, Jianbo [1 ]
Li, Jia [1 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
来源
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2014年
关键词
Discrete Distribution; Clustering; Large-Scale Learning; ADMM; EARTH-MOVERS-DISTANCE; ALGORITHM;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The discrete distribution as a sparse representation, equipped with the Kantorovich-Wasserstein metric, has been proven effective in learning tasks on imagery data. However, clustering based on the Kantorovich metric under a principled optimization criterion is computationally challenging, and has not been adequately explored. In this paper, we focus on the scalability issue and develop a new algorithm for clustering distributions. An optimal centroid or representative distribution in the sense of the Kantorovich metric is solved for each cluster. The key idea is to adapt the state-of-the-art distributed optimization approach called alternating direction method of multipliers (ADMM). The new algorithm achieves linear complexity in the update of each centroid and can be easily parallelizable, improving significantly over the existing method. It is also observed that in practice, satisfactory results can be obtained after a few tens of iterations. We conduct experiments on both synthetic and real data to demonstrate the computational efficiency and accuracy of the new algorithm.
引用
收藏
页码:5267 / 5271
页数:5
相关论文
共 18 条
  • [1] [Anonymous], 2008, Computer Vision and Pattern Recognition
  • [2] [Anonymous], 1983, AUGMENTED LAGRANGIAN, DOI DOI 10.1016/S0168-2024(08)70034-1
  • [3] Applegate D., 2011, P 17 ACM SIGKDD INT, P636, DOI DOI 10.1145/2020408.2020508
  • [4] Displacement Interpolation Using Lagrangian Mass Transport
    Bonneel, Nicolas
    van de Panne, Michiel
    Paris, Sylvain
    Heidrich, Wolfgang
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (06):
  • [5] Distributed optimization and statistical learning via the alternating direction method of multipliers
    Boyd S.
    Parikh N.
    Chu E.
    Peleato B.
    Eckstein J.
    [J]. Foundations and Trends in Machine Learning, 2010, 3 (01): : 1 - 122
  • [6] ON THE DOUGLAS-RACHFORD SPLITTING METHOD AND THE PROXIMAL POINT ALGORITHM FOR MAXIMAL MONOTONE-OPERATORS
    ECKSTEIN, J
    BERTSEKAS, DP
    [J]. MATHEMATICAL PROGRAMMING, 1992, 55 (03) : 293 - 318
  • [7] Gabay D., 1976, Computers & Mathematics with Applications, V2, P17, DOI 10.1016/0898-1221(76)90003-1
  • [8] GLOWINSKI R, 1975, REV FR AUTOMAT INFOR, V9, P41
  • [9] Dynamic clustering of histogram data based on adaptive squared Wasserstein distances
    Irpino, Antonio
    Verde, Rosanna
    De Carvalho, Francisco de A. T.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (07) : 3351 - 3366
  • [10] Real-time computerized annotation of pictures
    Li, Jia
    Wang, James Z.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (06) : 985 - 1002