An ellipsoidal K-means for document clustering

被引:3
作者
Dzogang, Fabon [1 ]
Marsala, Christophe [1 ]
Lesot, Marie-Jeanne [1 ]
Rifqi, Maria [2 ,3 ]
机构
[1] Univ Paris 06, UMR7606, LIP6, Paris, France
[2] LIP6, Paris, France
[3] Univ Pantheon Assas, Paris, France
来源
12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012) | 2012年
关键词
clustering; feature selection; spherical k-means; information retrieval;
D O I
10.1109/ICDM.2012.126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an extension of the spherical K-means algorithm to deal with settings where the number of data points is largely inferior to the number of dimensions. We assume the data to lie in local and dense regions of the original space and we propose to embed each cluster into its specific ellipsoid. A new objective function is introduced, analytical solutions are derived for both the centroids and the associated ellipsoids. Furthermore, a study on the complexity of this algorithm highlights that it is of same order as the regular K-means algorithm. Results on both synthetic and real data show the efficiency of the proposed method.
引用
收藏
页码:221 / 230
页数:10
相关论文
共 50 条
  • [41] A Comparative Study of K-Means, K-Means plus plus and Fuzzy C-Means Clustering Algorithms
    Kapoor, Akanksha
    Singhal, Abhishek
    2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,
  • [42] Fast K-means for Large Scale Clustering
    Hu, Qinghao
    Wu, Jiaxiang
    Bai, Lu
    Zhang, Yifan
    Cheng, Jian
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2099 - 2102
  • [43] A Novel ELM K-Means Algorithm for Clustering
    Alshamiri, Abobakr Khalil
    Surampudi, Bapi Raju
    Singh, Alok
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, SEMCCO 2014, 2015, 8947 : 212 - 222
  • [44] Entropy Weighted Power k-Means Clustering
    Chakraborty, Saptarshi
    Paul, Debolina
    Das, Swagatam
    Xu, Jason
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 691 - 700
  • [45] A k-means approach to clustering disease progressions
    Duc Thanh Anh Luong
    Chandola, Varun
    2017 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2017, : 268 - 274
  • [46] A Novel MapReduce Based k-Means Clustering
    Sinha, Ankita
    Jana, Prasanta K.
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND COMMUNICATION, 2017, 458 : 247 - 255
  • [47] Randomized Dimensionality Reduction for k-Means Clustering
    Boutsidis, Christos
    Zouzias, Anastasios
    Mahoney, Michael W.
    Drineas, Petros
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (02) : 1045 - 1062
  • [48] Seeding on Samples for Accelerating K-Means Clustering
    Low, Jia Shun
    Ghafoori, Zahra
    Bezdek, James C.
    Leckie, Christopher
    3RD INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS (BDIOT 2019), 2018, : 41 - 45
  • [49] Comparison of conventional and rough K-means clustering
    Lingras, P
    Yan, R
    West, C
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, 2003, 2639 : 130 - 137
  • [50] An Efficient Global K-means Clustering Algorithm
    Xie, Juanying
    Jiang, Shuai
    Xie, Weixin
    Gao, Xinbo
    JOURNAL OF COMPUTERS, 2011, 6 (02) : 271 - 279