On the impact of dissimilarity measure in k-modes clustering algorithm

被引:140
作者
Ng, Michael K. [1 ]
Li, Mark Junjie
Huang, Joshua Zhexue
He, Zengyou
机构
[1] Hong Kong Baptist Univ, Dept Math, Hong Kong, Hong Kong, Peoples R China
[2] Univ Hong Kong, E Business Technol Inst, Hong Kong, Hong Kong, Peoples R China
[3] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
data mining; clustering; k-modes algorithm; categorical data;
D O I
10.1109/TPAMI.2007.53
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This correspondence describes extensions to the k-modes algorithm for clustering categorical data. By modifying a simple matching dissimilarity measure for categorical objects, a heuristic approach was developed in [4], [12] which allows the use of the k- modes paradigm to obtain a cluster with strong intrasimilarity and to efficiently cluster large categorical data sets. The main aim of this paper is to rigorously derive the updating formula of the k- modes clustering algorithm with the new dissimilarity measure and the convergence of the algorithm under the optimization framework.
引用
收藏
页码:503 / 507
页数:5
相关论文
共 50 条
  • [1] A dissimilarity measure for the k-Modes clustering algorithm
    Cao, Fuyuan
    Liang, Jiye
    Li, Deyu
    Bai, Liang
    Dang, Chuangyin
    KNOWLEDGE-BASED SYSTEMS, 2012, 26 : 120 - 127
  • [2] An Improved K-modes Clustering Algorithm Based on Intra-cluster and Inter-cluster Dissimilarity Measure
    Zhou, Hongfang
    Zhang, Yihui
    Liu, Yibin
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE & APPLICATION TECHNOLOGY (ICCIA 2017), 2017, 74 : 410 - 418
  • [3] A Global K-modes Algorithm for Clustering Categorical Data
    Bai Tian
    Kulikowski, C. A.
    Gong Leiguang
    Yang Bin
    Huang Lan
    Zhou Chunguang
    CHINESE JOURNAL OF ELECTRONICS, 2012, 21 (03): : 460 - 465
  • [4] A fuzzy k-modes algorithm for clustering categorical data
    Huang, ZX
    Ng, MK
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1999, 7 (04) : 446 - 452
  • [5] A dissimilarity measure for mixed nominal and ordinal attribute data in k-Modes algorithm
    Yuan, Fang
    Yang, Youlong
    Yuan, Tiantian
    APPLIED INTELLIGENCE, 2020, 50 (05) : 1498 - 1509
  • [6] A dissimilarity measure for mixed nominal and ordinal attribute data in k-Modes algorithm
    Fang Yuan
    Youlong Yang
    Tiantian Yuan
    Applied Intelligence, 2020, 50 : 1498 - 1509
  • [7] A note on K-modes clustering
    Huang, ZX
    Ng, MK
    JOURNAL OF CLASSIFICATION, 2003, 20 (02) : 257 - 261
  • [8] Genetic distance measure for K-modes algorithm
    Chiang, Ching-San
    Chu, Shu-Chuan
    Hsin, Yi-Chih
    Wang, Ming-Hui
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2006, 2 (01): : 33 - 40
  • [9] CLEKMODES: a modified k-modes clustering algorithm
    Mastrogiannis, N.
    Giannikos, I.
    Boutsinas, B.
    Antzoulatos, G.
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2009, 60 (08) : 1085 - 1095
  • [10] K-modes clustering
    Chaturvedi, A
    Green, PE
    Carroll, JD
    JOURNAL OF CLASSIFICATION, 2001, 18 (01) : 35 - 55