Attribute weights-based clustering centres algorithm for initialising K-modes clustering

被引:3
|
作者
Liwen Peng
Yongguo Liu
机构
[1] University of Electronic Science and Technology of China,Knowledge and Data Engineering Laboratory of Chinese Medicine, School of Information and Software Engineering
来源
Cluster Computing | 2019年 / 22卷
关键词
Clustering centers; Weight; Density; Distance;
D O I
暂无
中图分类号
学科分类号
摘要
The K-modes algorithm based on partitional clustering technology is a very popular and effective clustering method; moreover, it handles categorical data. However, the performance of the K-modes method is largely affected by the initial clustering centres. Random selection of the initial clustering centres commonly leads to non-repeatable clustering result. Hence, suitable choice of the initial clustering centres is crucial to realizing high-performance K-modes clustering. The present article develops an initialisation algorithm for K-modes. At initialisation, the distance between two instances calculated after weighting the attributes of the instances. Many studies have shown that if clustering is based only on distances or density between the instances, the clustering revolves around one centre or the outliers. Therefore, based on the attribute weights, we combine the distance and density measures to select the clustering centres. In experiments on several UCI machine learning repository benchmark datasets, the new initialisation method outperformed the existing K-modes clustering methods.
引用
收藏
页码:6171 / 6179
页数:8
相关论文
共 50 条
  • [41] Initialization of K-modes clustering using outlier detection techniques
    Jiang, Feng
    Liu, Guozhu
    Du, Junwei
    Sui, Yuefei
    INFORMATION SCIENCES, 2016, 332 : 167 - 183
  • [42] Semantically Enhanced Clustering in Retail Using Possibilistic K-Modes
    Ammar, Asma
    Elouedi, Zied
    Lingras, Pawan
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, RSKT 2014, 2014, 8818 : 753 - 764
  • [43] The Impact of Cluster Representatives on the Convergence of the K-Modes Type Clustering
    Bai, Liang
    Liang, Jiye
    Dang, Chuangyin
    Cao, Fuyuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (06) : 1509 - 1522
  • [44] An Improved K-modes Clustering Algorithm Based on Intra-cluster and Inter-cluster Dissimilarity Measure
    Zhou, Hongfang
    Zhang, Yihui
    Liu, Yibin
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE & APPLICATION TECHNOLOGY (ICCIA 2017), 2017, 74 : 410 - 418
  • [45] A Fast Parallel K-Modes Algorithm for Clustering Nucleotide Sequences to Predict Translation Initiation Sites
    Castro, Guilherme Torres
    Zarate, Luis Enrique
    Nobre, Cristiane Neri
    Freitas, Henrique Cota
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2019, 26 (05) : 442 - 456
  • [46] EGA-FMC: enhanced genetic algorithm-based fuzzy k-modes clustering for categorical data
    Narasimhan, Medhini
    Balasubramanian, Balaji
    Kumar, Suryansh D.
    Patil, Nagamma
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2018, 11 (04) : 219 - 228
  • [47] Initial seed selection for K-modes clustering-A distance and density based approach
    Sajidha, S. A.
    Chodnekar, Siddha Prabhu
    Desikan, Kalyani
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (06) : 693 - 701
  • [48] Metaheuristic-based possibilistic fuzzy k-modes algorithms for categorical data clustering
    Kuo, R. J.
    Zheng, Y. R.
    Thi Phuong Quyen Nguyen
    INFORMATION SCIENCES, 2021, 557 : 1 - 15
  • [49] FKMAWCW: Categorical fuzzy k-modes clustering with automated attribute-weight and cluster-weight learning
    Oskouei, Amin Golzari
    Balafar, Mohammad Ali
    Motamed, Cina
    CHAOS SOLITONS & FRACTALS, 2021, 153
  • [50] Novel dynamic k-modes clustering of categorical and non categorical dataset with optimized genetic algorithm based feature selection
    Suryanarayana, G.
    Prakash, L. N. C. K.
    Mahesh, P. C. Senthil
    Bhaskar, T.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24399 - 24418