A modified Fuzzy k-Partition based on indiscernibility relation for categorical data clustering

被引:17
|
作者
Yanto, Iwan Tri Riyadi [1 ]
Ismail, Maizatul Akmar [2 ]
Herawan, Tutut [2 ]
机构
[1] Univ Ahmad Dahlan, Dept Informat Syst, Yogyakarta, Indonesia
[2] Univ Malaya, Dept Informat Syst, Kuala Lumpur 50603, Malaysia
关键词
Clustering; Categorical data; Fuzzy k-Partition; Indescernibility relation; ALGORITHM; MODEL;
D O I
10.1016/j.engappai.2016.01.026
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Categorical data clustering has been adopted by many scientific communities to classify objects from large databases. In order to classify the objects, Fuzzy k-Partition approach has been proposed for categorical data clustering. However, existing Fuzzy k-Partition approaches suffer from high computational time and low clustering accuracy. Moreover, the parameter maximize of the classification likelihood function in Fuzzy k-Partition approach will always have the same categories, hence producing the same results. To overcome these issues, we propose a modified Fuzzy k-Partition based on indiscernibility relation. The indiscernibility relation induces an approximation space which is constructed by equivalence classes of indiscernible objects, thus it can be applied to classify categorical data. The novelty of the proposed approach is that unlike previous approach that use the likelihood function of multivariate multinomial distributions, the proposed approach is based on indescernibility relation. We performed an extensive theoretical analysis of the proposed approach to show its effectiveness in achieving lower computational complexity. Further, we compared the proposed approach with Fuzzy Centroid and Fuzzy k-Partition approaches in terms of response time and clustering accuracy on several UCI benchmark and real world datasets. The results show that the proposed approach achieves lower response time and higher clustering accuracy as compared to other Fuzzy k-based approaches. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:41 / 52
页数:12
相关论文
共 50 条
  • [1] Clustering of Categorical Data Using Intuitionistic Fuzzy k-modes
    Mehta, Darshan
    Tripathy, B. K.
    PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2016), VOL 1, 2017, 546 : 254 - 263
  • [2] A Framework of Fuzzy Partition Based on Artificial Bee Colony for Categorical Data Clustering
    Yanto, Iwan Tri Riyadi
    Saadi, Younes
    Hartama, Dedy
    Ismi, Dewi Pramudi
    Pranolo, Andri
    PROCEEDINGS OF 2016 2ND INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH) - INFORMATION SCIENCE FOR GREEN SOCIETY AND ENVIRONMENT, 2016, : 260 - 263
  • [3] Partition-and-merge based fuzzy genetic clustering algorithm for categorical data
    Thi Phuong Quyen Nguyen
    Kuo, R. J.
    APPLIED SOFT COMPUTING, 2019, 75 : 254 - 264
  • [4] A fuzzy k-modes algorithm for clustering categorical data
    Huang, ZX
    Ng, MK
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1999, 7 (04) : 446 - 452
  • [5] Fuzzy rough clustering for categorical data
    Xu, Shuliang
    Liu, Shenglan
    Zhou, Jian
    Feng, Lin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (11) : 3213 - 3223
  • [6] A modified K-means algorithm for categorical data clustering
    Sun, Y
    Zhu, QM
    Chen, ZX
    IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 31 - 37
  • [7] Ensemble based rough fuzzy clustering for categorical data
    Saha, Indrajit
    Sarkar, Jnanendra Prasad
    Maulik, Ujjwal
    KNOWLEDGE-BASED SYSTEMS, 2015, 77 : 114 - 127
  • [8] Kernel-Based k-Representatives Algorithm for Fuzzy Clustering of Categorical Data
    Mau, Toan Nguyen
    Huynh, Van-Nam
    IEEE CIS INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS 2021 (FUZZ-IEEE), 2021,
  • [9] A genetic fuzzy k-Modes algorithm for clustering categorical data
    Gan, G.
    Wu, J.
    Yang, Z.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1615 - 1620
  • [10] Formulations of fuzzy clustering for categorical data
    Umayahara, Kazutaka
    Miyamoto, Sadaaki
    Nakamori, Yoshiteru
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2005, 1 (01): : 83 - 94