A fuzzy SV-k-modes algorithm for clustering categorical data with set-valued attributes

Cited by: 10
Authors
Cao, Fuyuan [1]
Huang, Joshua Zhexue [2]
Liang, Jiye [1]
Affiliations
[1] Shanxi Univ, Sch Comp & Informat Technol, Minist Educ, Key Lab Computat Intelligence & Chinese Informat, Taiyuan 030006, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Categorical data; Set-valued attribute; Set-valued modes; Fuzzy k-modes; Fuzzy SV-k-modes;
DOI
10.1016/j.amc.2016.09.023
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
In this paper, we propose a fuzzy SV-k-modes algorithm that uses the fuzzy k-modes clustering process to cluster categorical data with set-valued attributes. In the proposed algorithm, we use the Jaccard coefficient to measure the dissimilarity between two objects and represent the center of a cluster with set-valued modes. A heuristic method for updating the cluster prototypes is developed for the fuzzy partition matrix. These extensions enable the fuzzy SV-k-modes algorithm to cluster categorical data with single-valued and set-valued attributes together, with the fuzzy k-modes algorithm as a special case. Experimental results on synthetic data sets and three real data sets from different applications show the efficiency and effectiveness of the fuzzy SV-k-modes algorithm. (C) 2016 Elsevier Inc. All rights reserved.
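The two ingredients named in the abstract, a Jaccard-based dissimilarity and a fuzzy membership update, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the choice to sum the per-attribute Jaccard distances, and the fuzziness exponent `alpha` are assumptions made for illustration. The membership formula is the standard fuzzy k-modes one, and the heuristic prototype update from the paper is not shown because the abstract does not specify it.

```python
from typing import FrozenSet, List, Sequence

# An object is a sequence of attribute values, each value being a set of categories.
# A single-valued attribute is represented as a one-element set.
SetObject = Sequence[FrozenSet[str]]


def jaccard_dissimilarity(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Return 1 - |a & b| / |a | b|; two empty sets are treated as identical."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def object_dissimilarity(x: SetObject, y: SetObject) -> float:
    """Assumption: sum the Jaccard dissimilarity over all attributes."""
    return sum(jaccard_dissimilarity(a, b) for a, b in zip(x, y))


def fuzzy_memberships(x: SetObject, modes: List[SetObject], alpha: float = 2.0) -> List[float]:
    """Standard fuzzy k-modes membership of x in each cluster (requires alpha > 1)."""
    d = [object_dissimilarity(x, m) for m in modes]
    exact = [i for i, v in enumerate(d) if v == 0.0]
    if exact:
        # x coincides with a mode: give it full membership in the first such cluster.
        return [1.0 if i == exact[0] else 0.0 for i in range(len(d))]
    exponent = 1.0 / (alpha - 1.0)
    inverse = [(1.0 / v) ** exponent for v in d]
    total = sum(inverse)
    return [v / total for v in inverse]


if __name__ == "__main__":
    x = [frozenset({"a", "b"}), frozenset({"x"})]
    mode_1 = [frozenset({"a"}), frozenset({"y"})]
    mode_2 = [frozenset({"a", "b"}), frozenset({"y"})]
    print(fuzzy_memberships(x, [mode_1, mode_2]))
```

When every attribute value is a one-element set, the Jaccard dissimilarity is 0 for a match and 1 for a mismatch, which is the simple matching measure used by fuzzy k-modes. This is consistent with the abstract's statement that fuzzy k-modes is a special case.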
Pages: 1-15
Number of pages: 15
Related papers
39 in total
  • [21] Block Fuzzy K-modes Clustering Algorithm
    Yang, Miin-Shen
    Lin, Chih-Ying
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 384 - 389
  • [22] Categorical data clustering: 25 years beyond K-modes
    Dinh, Tai
    Wong, Hauchi
    Fournier-Viger, Philippe
    Lisik, Daniil
    Ha, Minh-Quyet
    Dam, Hieu-Chi
    Huynh, Van-Nam
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
  • [23] A modified K-means algorithm for categorical data clustering
    Sun, Y
    Zhu, QM
    Chen, ZX
    IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 31 - 37
  • [24] MMR: An algorithm for clustering categorical data using Rough Set Theory
    Parmar, Darshit
    Wu, Teresa
    Blackhurst, Jennifer
    DATA & KNOWLEDGE ENGINEERING, 2007, 63 (03) : 879 - 893
  • [25] Many-objective fuzzy centroids clustering algorithm for categorical data
    Zhu, Shuwei
    Xu, Lihong
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 96 : 230 - 248
  • [26] Application of metaheuristic based fuzzy K-modes algorithm to supplier clustering
    Kuo, R. J.
    Potti, Yuliana
    Zulvia, Ferani E.
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 120 : 298 - 307
  • [27] High Dimensional Data Clustering Algorithm Based on Sparse Feature Vector for Categorical Attributes
    Wu, Sen
    Wei, Guiying
    PROCEEDINGS OF 2010 INTERNATIONAL CONFERENCE ON LOGISTICS SYSTEMS AND INTELLIGENT MANAGEMENT, VOLS 1-3, 2010, : 973 - 976
  • [28] Hierarchical clustering algorithm for categorical data using a probabilistic rough set model
    Li, Min
    Deng, Shaobo
    Wang, Lei
    Feng, Shengzhong
    Fan, Jianping
    KNOWLEDGE-BASED SYSTEMS, 2014, 65 : 60 - 71
  • [29] A fuzzy c-means-type algorithm for clustering of data with mixed numeric and categorical attributes employing a probabilistic dissimilarity functional
    Chatzis, Sotirios P.
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) : 8684 - 8689
  • [30] Partition-and-merge based fuzzy genetic clustering algorithm for categorical data
    Thi Phuong Quyen Nguyen
    Kuo, R. J.
    APPLIED SOFT COMPUTING, 2019, 75 : 254 - 264