A fuzzy SV-k-modes algorithm for clustering categorical data with set-valued attributes

Cited by: 10
Authors
Cao, Fuyuan [1 ]
Huang, Joshua Zhexue [2 ]
Liang, Jiye [1 ]
Affiliations
[1] Shanxi Univ, Sch Comp & Informat Technol, Minist Educ, Key Lab Computat Intelligence & Chinese Informat, Taiyuan 030006, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Categorical data; Set-valued attribute; Set-valued modes; Fuzzy k-modes; Fuzzy SV-k-modes;
DOI
10.1016/j.amc.2016.09.023
Chinese Library Classification
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
In this paper, we propose a fuzzy SV-k-modes algorithm that uses the fuzzy k-modes clustering process to cluster categorical data with set-valued attributes. In the proposed algorithm, we use the Jaccard coefficient to measure the dissimilarity between two objects and represent the center of a cluster with set-valued modes. A heuristic method for updating the cluster prototypes is developed for the fuzzy partition matrix. These extensions enable the fuzzy SV-k-modes algorithm to cluster categorical data with both single-valued and set-valued attributes, and the fuzzy k-modes algorithm is a special case of it. Experimental results on synthetic data sets and three real data sets from different applications show the efficiency and effectiveness of the fuzzy SV-k-modes algorithm. (C) 2016 Elsevier Inc. All rights reserved.
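The two ingredients named in the abstract, a Jaccard-based dissimilarity over set-valued attributes and a fuzzy membership computation, can be illustrated with a minimal sketch. This is not the paper's implementation. It assumes that the per-attribute dissimilarity is one minus the Jaccard coefficient, that object dissimilarity is the sum over attributes, and that memberships follow the standard fuzzy k-modes update with fuzziness exponent `alpha`. The function names and the zero-distance tie rule are hypothetical, and the heuristic prototype update mentioned in the abstract is not shown.

```python
from typing import FrozenSet, List, Sequence

SetValue = FrozenSet[str]      # one set-valued attribute value
Obj = Sequence[SetValue]       # one object: a tuple of set-valued attributes


def jaccard_dissimilarity(a: SetValue, b: SetValue) -> float:
    """Return 1 - |a & b| / |a | b| (assumed form; 0.0 when both sets are empty)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def object_dissimilarity(x: Obj, z: Obj) -> float:
    """Sum the per-attribute Jaccard dissimilarities (assumed aggregation)."""
    return sum(jaccard_dissimilarity(a, b) for a, b in zip(x, z))


def fuzzy_memberships(x: Obj, prototypes: List[Obj], alpha: float = 1.5) -> List[float]:
    """Standard fuzzy k-modes membership update for one object (alpha > 1)."""
    d = [object_dissimilarity(x, z) for z in prototypes]
    zeros = [i for i, v in enumerate(d) if v == 0.0]
    if zeros:
        # Hypothetical tie rule: full membership in the first matching prototype.
        return [1.0 if i == zeros[0] else 0.0 for i in range(len(d))]
    exponent = 1.0 / (alpha - 1.0)
    return [1.0 / sum((dl / dh) ** exponent for dh in d) for dl in d]


# Usage: one object with a single set-valued attribute and two prototypes.
x = (frozenset({"a", "b"}),)
prototypes = [(frozenset({"a"}),), (frozenset({"c"}),)]
memberships = fuzzy_memberships(x, prototypes, alpha=2.0)
```

A single-valued attribute is the special case of a one-element set, for which the assumed Jaccard dissimilarity reduces to the simple 0/1 matching measure used by k-modes.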
Pages: 1-15
Number of pages: 15