A fuzzy SV-k-modes algorithm for clustering categorical data with set-valued attributes

被引:10
作者
Cao, Fuyuan [1 ]
Huang, Joshua Zhexue [2 ]
Liang, Jiye [1 ]
机构
[1] Shanxi Univ, Sch Comp & Informat Technol, Minist Educ, Key Lab Computat Intelligence & Chinese Informat, Taiyuan 030006, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Categorical data; Set-valued attribute; Set-valued modes; Fuzzy k-modes; Fuzzy SV-k-modes;
D O I
10.1016/j.amc.2016.09.023
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we propose a fuzzy SV-k-modes algorithm that uses the fuzzy k-modes clustering process to cluster categorical data with set-valued attributes. In the proposed algorithm, we use Jaccard coefficient to measure the dissimilarity between two objects and represent the center of a cluster with set-valued modes. A heuristic update way of cluster prototype is developed for the fuzzy partition matrix. These extensions make the fuzzy SV-k-modes algorithm can cluster categorical data with single-valued and set-valued attributes together and the fuzzy k-modes algorithm is its special case. Experimental results on the synthetic data sets and the three real data sets from different applications have shown the efficiency and effectiveness of the fuzzy SV-k-modes algorithm. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 39 条
[31]   Partition-and-merge based fuzzy genetic clustering algorithm for categorical data [J].
Thi Phuong Quyen Nguyen ;
Kuo, R. J. .
APPLIED SOFT COMPUTING, 2019, 75 :254-264
[32]   k-ANMI:: A mutual information based clustering algorithm for categorical data [J].
He, Zengyou ;
Xu, Xiaofei ;
Deng, Shengchun .
INFORMATION FUSION, 2008, 9 (02) :223-233
[33]   A modified Fuzzy k-Partition based on indiscernibility relation for categorical data clustering [J].
Yanto, Iwan Tri Riyadi ;
Ismail, Maizatul Akmar ;
Herawan, Tutut .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 53 :41-52
[34]   The k-modes type clustering plus between-cluster information for categorical data [J].
Bai, Liang ;
Liang, Jiye .
NEUROCOMPUTING, 2014, 133 :111-121
[35]   Extensions to the k-means algorithm for clustering large data sets with categorical values [J].
Huang, ZX .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (03) :283-304
[36]   Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values [J].
Zhexue Huang .
Data Mining and Knowledge Discovery, 1998, 2 :283-304
[37]   FKMAWCW: Categorical fuzzy k-modes clustering with automated attribute-weight and cluster-weight learning [J].
Oskouei, Amin Golzari ;
Balafar, Mohammad Ali ;
Motamed, Cina .
CHAOS SOLITONS & FRACTALS, 2021, 153
[38]   Automatic Fuzzy Clustering Using Non-Dominated Sorting Particle Swarm Optimization Algorithm for Categorical Data [J].
Thi Phuong Quyen Nguyen ;
Kuo, R. J. .
IEEE ACCESS, 2019, 7 :99721-99734
[39]   A kernel-based intuitionistic weight fuzzy k-modes algorithm using coupled chained P system combines DNA genetic rules for categorical data [J].
Jiang, Zhenni ;
Liu, Xiyu ;
Zang, Wenke .
NEUROCOMPUTING, 2023, 528 :84-96