An Adaptive Three-Way Clustering Algorithm for Mixed-Type Data

被引:4
作者
Xiong, Jing [1 ]
Yu, Hong [1 ]
机构
[1] Chongqing Univ Posts & Telecommunicat, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
来源
FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2018) | 2018年 / 11177卷
基金
中国国家自然科学基金;
关键词
Three-way clustering; Adaptive; Mixed-type data;
D O I
10.1007/978-3-030-01851-1_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The three-way clustering is different from the traditional two-way clustering. Instead of using two regions to represent a cluster by a single set, a cluster is represented by a pair of sets, and there are three regions such as the core region, fringe region and trivial region. The three-way representation intuitively shows that which objects are fringe to the cluster and it is proposed for dealing with uncertain clustering. However, the three-way clustering algorithm usually needs an appropriate evaluation function and corresponding thresholds. It is not scientific and efficient method for setting the thresholds in advance. Meanwhile, there is a large amount of mixed-type data in real life. Therefore, this paper proposes an adaptive three-way clustering algorithm for mixed-type data, which adjusts the three-way thresholds during the clustering process based on the idea of universal gravitation by excavating more detailed ascription relation between objects and clusters. The experimental results show that the proposed algorithm has good performance in indices such as the accuracy, F-measure, RI and NMI.
引用
收藏
页码:379 / 388
页数:10
相关论文
共 12 条
[1]  
[Anonymous], 2010, UCI Machine Learning Repository
[2]   Study on density peaks clustering based on k-nearest neighbors and principal component analysis [J].
Du, Mingjing ;
Ding, Shifei ;
Jia, Hongjie .
KNOWLEDGE-BASED SYSTEMS, 2016, 99 :135-145
[3]   Interval set clustering of web users with rough K-means [J].
Lingras, P ;
West, C .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2004, 23 (01) :5-16
[4]   Rough clustering [J].
Lingras, Pawan ;
Peters, Georg .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (01) :64-72
[6]   Clustering by fast search and find of density peaks [J].
Rodriguez, Alex ;
Laio, Alessandro .
SCIENCE, 2014, 344 (6191) :1492-1496
[7]   CE3: A three-way clustering method based on mathematical morphology [J].
Wang, Pingxin ;
Yao, Yiyu .
KNOWLEDGE-BASED SYSTEMS, 2018, 155 :54-65
[8]   Incremental Fuzzy Clustering With Multiple Medoids for Large Data [J].
Wang, Yangtao ;
Chen, Lihui ;
Mei, Jian-Ping .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (06) :1557-1568
[9]   Three-way decisions with probabilistic rough sets [J].
Yao, Yiyu .
INFORMATION SCIENCES, 2010, 180 (03) :341-353
[10]   Flavor profiles of soymilk processed with four different processing technologies and 26 soybean cultivars grown in China [J].
Yu, Hansong ;
Liu, Ruixue ;
Hu, Yaohui ;
Xu, Baojun .
INTERNATIONAL JOURNAL OF FOOD PROPERTIES, 2018, 20 :S2887-S2898