Data clustering with partial supervision

被引:60
作者
Bouchachia, A
Pedrycz, W
机构
[1] Univ Klagenfurt, Dept Informat, A-9020 Klagenfurt, Austria
[2] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2V4, Canada
关键词
clustering; partial supervision; classification; class discrimination; linear regression;
D O I
10.1007/s10618-005-0019-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering with partial supervision finds its application in situations where data is neither entirely nor accurately labeled. This paper discusses a semi-supervised clustering algorithm based on a modified version of the fuzzy C-Means (FCM) algorithm. The objective function of the proposed algorithm consists of two components. The first concerns traditional unsupervised clustering while the second tracks the relationship between classes (available labels) and the clusters generated by the first component. The balance between the two components is tuned by a scaling factor. Comprehensive experimental studies are presented. First, the discrimination of the proposed algorithm is discussed before its reformulation as a classifier is addressed. The induced classifier is evaluated on completely labeled data and validated by comparison against some fully supervised classifiers, namely support vector machines and neural networks. This classifier is then evaluated and compared against three semi-supervised algorithms in the context of learning from partly labeled data. In addition, the behavior of the algorithm is discussed and the relation between classes and clusters is investigated using a linear regression model. Finally, the complexity of the algorithm is briefly discussed.
引用
收藏
页码:47 / 78
页数:32
相关论文
共 22 条
[1]  
Amini M.-R., 2003, IJCAI 03 P 18 INT JO, P555
[2]  
[Anonymous], 2005, Advances in Neural Information Processing Systems
[3]  
[Anonymous], 1983, Statistical methods
[4]  
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[5]  
Basu S., 2002, P 19 INT C MACHINE L, P19, DOI [10.5555/645531.656012, DOI 10.5555/645531.656012]
[6]  
Bennett KP, 1999, ADV NEUR IN, V11, P368
[7]  
Bishop C. M., 1996, Neural networks for pattern recognition
[8]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[9]  
Blum A., 2004, P 21 INT C MACH LEAR, P92
[10]  
Bouchachia A., 2005, P WORKSH LEARN PART, P10