Research Progress on Semi-Supervised Clustering

被引:62
作者
Qin, Yue [1 ,2 ]
Ding, Shifei [1 ,2 ]
Wang, Lijuan [1 ,2 ,3 ]
Wang, Yanru [1 ,2 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[2] Minist Educ Peoples Republ China, Mine Digitizat Engn Res Ctr, Xuzhou 221116, Jiangsu, Peoples R China
[3] Xu Zhou Coll Ind Technol, Sch Informat & Elect Engn, Xuzhou 221400, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised learning; Clustering; Semi-supervised clustering; Pairwise constraints; Labeled; CLASSIFICATION; SAMPLES;
D O I
10.1007/s12559-019-09664-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised clustering is a new learning method which combines semi-supervised learning (SSL) and cluster analysis. It is widely valued and applied to machine learning. Traditional unsupervised clustering algorithm based on data partition does not need any property; however, there are a small amount of independent class labels or pair constraint information data samples in practice; in order to obtain better clustering results, scholars have proposed a semi-supervised clustering. Compared with traditional clustering methods, it can effectively improve clustering performance through a small number of supervised information, and it has been used widely in machine learning. Firstly, this paper introduces the research status and classification of semi-supervised learning and compares the four classification methods as follows: decentralized model, support vector machine, graph, and collaborative training. Secondly, the semi-supervised clustering is described in detail, the current status of semi-supervised clustering is analyzed, and the Cop-kmeans algorithm, Lcop-kmeans algorithm, Seeded-kmeans algorithm, SC-kmeans algorithm, and other algorithms are introduced. The introduction of several semi-supervised clustering methods in this paper can show the advantages of semi-supervised clustering over traditional clustering, and the related literature in recent years is summarized. This paper summarized the latest development of semi-supervised learning and semi-supervised clustering and discussed the application of semi-supervised clustering and the future research direction.
引用
收藏
页码:599 / 612
页数:14
相关论文
共 99 条
[1]   LEARNING WITH A PROBABILISTIC TEACHER [J].
AGRAWALA, AK .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1970, 16 (04) :373-+
[2]  
[Anonymous], P 2 DIG FOR RES WORK
[3]  
[Anonymous], EXPONENTIAL VALUE LA
[4]  
[Anonymous], SDM
[5]  
[Anonymous], INT C MACH LEARN
[6]  
[Anonymous], SEMISUPERVISED CLUST
[7]  
[Anonymous], PERSP RETHINK REFORM
[8]  
[Anonymous], LEARNING LABELED UNL
[9]  
[Anonymous], J SOFTW
[10]  
[Anonymous], 3 INT SEM ED INN EC