Research Progress on Semi-Supervised Clustering

被引:62
作者
Qin, Yue [1 ,2 ]
Ding, Shifei [1 ,2 ]
Wang, Lijuan [1 ,2 ,3 ]
Wang, Yanru [1 ,2 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[2] Minist Educ Peoples Republ China, Mine Digitizat Engn Res Ctr, Xuzhou 221116, Jiangsu, Peoples R China
[3] Xu Zhou Coll Ind Technol, Sch Informat & Elect Engn, Xuzhou 221400, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised learning; Clustering; Semi-supervised clustering; Pairwise constraints; Labeled; CLASSIFICATION; SAMPLES;
D O I
10.1007/s12559-019-09664-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised clustering is a new learning method which combines semi-supervised learning (SSL) and cluster analysis. It is widely valued and applied to machine learning. Traditional unsupervised clustering algorithm based on data partition does not need any property; however, there are a small amount of independent class labels or pair constraint information data samples in practice; in order to obtain better clustering results, scholars have proposed a semi-supervised clustering. Compared with traditional clustering methods, it can effectively improve clustering performance through a small number of supervised information, and it has been used widely in machine learning. Firstly, this paper introduces the research status and classification of semi-supervised learning and compares the four classification methods as follows: decentralized model, support vector machine, graph, and collaborative training. Secondly, the semi-supervised clustering is described in detail, the current status of semi-supervised clustering is analyzed, and the Cop-kmeans algorithm, Lcop-kmeans algorithm, Seeded-kmeans algorithm, SC-kmeans algorithm, and other algorithms are introduced. The introduction of several semi-supervised clustering methods in this paper can show the advantages of semi-supervised clustering over traditional clustering, and the related literature in recent years is summarized. This paper summarized the latest development of semi-supervised learning and semi-supervised clustering and discussed the application of semi-supervised clustering and the future research direction.
引用
收藏
页码:599 / 612
页数:14
相关论文
共 99 条
[41]   Local homogeneous consistent safe semi-supervised clustering [J].
Gan, Haitao ;
Fan, Yingle ;
Luo, Zhizeng ;
Zhang, Qizhong .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 97 :384-393
[42]  
Goldberg AB, 2008, LECT NOTES ARTIF INT, V5211, P393, DOI 10.1007/978-3-540-87479-9_44
[43]   Multi-step dimensionality reduction and semi-supervised graph-based tumor classification using gene expression data [J].
Gui, Jie ;
Wang, Shu-Lin ;
Lei, Ying-Ke .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2010, 50 (03) :181-191
[44]   A Survey of of the BCube Data Center Network Topology [J].
Han Zhijie ;
Yu Lin .
2018 IEEE 4TH INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY), 4THIEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, (HPSC) AND 3RD IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2018, :229-231
[45]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[46]  
He Ping, 2014, Journal of Software, V25, P997
[47]   Multimodal Deep Autoencoder for Human Pose Recovery [J].
Hong, Chaoqun ;
Yu, Jun ;
Wan, Jian ;
Tao, Dacheng ;
Wang, Meng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) :5659-5670
[48]   Image-Based Three-Dimensional Human Pose Recovery by Multiview Locality-Sensitive Sparse Retrieval [J].
Hong, Chaoqun ;
Yu, Jun ;
Tao, Dacheng ;
Wang, Meng .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2015, 62 (06) :3742-3751
[49]  
Huang RZ, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P113
[50]  
Kingma D. P., 2014, Advances in Neural Information Processing Systems, V27, P3581