Semisupervised Classification With Cluster Regularization

被引:47
作者
Soares, Rodrigo G. F. [1 ]
Chen, Huanhuan [1 ]
Yao, Xin [1 ]
机构
[1] Univ Birmingham, Ctr Excellence Res Computat Intelligence & Applic, Birmingham B15 2TT, W Midlands, England
关键词
Clustering; machine learning; regularization; semisupervised learning;
D O I
10.1109/TNNLS.2012.2214488
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semisupervised classification (SSC) learns, from cheap unlabeled data and labeled data, to predict the labels of test instances. In order to make use of the information from unlabeled data, there should be an assumed relationship between the true class structure and the data distribution. One assumption is that data points clustered together are likely to have the same class label. In this paper, we propose a new algorithm, namely, cluster-based regularization (ClusterReg) for SSC, that takes the partition given by a clustering algorithm as a regularization term in the loss function of an SSC classifier. ClusterReg makes predictions according to the cluster structure together with limited labeled data. The experiments confirmed that ClusterReg has a good generalization ability for real-world problems. Its performance is excellent when data follows this cluster assumption. Even when these clusters have misleading overlaps, it still outperforms other state-of-the-art algorithms.
引用
收藏
页码:1779 / 1792
页数:14
相关论文
共 26 条
[1]  
[Anonymous], 2006, Pattern recognition and machine learning
[2]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[3]   Bayesian semi-supervised learning with support vector machine [J].
Chakraborty, Sounak .
STATISTICAL METHODOLOGY, 2011, 8 (01) :68-82
[4]  
Chapelle Olivier, 2006, IEEE Transactions on Neural Networks, DOI DOI 10.1109/TNN.2009.2015974
[5]  
Chen HH, 2006, ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, P878
[6]   Probabilistic Classification Vector Machines [J].
Chen, Huanhuan ;
Tino, Peter ;
Yao, Xin .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (06) :901-914
[7]   Predictive Ensemble Pruning by Expectation Propagation [J].
Chen, Huanhuan ;
Tino, Peter ;
Yao, Xin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (07) :999-1013
[8]   Semi-Supervised Learning via Regularized Boosting Working on Multiple Semi-Supervised Assumptions [J].
Chen, Ke ;
Wang, Shihai .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (01) :129-143
[9]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[10]  
Dunne R. A., 1997, Proceedings of the Eighth Australian Conference on Neural Networks (ACNN'97), P181