A second order cone programming approach for semi-supervised learning

被引:6
作者
Huang, Gao [1 ]
Song, Shiji [1 ]
Gupta, Jatinder N. D. [2 ]
Wu, Cheng [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] Univ Alabama, Coll Business Adm, Huntsville, AL 35899 USA
基金
中国国家自然科学基金;
关键词
Semi-supervised learning; K-nearest-neighbor; Support vector machine; Second order cone programming; CLASSIFICATION; OPTIMIZATION;
D O I
10.1016/j.patcog.2013.06.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning (SSL) involves the training of a decision rule from both labeled and unlabeled data. In this paper, we propose a novel SSL algorithm based on the multiple clusters per class assumption. The proposed algorithm consists of two stages. In the first stage, we aim to capture the local cluster structure of the training data by using the k-nearest-neighbor (kNN) algorithm to split the data into a number of disjoint subsets. In the second stage, a maximal margin classifier based on the second order cone programming (SOCP) is introduced to learn an inductive decision function from the obtained subsets globally. For linear classification problems, once the kNN algorithm has been performed, the proposed algorithm trains a classifier using only the first and second order moments of the subsets without considering individual data points. Since the number of subsets is usually much smaller than the number of training points, the proposed algorithm is efficient for handling big data sets with a large amount of unlabeled data. Despite its simplicity, the classification performance of the proposed algorithm is guaranteed by the maximal margin classifier. We demonstrate the efficiency and effectiveness of the proposed algorithm on both synthetic and real-world data sets. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3548 / 3558
页数:11
相关论文
共 39 条
[1]  
Abou-Moustafa K., 2011, P 33 INT C PATT REC, P335
[2]  
Abou-Moustafa KarimT., 2012, Asian_Conference_on_Machine_Learning,_JMLR: Workshop_and_Conference_Proceedings, P1
[3]   Help-Training for semi-supervised support vector machines [J].
Adankon, Mathias M. ;
Cheriet, Mohamed .
PATTERN RECOGNITION, 2011, 44 (09) :2220-2230
[4]  
[Anonymous], 2006, P 23 INT C MACH LEAR
[5]  
[Anonymous], 2002, School Comput. Sci., Tech. Rep. CMU-CALD02-107
[6]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[7]  
Blum A, 2001, P 18 INT C MACH LEAR, P19, DOI DOI 10.1184/R1/6606860.V1
[8]  
Chapelle O., 2005, P 10 INT WORKSH ART, P57
[9]  
Chapelle O, 2006, SEMISUPERVISED LEARN, V2
[10]  
Chapelle O, 2008, J MACH LEARN RES, V9, P203