Large scale semi-supervised learning using KSC based model

Cited by: 0
Authors
Mehrkanoon, Siamak [1]
Suykens, Johan A. K. [1]
Affiliations
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT STADIUS, Kasteelpk Arenberg 10, B-3001 Leuven, Belgium
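Source
PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2014
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract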
In practice one often deals with a large amount of unlabeled data, while the fraction of labeled data points is typically small. One therefore prefers a semi-supervised algorithm, which uses both labeled and unlabeled data points in the learning process, to achieve better performance. Given the large amount of unlabeled data, making a semi-supervised algorithm scalable is an important task. In this paper we adopt a recently proposed multi-class semi-supervised kernel spectral clustering (KSC) based algorithm (MSS-KSC) and make it scalable by means of two different approaches. The first is based on the Nyström approximation method, which provides a finite-dimensional feature map that can then be used to solve the optimization problem in the primal. The second is based on the reduced kernel technique, which solves the problem in the dual by reducing the kernel matrix to a rectangular kernel matrix. Experimental results demonstrate the scalability and efficiency of the proposed approaches on real datasets.
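The two scaling ideas named in the abstract can be illustrated with a minimal NumPy sketch. It is a generic illustration, not the paper's implementation. The RBF kernel, the bandwidth `sigma`, the random choice of landmark points, and the function names `rbf_kernel`, `nystrom_feature_map`, and `reduced_kernel` are all assumptions introduced here. The sketch builds only the finite-dimensional feature map and the rectangular kernel matrix, and does not solve the MSS-KSC optimization problem itself.

```python
import numpy as np

def rbf_kernel(A, B, sigma=1.0):
    """Gaussian RBF kernel matrix between the rows of A and the rows of B."""
    sq = (A**2).sum(axis=1)[:, None] + (B**2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * sigma**2))

def nystrom_feature_map(X, landmarks, sigma=1.0, tol=1e-10):
    """Generic Nystrom feature map: phi(x) = Lambda^{-1/2} U^T k(landmarks, x).

    U and Lambda come from the eigendecomposition of the small m x m kernel
    matrix on the landmarks, so Phi @ Phi.T approximates the full kernel matrix.
    """
    K_mm = rbf_kernel(landmarks, landmarks, sigma)
    eigvals, eigvecs = np.linalg.eigh(K_mm)
    keep = eigvals > tol                      # drop numerically null directions
    eigvals, eigvecs = eigvals[keep], eigvecs[:, keep]
    K_nm = rbf_kernel(X, landmarks, sigma)    # n x m cross-kernel
    return K_nm @ eigvecs / np.sqrt(eigvals)  # n x (at most m) explicit features

def reduced_kernel(X, landmarks, sigma=1.0):
    """Rectangular n x m kernel matrix used in place of the full n x n one."""
    return rbf_kernel(X, landmarks, sigma)

# Toy data (assumption): 200 points in 5 dimensions, 20 random landmarks.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
landmark_idx = rng.choice(200, size=20, replace=False)
landmarks = X[landmark_idx]

Phi = nystrom_feature_map(X, landmarks)   # for solving in the primal
K_rect = reduced_kernel(X, landmarks)     # for a reduced dual formulation
```

With `m` landmarks and `n` data points, both objects cost on the order of `n * m` memory instead of `n * n`, which is the source of the scalability the abstract refers to.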
Pages: 4152-4159
Page count: 8