Sparse kernel spectral clustering models for large-scale data analysis

被引:22
作者
Alzate, Carlos [1 ]
Suykens, Johan A. K. [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT SCD SISTA, B-3001 Louvain, Belgium
关键词
Kernel spectral clustering; Sparseness; Large-scale data; Kernel methods; FORMULATION;
D O I
10.1016/j.neucom.2011.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Kernel spectral clustering has been formulated within a primal-dual optimization setting allowing natural extensions to out-of-sample data together with model selection in a learning framework. This becomes important for predictive purposes and for good generalization capabilities. The clustering model is formulated in the primal in terms of mappings to high-dimensional feature spaces typical of support vector machines and kernel-based methodologies. The dual problem corresponds to an eigenvalue decomposition of a centered Laplacian matrix derived from pairwise similarities within the data. The out-of-sample extension can also be used to introduce sparsity and to reduce the computational complexity of the resulting eigenvalue problem. In this paper, we propose several methods to obtain sparse and highly sparse kernel spectral clustering models. The proposed approaches are based on structural properties of the solutions when the clusters are well formed. Experimental results with difficult toy examples and images show the applicability of the proposed sparse models with predictive capabilities. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1382 / 1390
页数:9
相关论文
共 34 条
[1]  
Alzate C., 2010, P 18 EUR S ART NEUR, P235
[2]   A regularized kernel CCA contrast function for ICA [J].
Alzate, Carlos ;
Suykens, Johan A. K. .
NEURAL NETWORKS, 2008, 21 (2-3) :170-181
[3]   Multiway Spectral Clustering with Out-of-Sample Extensions through Weighted Kernel PCA [J].
Alzate, Carlos ;
Suykens, Johan A. K. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (02) :335-347
[4]  
Alzate C, 2006, IEEE IJCNN, P138
[5]   Sparse Kernel Models for Spectral Clustering Using the Incomplete Cholesky Decomposition [J].
Alzate, Carlos ;
Suykens, Johan A. K. .
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, :3556-3563
[6]  
[Anonymous], 2001, ADV NEURAL INFORM PR
[7]  
[Anonymous], 2010, CVX: Matlab software for disciplined convex programming (web page and software)
[8]  
[Anonymous], 2002, Journal of machine learning research
[9]  
Bach FR, 2006, J MACH LEARN RES, V7, P1963
[10]  
Boyd S., 2004, CONVEX OPTIMIZATION, VFirst, DOI DOI 10.1017/CBO9780511804441