Sparse kernel spectral clustering models for large-scale data analysis

被引:22
作者
Alzate, Carlos [1 ]
Suykens, Johan A. K. [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT SCD SISTA, B-3001 Louvain, Belgium
关键词
Kernel spectral clustering; Sparseness; Large-scale data; Kernel methods; FORMULATION;
D O I
10.1016/j.neucom.2011.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Kernel spectral clustering has been formulated within a primal-dual optimization setting allowing natural extensions to out-of-sample data together with model selection in a learning framework. This becomes important for predictive purposes and for good generalization capabilities. The clustering model is formulated in the primal in terms of mappings to high-dimensional feature spaces typical of support vector machines and kernel-based methodologies. The dual problem corresponds to an eigenvalue decomposition of a centered Laplacian matrix derived from pairwise similarities within the data. The out-of-sample extension can also be used to introduce sparsity and to reduce the computational complexity of the resulting eigenvalue problem. In this paper, we propose several methods to obtain sparse and highly sparse kernel spectral clustering models. The proposed approaches are based on structural properties of the solutions when the clusters are well formed. Experimental results with difficult toy examples and images show the applicability of the proposed sparse models with predictive capabilities. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1382 / 1390
页数:9
相关论文
共 34 条
[12]   SPECTRAL K-WAY RATIO-CUT PARTITIONING AND CLUSTERING [J].
CHAN, PK ;
SCHLAG, MDF ;
ZIEN, JY .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1994, 13 (09) :1088-1096
[13]  
Chung F., 1992, Spectral Graph Theory
[14]  
Ding C., 2004, P 21 INT C MACH LEAR, P29, DOI [10.1145/1015330.1015408, DOI 10.1145/1015330.1015408]
[15]  
Ding C., 2004, P 21 INT C MACH LEAR, P30, DOI [DOI 10.1145/1015330.1015407, 10.1145/1015330.1015407]
[17]   Efficient SVM training using low-rank kernel representations [J].
Fine, S ;
Scheinberg, K .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) :243-264
[18]   Spectral grouping using the Nystrom method [J].
Fowlkes, C ;
Belongie, S ;
Chung, F ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (02) :214-225
[19]   Orthogonal series density estimation and the kernel eigenvalue problem [J].
Girolami, M .
NEURAL COMPUTATION, 2002, 14 (03) :669-688
[20]  
Golub G. H., 1996, MATRIX COMPUTATIONS