Sparse kernel spectral clustering models for large-scale data analysis

被引：22

作者：

Alzate, Carlos ^{[1
]}

Suykens, Johan A. K. ^{[1
]}

机构：

[1] Katholieke Univ Leuven, Dept Elect Engn ESAT SCD SISTA, B-3001 Louvain, Belgium

来源：

NEUROCOMPUTING | 2011年 / 74卷 / 09期

关键词：

Kernel spectral clustering; Sparseness; Large-scale data; Kernel methods; FORMULATION;

D O I：

10.1016/j.neucom.2011.01.001

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Kernel spectral clustering has been formulated within a primal-dual optimization setting allowing natural extensions to out-of-sample data together with model selection in a learning framework. This becomes important for predictive purposes and for good generalization capabilities. The clustering model is formulated in the primal in terms of mappings to high-dimensional feature spaces typical of support vector machines and kernel-based methodologies. The dual problem corresponds to an eigenvalue decomposition of a centered Laplacian matrix derived from pairwise similarities within the data. The out-of-sample extension can also be used to introduce sparsity and to reduce the computational complexity of the resulting eigenvalue problem. In this paper, we propose several methods to obtain sparse and highly sparse kernel spectral clustering models. The proposed approaches are based on structural properties of the solutions when the clusters are well formed. Experimental results with difficult toy examples and images show the applicability of the proposed sparse models with predictive capabilities. (C) 2011 Elsevier B.V. All rights reserved.

引用

页码：1382 / 1390

页数：9

共 34 条

[11] A COMPUTATIONAL APPROACH TO EDGE-DETECTION [J].

CANNY, J .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (06) :679-698

[12] SPECTRAL K-WAY RATIO-CUT PARTITIONING AND CLUSTERING [J].

CHAN, PK ;

SCHLAG, MDF ;

ZIEN, JY .

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1994, 13 (09) :1088-1096

[13]

Chung F., 1992, Spectral Graph Theory

[14]

Ding C., 2004, P 21 INT C MACH LEAR, P29, DOI [10.1145/1015330.1015408, DOI 10.1145/1015330.1015408]

[15]

Ding C., 2004, P 21 INT C MACH LEAR, P30, DOI [DOI 10.1145/1015330.1015407, 10.1145/1015330.1015407]

[16] ON A THEOREM OF WEYL CONCERNING EIGENVALUES OF LINEAR TRANSFORMATIONS .1. [J].

FAN, K .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1949, 35 (11) :652-655

[17] Efficient SVM training using low-rank kernel representations [J].

Fine, S ;

Scheinberg, K .

JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) :243-264

[18] Spectral grouping using the Nystrom method [J].

Fowlkes, C ;

Belongie, S ;

Chung, F ;

Malik, J .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (02) :214-225

[19] Orthogonal series density estimation and the kernel eigenvalue problem [J].

Girolami, M .

NEURAL COMPUTATION, 2002, 14 (03) :669-688

[20]

Golub G. H., 1996, MATRIX COMPUTATIONS

← 1 2 3 4 →