A majorization-minimization approach to the sparse generalized eigenvalue problem

被引：98

作者：

Sriperumbudur, Bharath K. ^{[1
]}

Torres, David A. ^{[2
]}

Lanckriet, Gert R. G. ^{[1
,2
]}

机构：

[1] Univ Calif San Diego, Dept Elect & Comp Engn, San Diego, CA 92093 USA

[2] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92093 USA

来源：

MACHINE LEARNING | 2011年 / 85卷 / 1-2期

基金：

美国国家科学基金会;

关键词：

Generalized eigenvalue problem; Principal component analysis; Canonical correlation analysis; Fisher discriminant analysis; Sparsity; D.c; program; Majorization-minimization; Zangwill's theory of global convergence; Music annotation; Cross-language document retrieval; VARIABLE SELECTION; PRINCIPAL; ALGORITHMS; CANCER; TUMOR;

D O I：

10.1007/s10994-010-5226-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Generalized eigenvalue (GEV) problems have applications in many areas of science and engineering. For example, principal component analysis (PCA), canonical correlation analysis (CCA) and Fisher discriminant analysis (FDA) are specific instances of GEV problems, that are widely used in statistical data analysis. The main contribution of this work is to formulate a general, efficient algorithm to obtain sparse solutions to a GEV problem. Specific instances of sparse GEV problems can then be solved by specific instances of this algorithm. We achieve this by solving the GEV problem while constraining the cardinality of the solution. Instead of relaxing the cardinality constraint using a a"" (1)-norm approximation, we consider a tighter approximation that is related to the negative log-likelihood of a Student's t-distribution. The problem is then framed as a d.c. (difference of convex functions) program and is solved as a sequence of convex programs by invoking the majorization-minimization method. The resulting algorithm is proved to exhibit global convergence behavior, i.e., for any random initialization, the sequence (subsequence) of iterates generated by the algorithm converges to a stationary point of the d.c. program. Finally, we illustrate the merits of this general sparse GEV algorithm with three specific examples of sparse GEV problems: sparse PCA, sparse CCA and sparse FDA. Empirical evidence for these examples suggests that the proposed sparse GEV algorithm, which offers a general framework to solve any sparse GEV problem, will give rise to competitive algorithms for a variety of applications where specific instances of GEV problems arise.

引用

页码：3 / 39

页数：37

共 63 条

[1] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].

Alon, U ;

Barkai, N ;

Notterman, DA ;

Gish, K ;

Ybarra, S ;

Mack, D ;

Levine, AJ .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750

[2]

[Anonymous], 1969, Nonlinear programming: a unified approach

[3]

[Anonymous], 2005, Adv. Neural Inform. Processing Systems

[4]

[Anonymous], 2008, NIPS

[5]

BIOUCASDIAS J, 2006, P IEEE INT C AC SPEE

[6] MONOTONICITY OF QUADRATIC-APPROXIMATION ALGORITHMS [J].

BOHNING, D ;

LINDSAY, BG .

ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1988, 40 (04) :641-663

[7]

Bonnans JF., 2006, Numerical optimization: Theoretical and practical aspects, V2

[8]

Boyd S.P, 2004, Convex optimization, DOI [DOI 10.1017/CBO9780511804441, 10.1017/CBO9780511804441]

[9]

BRADLEY PS, 1998, P 15 INT C MACH LEAR, P82

[10] LOADINGS AND CORRELATIONS IN THE INTERPRETATION OF PRINCIPAL COMPONENTS [J].

CADIMA, J ;

JOLLIFFE, IT .

JOURNAL OF APPLIED STATISTICS, 1995, 22 (02) :203-214

← 1 2 3 4 5 6 7 →