A majorization-minimization approach to the sparse generalized eigenvalue problem

被引:98
作者
Sriperumbudur, Bharath K. [1 ]
Torres, David A. [2 ]
Lanckriet, Gert R. G. [1 ,2 ]
机构
[1] Univ Calif San Diego, Dept Elect & Comp Engn, San Diego, CA 92093 USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92093 USA
基金
美国国家科学基金会;
关键词
Generalized eigenvalue problem; Principal component analysis; Canonical correlation analysis; Fisher discriminant analysis; Sparsity; D.c; program; Majorization-minimization; Zangwill's theory of global convergence; Music annotation; Cross-language document retrieval; VARIABLE SELECTION; PRINCIPAL; ALGORITHMS; CANCER; TUMOR;
D O I
10.1007/s10994-010-5226-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generalized eigenvalue (GEV) problems have applications in many areas of science and engineering. For example, principal component analysis (PCA), canonical correlation analysis (CCA) and Fisher discriminant analysis (FDA) are specific instances of GEV problems, that are widely used in statistical data analysis. The main contribution of this work is to formulate a general, efficient algorithm to obtain sparse solutions to a GEV problem. Specific instances of sparse GEV problems can then be solved by specific instances of this algorithm. We achieve this by solving the GEV problem while constraining the cardinality of the solution. Instead of relaxing the cardinality constraint using a a"" (1)-norm approximation, we consider a tighter approximation that is related to the negative log-likelihood of a Student's t-distribution. The problem is then framed as a d.c. (difference of convex functions) program and is solved as a sequence of convex programs by invoking the majorization-minimization method. The resulting algorithm is proved to exhibit global convergence behavior, i.e., for any random initialization, the sequence (subsequence) of iterates generated by the algorithm converges to a stationary point of the d.c. program. Finally, we illustrate the merits of this general sparse GEV algorithm with three specific examples of sparse GEV problems: sparse PCA, sparse CCA and sparse FDA. Empirical evidence for these examples suggests that the proposed sparse GEV algorithm, which offers a general framework to solve any sparse GEV problem, will give rise to competitive algorithms for a variety of applications where specific instances of GEV problems arise.
引用
收藏
页码:3 / 39
页数:37
相关论文
共 63 条
[21]   CORRESPONDENCE-ANALYSIS WITH LEAST ABSOLUTE RESIDUALS [J].
HEISER, WJ .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1987, 5 (04) :337-356
[22]   DC programming: Overview [J].
Horst, R ;
Thoai, NV .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1999, 103 (01) :1-43
[23]   Relations between two sets of variates [J].
Hotelling, H .
BIOMETRIKA, 1936, 28 :321-377
[24]   Analysis of a complex of statistical variables into principal components [J].
Hotelling, H .
JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1933, 24 :417-441
[25]  
Huber P. J., 1981, Robust Statistics
[26]   Variable selection using MM algorithms [J].
Hunter, DR ;
Li, RZ .
ANNALS OF STATISTICS, 2005, 33 (04) :1617-1642
[27]   A tutorial on MM algorithms [J].
Hunter, DR ;
Lange, K .
AMERICAN STATISTICIAN, 2004, 58 (01) :30-37
[28]  
Jeffers J., 1967, J R STAT SOC C-APPL, V16, P225, DOI DOI 10.2307/2985919
[29]  
Jolliffe I., 2002, PRINCIPAL COMPONENT, DOI [10.1007/978-1-4757-1904-8_7, 10.1016/0169-7439(87)80084-9]
[30]   A modified principal component technique based on the LASSO [J].
Jolliffe, IT ;
Trendafilov, NT ;
Uddin, M .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2003, 12 (03) :531-547