SEMIDEFINITE PROGRAMMING BASED PRECONDITIONING FOR MORE ROBUST NEAR-SEPARABLE NONNEGATIVE MATRIX FACTORIZATION

被引:29
作者
Gillis, Nicolas [1 ]
Vavasis, Stephen A. [2 ]
机构
[1] Univ Mons, Dept Math & Operat Res, Fac Polytech, B-7000 Mons, Belgium
[2] Univ Waterloo, Dept Combinator & Optimizat, Waterloo, ON N2L 3G1, Canada
关键词
nonnegative matrix factorization; semidefinite programming; preconditioning; separability; robustness to noise; minimum volume ellipsoid; ALGORITHM; COMPUTATION; MODEL;
D O I
10.1137/130940670
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Nonnegative matrix factorization (NMF) under the separability assumption can provably be solved efficiently, even in the presence of noise, and has been shown to be a powerful technique in document classification and hyperspectral unmixing. This problem is referred to as near-separable NMF and requires that there exists a cone spanned by a small subset of the columns of the input nonnegative matrix approximately containing all columns. In this paper, we propose a preconditioning based on the minimum volume ellipsoid and semidefinite programming making the input matrix well-conditioned. This in turn can improve significantly the performance of near-separable NMF algorithms which is illustrated on the popular successive projection algorithm (SPA). The new preconditioned SPA is provably more robust to noise, and outperforms SPA on several synthetic data sets. We also show how an active-set method allows us to apply the preconditioning on large-scale real-world hyperspectral images.
引用
收藏
页码:677 / 698
页数:22
相关论文
共 35 条
[1]   Linear convergence of a modified Frank-Wolfe algorithm for computing minimum-volume enclosing ellipsoids [J].
Ahipasaoglu, S. Damla ;
Sun, Peng ;
Todd, Michael J. .
OPTIMIZATION METHODS & SOFTWARE, 2008, 23 (01) :5-19
[2]   Hyperspectral Data Geometry-Based Estimation of Number of Endmembers Using p-Norm-Based Pure Pixel Identification Algorithm [J].
Ambikapathi, ArulMurugan ;
Chan, Tsung-Han ;
Chi, Chong-Yung ;
Keizer, Kannan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2013, 51 (05) :2753-2769
[3]  
[Anonymous], 2011, CVX MATLAB SOFTWARE
[4]  
[Anonymous], 2012, P ADV NEURAL INFORM
[5]   The successive projections algorithm for variable selection in spectroscopic multicomponent analysis [J].
Araújo, MCU ;
Saldanha, TCB ;
Galvao, RKH ;
Yoneyama, T ;
Chame, HC ;
Visani, V .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 57 (02) :65-73
[6]  
Arora S., 2013, International Conference on Machine Learning, P280
[7]  
Arora S, 2012, STOC'12: PROCEEDINGS OF THE 2012 ACM SYMPOSIUM ON THEORY OF COMPUTING, P145
[8]   Learning Topic Models - Going beyond SVD [J].
Arora, Sanjeev ;
Ge, Rong ;
Moitra, Ankur .
2012 IEEE 53RD ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2012, :1-10
[9]  
AVRON H., 2011, PREPRINT
[10]  
Boyd S., 2004, CONVEX OPTIMIZATION, VFirst, DOI DOI 10.1017/CBO9780511804441