Fast Monte-Carlo algorithms for finding low-rank approximations

被引:313
作者
Frieze, A [1 ]
Kannan, R
Vempala, S
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Yale Univ, New Haven, CT USA
[3] MIT, Cambridge, MA 02139 USA
关键词
algorithms; theory; matrix algorithms; sampling; low-rank approximation;
D O I
10.1145/1039488.1039494
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of approximating a given m x n matrix A by another matrix of specified rank k, which is smaller than in and n. The Singular Value Decomposition (SVD) can be used to find the "best" such approximation. However, it takes time polynomial in m, n which is prohibitive for some modern applications. In this article, we develop an algorithm that is qualitatively faster, provided we may sample the entries of the matrix in accordance with a natural probability distribution. In many applications, such sampling can be done efficiently. Our main result is a randomized algorithm to find the description of a matrix D* of rank at most k so that [GRAPHICS] holds with probability at least 1 - delta (where parallel to.parallel to(F) is the Frobenius norm). The algorithm takes time polynomial in k, 1/epsilon, log(1/delta) only and is independent of m and n. In particular, this implies that in constant time, it can be determined if a given matrix of arbitrary size has a good low-rank approximation.
引用
收藏
页码:1025 / 1041
页数:17
相关论文
共 21 条
[1]  
ACHILIOPTAS D, 2001, P 33 ANN S THEOR COM, P611
[2]   THE ALGORITHMIC ASPECTS OF THE REGULARITY LEMMA [J].
ALON, N ;
DUKE, RA ;
LEFMANN, H ;
RODL, V ;
YUSTER, R .
JOURNAL OF ALGORITHMS, 1994, 16 (01) :80-109
[3]  
Bar-Yossef Z., 2003, Proceedings of the 35th Annual ACM Symposium on the Theory of Computing, P335, DOI DOI 10.1145/780542.780593
[4]   Using linear algebra for intelligent information retrieval [J].
Berry, MW ;
Dumais, ST ;
OBrien, GW .
SIAM REVIEW, 1995, 37 (04) :573-595
[5]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
[6]  
2-9
[7]   Clustering large graphs via the Singular Value Decomposition [J].
Drineas, P ;
Frieze, A ;
Kannan, R ;
Vempala, S ;
Vinay, V .
MACHINE LEARNING, 2004, 56 (1-3) :9-33
[8]  
Drineas P, 2003, SIAM PROC S, P223
[9]  
DRINEAS P, 2004, YALEUDCSTR1270
[10]   Fast Monte-Carlo algorithms for approximate matrix multiplication [J].
Drineas, PE ;
Kannan, R .
42ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2001, :452-459