Distributed Matrix Completion

被引:50
作者
Teflioudi, Christina [1 ]
Makari, Faraz [1 ]
Gemulla, Rainer [1 ]
机构
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
来源
12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012) | 2012年
关键词
parallel and distributed matrix factorization; stochastic gradient descent; ALS; recommender systems;
D O I
10.1109/ICDM.2012.120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We discuss parallel and distributed algorithms for large-scale matrix completion on problems with millions of rows, millions of columns, and billions of revealed entries. We focus on in-memory algorithms that run on a small cluster of commodity nodes; even very large problems can be handled effectively in such a setup. Our DALS, ASGD, and DSGD++ algorithms are novel variants of the popular alternating least squares and stochastic gradient descent algorithms; they exploit thread-level parallelism, in-memory processing, and asynchronous communication. We provide some guidance on the asymptotic performance of each algorithm and investigate the performance of both our algorithms and previously proposed MapReduce algorithms in large-scale experiments. We found that DSGD++ outperforms competing methods in terms of overall runtime, memory consumption, and scalability. Using DSGD++, we can factor a matrix with 10B entries on 16 compute nodes in around 40 minutes.
引用
收藏
页码:655 / 664
页数:10
相关论文
共 21 条
[1]  
[Anonymous], 2006, NIPS
[2]  
[Anonymous], 2011, Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD)
[3]  
[Anonymous], 2011, NIPS
[4]  
[Anonymous], 2007, KDD CUP WORKSH
[5]  
[Anonymous], 2007, Google news personalization: scalable online collaborative filtering, DOI DOI 10.1145/1242572.1242610
[6]  
Battiti R., 1989, Complex Syst., V3, P331
[7]   A LIMITED MEMORY ALGORITHM FOR BOUND CONSTRAINED OPTIMIZATION [J].
BYRD, RH ;
LU, PH ;
NOCEDAL, J ;
ZHU, CY .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1995, 16 (05) :1190-1208
[8]   Exact Matrix Completion via Convex Optimization [J].
Candes, Emmanuel J. ;
Recht, Benjamin .
FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2009, 9 (06) :717-772
[9]  
Chen P.-L., 2011, KDDCUP 2011 WORKSH
[10]  
DAS S., 2010, ACM SIGMOD INT C MAN, P987, DOI DOI 10.1145/1807167.1807275