Exemplar-based large-scale low-rank matrix decomposition for collaborative prediction

被引:2
作者
Lei, Hengxin [1 ]
Liu, Jinglei [2 ]
Yu, Yong [1 ]
机构
[1] Yantai Nanshan Univ, Coll Business, Longkou, Peoples R China
[2] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China
关键词
CUR matrix decomposition; product recommendation; regularized singular value decomposition; interpretability; row column sampling; MONTE-CARLO ALGORITHMS; FACTORIZATION;
D O I
10.1080/00207160.2022.2141571
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Recommender system is one of the most effective tools to solve the problem of information overload. As a popular method for recommendation system, regularized singular value decomposition (RSVD) has the advantage of prediction accuracy. However, with the growing of the size of rating matrix A, RSVD suffers from both 'out of memory' and high computational cost. To alleviate these disadvantages, we utilize CUR decomposition approach to reduce the memory consumption before RSVD is applied. Additionally, the rating matrix A often is sparse, so we propose a novel column sampling algorithm and sparseness measure to solve the data sparse problem. The main computation cost in the original RSVD is to compute (MN)-N-T and MNT respectively. While replacing N by CUR or C in RSVD, we can reduce the computational cost from O(m(2)n) to O(c(2)m + tcnk(c)) (where c is the size of data subspace, m and n are the size of the input matrix, t is the number of iterations, k(c) is the number of features). Because CUR is explicitly expressed in terms of a small number of actual columns and actual rows of the original data matrix, the result of matrix decomposition has better interpretability. The advantage of what we devised CUR+RSVD and C+RSVD collaborative prediction approaches is that, they not only can deal with the large scale matrix rapidly but also preserve the sparsity of the original matrix, more interestingly, our approaches have higher prediction accuracy. Experimental results on Movielens, Joke data set, etc. show that what we proposed methods can handle the sparsity issue on large-scale low-rank matrix effectively. Compared with RSVD, it can achieve far better prediction accuracy and recommendation result, meanwhile save about 70% training time at the same dataset.
引用
收藏
页码:615 / 640
页数:26
相关论文
共 33 条
[1]   CUR Decompositions, Similarity Matrices, and Subspace Clustering [J].
Aldroubi, Akram ;
Hamm, Keaton ;
Koku, Ahmet Bugra ;
Sekmen, Ali .
FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2019, 4
[2]   Novel predictive model to improve the accuracy of collaborative filtering recommender systems [J].
Alhijawi, Bushra ;
Al-Naymat, Ghazi ;
Obeid, Nadim ;
Awajan, Arafat .
INFORMATION SYSTEMS, 2021, 96
[3]  
[Anonymous], 2011, P FORTH INT C WEB SE, DOI DOI 10.1145/1935826.1935877
[4]   Fuzzy clustering in community detection based on nonnegative matrix factorization with two novel evaluation criteria [J].
Binesh, Neda ;
Rezghi, Mansoor .
APPLIED SOFT COMPUTING, 2018, 69 :689-703
[5]   Geographic-aware collaborative filtering for web service recommendation [J].
Botangen, Khavee Agustus ;
Yu, Jian ;
Sheng, Quan Z. ;
Han, Yanbo ;
Yongchareon, Sira .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 151
[6]   Optimal CUR Matrix Decompositions [J].
Boutsidis, Christos ;
Woodruff, David P. .
STOC'14: PROCEEDINGS OF THE 46TH ANNUAL 2014 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2014, :353-362
[7]   FAST LOW-RANK KERNEL MATRIX FACTORIZATION USING SKELETONIZED INTERPOLATION [J].
Cambier, Leopold ;
Darve, Eric .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2019, 41 (03) :A1652-A1680
[8]   Probabilistic Latent Semantic Analysis-Based Gear Fault Diagnosis Under Variable Working Conditions [J].
Chen, Chao ;
Shen, Fei ;
Xu, Jiawen ;
Yan, Ruqiang .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (06) :2845-2857
[9]  
Chen C, 2020, PR MACH LEARN RES, V108, P766
[10]   Feature selection for text classification: A review [J].
Deng, Xuelian ;
Li, Yuqing ;
Weng, Jian ;
Zhang, Jilian .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (03) :3797-3816