Large-Scale Distributed Bayesian Matrix Factorization using Stochastic Gradient MCMC

被引:26
|
作者
Ahn, Sungjin [1 ]
Korattikara, Anoop [2 ]
Liu, Nathan [3 ]
Rajan, Suju [3 ]
Welling, Max [4 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92697 USA
[2] Google, Mountain View, CA USA
[3] Yahoo Labs, Sunnyvale, CA USA
[4] Univ Amsterdam, Amsterdam, Netherlands
来源
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2015年
关键词
Large-Scale; Distributed; Matrix Factorization; MCMC; Stochastic Gradient; Bayesian Inference;
D O I
10.1145/2783258.2783373
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite having various attractive qualities such as high prediction accuracy and the ability to quantify uncertainty and avoid over fitting, Bayesian Matrix Factorization has not been widely adopted because of the prohibitive cost of inference. In this paper, we propose a scalable distributed Bayesian matrix factorization algorithm using stochastic gradient MCMC. Our algorithm, based on Distributed Stochastic Gradient Langevin Dynamics, can not only match the prediction accuracy of standard MCMC methods like Gibbs sampling, but at the same time is as fast and simple as stochastic gradient descent. In our experiments, we show that our algorithm can achieve the same level of prediction accuracy as Gibbs sampling an order of magnitude faster. We also show that our method reduces the prediction error as fast as distributed stochastic gradient descent, achieving a 4.1% improvement in RMSE for the Netflix dataset and an 1.8% for the Yahoo music dataset.
引用
收藏
页码:9 / 18
页数:10
相关论文
共 50 条
  • [1] Large-Scale Bayesian Probabilistic Matrix Factorization with Memo-Free Distributed Variational Inference
    Chen, Guangyong
    Zhu, Fengyuan
    Heng, Pheng Ann
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (03)
  • [2] Distributed Bayesian Inference for Large-Scale IoT Systems
    Vlachou, Eleni
    Karras, Aristeidis
    Karras, Christos
    Theodorakopoulos, Leonidas
    Halkiopoulos, Constantinos
    Sioutas, Spyros
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (01)
  • [3] Fast Nonparametric Matrix Factorization for Large-scale Collaborative Filtering
    Yu, Kai
    Zhu, Shenghuo
    Lafferty, John
    Gong, Yihong
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 211 - 218
  • [4] Efficient Parallel Stochastic Gradient Descent for Matrix Factorization Using GPU
    Nassar, Mohamed A.
    El-Sayed, Layla A. A.
    Taha, Yousry
    2016 11TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2016, : 63 - 68
  • [5] Stochastic Gradient Descent for Large-scale Linear Nonparallel SVM
    Tang, Jingjing
    Tian, Yingjie
    Wu, Guoqiang
    Li, Dewei
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 980 - 983
  • [6] On the benefits of the LDLT factorization for large-scale differential matrix equation solvers
    Lang, Norman
    Mena, Hermann
    Saak, Jens
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2015, 480 : 44 - 71
  • [7] Using Stochastic Gradient Decent Algorithm For Incremental Matrix Factorization In Recommendation System
    Nguyen, Si-Thin
    Kwak, Hyun-Young
    Lee, Seok-Hee
    Gim, Gwang-Yong
    2019 20TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2019, : 308 - 319
  • [8] MACAU: SCALABLE BAYESIAN FACTORIZATION WITH HIGH-DIMENSIONAL SIDE INFORMATION USING MCMC
    Simm, J.
    Arany, A.
    Zakeri, P.
    Haber, T.
    Wegner, J. K.
    Chupakhin, V.
    Ceulemans, H.
    Moreau, Y.
    2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [9] A STOCHASTIC NEWTON MCMC METHOD FOR LARGE-SCALE STATISTICAL INVERSE PROBLEMS WITH APPLICATION TO SEISMIC INVERSION
    Martin, James
    Wilcox, Lucas C.
    Burstedde, Carsten
    Ghattas, Omar
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2012, 34 (03) : A1460 - A1487
  • [10] Large-scale multi-view clustering via matrix factorization of consensus graph
    Yang, Zengbiao
    Tan, Yihua
    Yang, Tao
    PATTERN RECOGNITION, 2024, 155