Towards Deeper Understanding of Variational Auto-encoders for Binary Collaborative Filtering

被引:2
作者
Zamani, Siamak [1 ]
Li, Dingcheng [1 ]
Fei, Hongliang [1 ]
Li, Ping [1 ]
机构
[1] Baidu Res, Cognit Comp Lab, 10900 NE 8th St, Bellevue, WA 98004 USA
来源
PROCEEDINGS OF THE 2022 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2022 | 2022年
关键词
Recommendation systems; Variational auto-encoders; collaborative filtering; deep generative models; n-Choose-k model;
D O I
10.1145/3539813.3545145
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recommendation systems are an integral component of machine learning, wherein collaborative filtering (CF) is among the most prominent algorithms employed. Recently, variational auto-encoders (VAEs) with multinomial likelihood and weighted Kullback-Leibler (KL) regularization (referred to as Mult-VAE) provide state-of-the-art performance for collaborative filtering of binary data. To gain deeper insight into the objective function of Mult-VAE, we build a connection between the reconstruction term of Mult-VAE objective and the objective function of the probabilistic n-Choose-k model for ranking prediction. In particular, we theoretically demonstrate that the negative reconstruction error of Mult-VAE is a lower bound to the log-likelihood of the binary n-Choose-k model. Hence, Mult-VAE can be interpreted as an approximate proxy to the n-Choose-k model. We also empirically show the essential role of this reconstruction term of evidence lower bound in the context of collaborative filtering on multiple real-world datasets. Finally, inspired by the role of the weighted KL term in maximizing mutual information between observed ratings and latent variables, we propose a semi-implicit VAE framework with superior performance in terms of ranking metrics.
引用
收藏
页码:175 / 184
页数:10
相关论文
共 58 条
[1]   Information Dropout: Learning Optimal Representations Through Noisy Computation [J].
Achille, Alessandro ;
Soatto, Stefano .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (12) :2897-2905
[2]  
Alemi Alexander, 2018, INT C MACH LEARN, P159
[3]  
Bennett James, 2007, PROC KDD CUP WORKSHO
[4]  
Bertin-Mahieux T., 2011, Proceedings of the 12th International Society for Music Information Retrieval Conference p, DOI 10.7916/D8NZ8J07
[5]  
Burda Y., 2016, P 4 INT C LEARN REPR
[6]  
Burges C. J. C., 2007, Adv. Neural Inf. Process. Syst, V20, P897
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]  
Dieng AB, 2019, PR MACH LEARN RES, V89
[9]  
Doersch C, 2021, Arxiv, DOI arXiv:1606.05908
[10]   MOBIUS: Towards the Next Generation of Query-Ad Matching in Baidu's Sponsored Search [J].
Fan, Miao ;
Guo, Jiacheng ;
Zhu, Shuai ;
Miao, Shuo ;
Sun, Mingming ;
Li, Ping .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2509-2517