Tuning Word2vec for Large Scale Recommendation Systems

被引:10
作者
Chamberlain, Benjamin P. [1 ]
Rossi, Emanuele [1 ]
Shiebler, Dan [1 ]
Sedhain, Suvash [1 ]
Bronstein, Michael M. [1 ]
机构
[1] Twitter Inc, San Francisco, CA 94103 USA
来源
RECSYS 2020: 14TH ACM CONFERENCE ON RECOMMENDER SYSTEMS | 2020年
关键词
Recommender System Evaluation; Embeddings; Neural Networks; Hyperparameter Optimization;
D O I
10.1145/3383313.3418486
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word2vec is a powerful machine learning tool that emerged from Natural Language Processing (NLP) and is now applied in multiple domains, including recommender systems, forecasting, and network analysis. As Word2vec is often used off the shelf, we address the question of whether the default hyperparameters are suitable for recommender systems. The answer is emphatically no. In this paper, we first elucidate the importance of hyperparameter optimization and show that unconstrained optimization yields an average 221% improvement in hit rate over the default parameters. However, unconstrained optimization leads to hyperparameter settings that are very expensive and not feasible for large scale recommendation tasks. To this end, we demonstrate 138% average improvement in hit rate with a runtime budget-constrained hyperparameter optimization. Furthermore, to make hyperparameter optimization applicable for large scale recommendation problems where the target dataset is too large to search over, we investigate generalizing hyperparameters settings from samples. We show that applying constrained hyperparameter optimization using only a 10% sample of the data still yields a 91% average improvement in hit rate over the default parameters when applied to the full datasets. Finally, we apply hyperparameters learned using our method of constrained optimization on a sample to the Who To Follow recommendation service at Twitter and are able to increase follow rates by 15%.
引用
收藏
页码:732 / 737
页数:6
相关论文
共 23 条
[1]  
[Anonymous], 2013, PROC WWW
[2]  
[Anonymous], 2012, P 29 INT C MACH LEAR
[3]  
Antonov I. A., 1979, USSR Computational Mathematics and Mathematical Physics, V19, P252, DOI DOI 10.1016/0041-5553(79)90085-5
[4]  
Barkan O, 2016, 2016 IEEE 26 INT WOR, P1, DOI [10.1109/MLSP.2016.7738886, DOI 10.1109/MLSP.2016.7738886]
[5]  
Belli Luca, 2004, ARXIV200413715
[6]  
Bodon Ferenc, 2003, WORKSH FREQ IT MIN I, V3, P63
[7]   Word2vec applied to Recommendation: Hyperparameters Matter [J].
Caselles-Dupre, Hugo ;
Lesaint, Florian ;
Royo-Letelier, Jimena .
12TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS), 2018, :352-356
[8]   Customer Lifetime Value Prediction Using Embeddings [J].
Chamberlain, Benjamin Paul ;
Cardoso, Angelo ;
Liu, C. H. Bryan ;
Pagliari, Roberto ;
Deisenroth, Marc Peter .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :1753-1762
[9]  
Chen D., 2012, Journal of Database Marketing & Customer Strategy Management, V19, P197, DOI DOI 10.1057/DBM.2012.17
[10]   Real-time Personalization using Embeddings for Search Ranking at Airbnb [J].
Grbovic, Mihajlo ;
Cheng, Haibin .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :311-320