Movie genome: alleviating new item cold start in movie recommendation

被引:53
作者
Deldjoo, Yashar [1 ]
Dacrema, Maurizio Ferrari [2 ]
Constantin, L. Mihai Gabriel [4 ]
Eghbal-Zadeh, Hamid [6 ]
Cereda, Stefano [2 ]
Schedl, Markus [6 ]
Ionescu, Bogdan [5 ]
Cremonesi, Paolo [3 ]
机构
[1] Politecn Milan, Milan, Italy
[2] Politecn Milan, DEIB, Milan, Italy
[3] Politecn Milan, Recommender Syst, Comp Sci Dept, Milan, Italy
[4] Univ Politehn Bucuresti, Bucharest, Romania
[5] Univ Politehn Bucuresti, CAMPUS Res Ctr, Bucharest, Romania
[6] Johannes Kepler Univ Linz, Dept Computat Percept, Linz, Austria
关键词
Movie recommender systems; Cold start; Warm start; Semi-cold start; New item; Multimedia features; Content-based; Audio descriptors; Visual descriptors; Multimodal fusion; Hybrid recommender system; Feature weighting; Collaborative-enriched content-based filtering; Canonical correlations analysis; INFORMATION; SIMILARITY; DIVERSITY; FEATURES; SYSTEM;
D O I
10.1007/s11257-019-09221-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As of today, most movie recommendation services base their recommendations on collaborative filtering (CF) and/or content-based filtering (CBF) models that use metadata (e.g., genre or cast). In most video-on-demand and streaming services, however, new movies and TV series are continuously added. CF models are unable to make predictions in such a scenario, since the newly added videos lack interactionsa problem technically known as new item cold start (CS). Currently, the most common approach to this problem is to switch to a purely CBF method, usually by exploiting textual metadata. This approach is known to have lower accuracy than CF because it ignores useful collaborative information and relies on human-generated textual metadata, which are expensive to collect and often prone to errors. User-generated content, such as tags, can also be rare or absent in CS situations. In this paper, we introduce a new movie recommender system that addresses the new item problem in the movie domain by (i) integrating state-of-the-art audio and visual descriptors, which can be automatically extracted from video content and constitute what we call the movie genome; (ii) exploiting an effective data fusion method named canonical correlation analysis, which was successfully tested in our previous works Deldjoo et al. (in: International Conference on Electronic Commerce and Web Technologies. Springer, Berlin, pp 34-45, 2016b; Proceedings of the Twelfth ACM Conference on Recommender Systems. ACM, 2018b), to better exploit complementary information between different modalities; (iii) proposing a two-step hybrid approach which trains a CF model on warm items (items with interactions) and leverages the learned model on the movie genome to recommend cold items (items without interactions). Experimental validation is carried out using a system-centric study on a large-scale, real-world movie recommendation dataset both in an absolute cold start and in a cold to warm transition; and a user-centric online experiment measuring different subjective aspects, such as satisfaction and diversity. Results show the benefits of this approach compared to existing approaches.
引用
收藏
页码:291 / 343
页数:53
相关论文
共 116 条
[1]   Improving Aggregate Recommendation Diversity Using Ranking-Based Techniques [J].
Adomavicius, Gediminas ;
Kwon, YoungOk .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (05) :896-911
[2]   Stability of Recommendation Algorithms [J].
Adomavicius, Gediminas ;
Zhang, Jingjing .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2012, 30 (04)
[3]  
Aggarwal C C., 2016, Recommender Systems, P139
[4]  
[Anonymous], PROCEEDINGS OF THE 9
[5]  
[Anonymous], 2017, P 15 INT WORKSH CONT
[6]  
[Anonymous], 2018, USER PREFERENCE ELIC
[7]  
[Anonymous], 2010, P 7 SOUND MUS COMP C
[8]  
[Anonymous], 2015, Recommender Systems Handbook
[9]  
[Anonymous], 1995, Asakura Shoten
[10]  
[Anonymous], 2011, 7 ANN MUSIC INFORM R