AN OPTIMAL SCALING FRAMEWORK FOR COLLABORATIVE FILTERING RECOMMENDATION SYSTEMS

被引:0
作者
Vozalis, Manolis G. [1 ]
Markos, Angelos I. [2 ]
Margaritis, Konstantinos G. [1 ]
机构
[1] Univ Macedonia, Dept Appl Informat, Thessaloniki 54006, Greece
[2] Democritus Univ Thrace, Dept Primary Educ, Alexandroupolis, Greece
关键词
Collaborative filtering; recommender systems; low-rank approximation; optimization algorithms; categorical principal component analysis; optimal scaling; PRINCIPAL-COMPONENTS-ANALYSIS;
D O I
10.1142/S0218213012500339
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Collaborative Filtering (CF) is a popular technique employed by Recommender Systems, a term used to describe intelligent methods that generate personalized recommendations. Some of the most efficient approaches to CF are based on latent factor models and nearest neighbor methods, and have received considerable attention in recent literature. Latent factor models can tackle some fundamental challenges of CF, such as data sparsity and scalability. In this work, we present an optimal scaling framework to address these problems using Categorical Principal Component Analysis (CatPCA) for the low-rank approximation of the user-item ratings matrix, followed by a neighborhood formation step. CatPCA is a versatile technique that utilizes an optimal scaling process where original data are transformed so that their overall variance is maximized. We considered both smooth and non-smooth transformations for the observed variables (items), such as numeric, (spline) ordinal, (spline) nominal and multiple nominal. The method was extended to handle missing data and incorporate differential weighting for items. Experiments were executed on three data sets of different sparsity and size, MovieLens 100k, 1M and Jester, aiming to evaluate the aforementioned options in terms of accuracy. A combined approach with a multiple nominal transformation and a "passive" missing data strategy clearly outperformed the other tested options for all three data sets. The results are comparable with those reported for single methods in the CF literature.
引用
收藏
页数:26
相关论文
共 42 条
[1]   MODELING ITEM-ITEM SIMILARITIES FOR PERSONALIZED RECOMMENDATIONS ON YAHOO! FRONT PAGE [J].
Agarwal, Deepak ;
Zhang, Liang ;
Mazumder, Rahul .
ANNALS OF APPLIED STATISTICS, 2011, 5 (03) :1839-1875
[2]  
[Anonymous], 2008, P 14 ACM SIGKDD INT
[3]  
aszy Pil, 2010, P 4 ACM C REC SYST, P71, DOI DOI 10.1145/1864708.1864726
[4]  
Bakker Arno., 2009, Proceeding of the 1st ACM international workshop on Complex networks meet information and knowledge management (CNIKM'09), P67
[5]   Scalable collaborative-filtering with jointly derived neighborhood interpolation weights [J].
Bell, Robert M. ;
Koren, Yehuda .
ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, :43-52
[6]  
Bell Robert M., 2007, Acm Sigkdd Explorations Newsletter, V9, P75
[7]   Hybrid recommender systems: Survey and experiments [J].
Burke, R .
USER MODELING AND USER-ADAPTED INTERACTION, 2002, 12 (04) :331-370
[8]   Mining performance data through nonlinear PCA with optimal scaling [J].
Costantini, Paola ;
Linting, Marielle ;
Porzio, Giovanni C. .
APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2010, 26 (01) :85-101
[9]  
de Leeuw J, 2009, J STAT SOFTW, V31, P1
[10]  
de Leeuw J, 2006, STAT SOC BEHAV SCI, P107