Scalable collaborative-filtering with jointly derived neighborhood interpolation weights

被引:225
作者
Bell, Robert M. [1 ]
Koren, Yehuda [1 ]
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
来源
ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING | 2007年
关键词
D O I
10.1109/ICDM.2007.90
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recommender systems based on collaborative filtering predict user preferences for products or services by learning past user-item relationships. A predominant approach to collaborative filtering is neighborhood based ("k-nearest neighbors"), where a user-item preference rating is interpolated from ratings of similar items and/or users. We enhance the neighborhood-based approach leading to substantial improvement of prediction accuracy, without a meaningful increase in running time. First, we remove certain so-called "global effects" from the data to make the ratings more comparable, thereby improving interpolation accuracy. Second, we show how to simultaneously derive interpolation weights for all nearest neighbors, unlike previous approaches where each weight is computed separately. By globally solving a suitable optimization problem, this simultaneous interpolation accounts for the many interactions between neighbors leading to improved accuracy. Our method is very fast in practice, generating a prediction in about 0.2 milliseconds. Importantly, it does not require training many parameters or a lengthy preprocessing, making it very practical for large scale applications. Finally, we show how to apply these methods to the perceivably much slower user-oriented approach. To this end, we suggest a novel scheme for low dimensional embedding of the users. We evaluate these methods on the Netflix dataset, where they deliver significantly better results than the commercial Netflix Cinematch recommender system.
引用
收藏
页码:43 / 52
页数:10
相关论文
共 15 条
  • [1] Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions
    Adomavicius, G
    Tuzhilin, A
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) : 734 - 749
  • [2] [Anonymous], 2007, KDD CUP WORKSH
  • [3] Bell R. M., 2007, P 13 ACM SIGKDD INT
  • [4] DATA-ANALYSIS USING STEINS ESTIMATOR AND ITS GENERALIZATIONS
    EFRON, B
    MORRIS, C
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (350) : 311 - 319
  • [5] Funk S., 2006, NETFLIX UPDATE TRY T
  • [6] USING COLLABORATIVE FILTERING TO WEAVE AN INFORMATION TAPESTRY
    GOLDBERG, D
    NICHOLS, D
    OKI, BM
    TERRY, D
    [J]. COMMUNICATIONS OF THE ACM, 1992, 35 (12) : 61 - 70
  • [7] Eigentaste: A constant time collaborative filtering algorithm
    Goldberg, K
    Roeder, T
    Gupta, D
    Perkins, C
    [J]. INFORMATION RETRIEVAL, 2001, 4 (02): : 133 - 151
  • [8] An algorithmic framework for performing collaborative filtering
    Herlocker, JL
    Konstan, JA
    Borchers, A
    Riedl, J
    [J]. SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 230 - 237
  • [9] GroupLens: Applying collaborative filtering to Usenet news
    Konstan, JA
    Miller, BN
    Maltz, D
    Herlocker, JL
    Gordon, LR
    Riedl, J
    [J]. COMMUNICATIONS OF THE ACM, 1997, 40 (03) : 77 - 87
  • [10] Lawson CL., 1974, Solving Least Squares Problems