One-Class Collaborative Filtering

被引:678
作者
Pan, Rong [1 ]
Zhou, Yunhong [2 ]
Cao, Bin [3 ]
Liu, Nathan N. [3 ]
Lukose, Rajan [1 ]
Scholz, Martin [1 ]
Yang, Qiang [3 ]
机构
[1] HP Labs, 1501 Page Mill Rd, Palo Alto, CA 94304 USA
[2] Rocket Fuel Inc, Redwood Shores, CA 94065 USA
[3] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China
来源
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | 2008年
关键词
D O I
10.1109/ICDM.2008.16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many applications of collaborative filtering (CF), such as news item recommendation and bookmark recommendation, are most naturally thought of as one-class collaborative filtering (OCCF) problems. In these problems, the training data usually consist simply of binary data reflecting a user's action or inaction, such as page visitation in the case of news item recommendation or webpage bookmarking in the bookmarking scenario. Usually this kind of data are extremely sparse (a small fraction are positive examples), therefore ambiguity arises in the interpretation of the non-positive examples. Negative examples and unlabeled positive examples are mixed together and we are typically unable to distinguish them. For example, we cannot really attribute a user not bookmarking a page to a lack of interest or lack of awareness of the page. Previous research addressing this one-class problem only considered it as a classification task. In this paper, we consider the one-class problem under the CF setting. We propose two frameworks to tackle OCCF. One is based on weighted low rank approximation the other is based on negative example sampling. The experimental results show that our approaches significantly outperform the baselines.
引用
收藏
页码:502 / +
页数:3
相关论文
共 30 条
[21]  
Salakhutdinov R., 2007, P 24 INT C MACH LEAR, P791
[22]   Estimating the support of a high-dimensional distribution [J].
Schölkopf, B ;
Platt, JC ;
Shawe-Taylor, J ;
Smola, AJ ;
Williamson, RC .
NEURAL COMPUTATION, 2001, 13 (07) :1443-1471
[23]  
Schwab I., 2001, Learning user interests through positive examples using content analysis and collaborative filtering
[24]  
Srebro N., 2003, ICML, P720
[25]   FASTER METHODS FOR RANDOM SAMPLING [J].
VITTER, JS .
COMMUNICATIONS OF THE ACM, 1984, 27 (07) :703-718
[26]  
WARD G, 2008, BIOMETRICS
[27]  
Weiss G.M., 2004, ACM Sigkdd Explor.Newslett., V6, P7, DOI DOI 10.1145/1007730.1007734
[28]  
Yu Hwanjo, 2002, P 8 ACM SIGKDD INT C, P239, DOI DOI 10.1145/775047.775083
[29]  
ZBOU Y, 2008, LNCS, V5034, P337
[30]   Using singular value decomposition approximation for collaborative filtering [J].
Zhang, S ;
Wang, WH ;
Ford, J ;
Makedon, F ;
Pearlman, J .
CEC 2005: Seventh IEEE International Conference on E-Commerce Technology, Proceedings, 2005, :257-264