A scalable privacy-preserving recommendation scheme via bisecting k-means clustering

Cited by: 45
Authors
Bilge, Alper [1]
Polat, Huseyin [1]
Affiliation
[1] Anadolu Univ, Dept Comp Engn, TR-26555 Eskisehir, Turkey
Keywords
Accuracy; Binary decision diagrams; Clustering methods; Data preprocessing; Data privacy; Recommender systems; SYSTEMS;
DOI
10.1016/j.ipm.2013.02.004
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Privacy-preserving collaborative filtering is an emerging web-adaptation tool for coping with the information overload problem without jeopardizing individuals' privacy. However, collaborative filtering schemes with privacy commonly suffer from scalability and sparseness problems as the content in the domain proliferates. Moreover, applying privacy measures distorts the collected data, which in turn degrades the accuracy of such systems. In this work, we propose a novel privacy-preserving collaborative filtering scheme based on bisecting k-means clustering, in which we apply two preprocessing methods. The first preprocessing scheme deals with the scalability problem by constructing a binary decision tree through a bisecting k-means clustering approach, while the second produces clones of users by inserting pseudo-self-predictions into original user profiles to boost the accuracy of the scalability-enhanced structure. The sparse nature of collections is handled by transforming ratings into item features-based profiles. After analyzing our scheme with respect to privacy and supplementary costs, we perform experiments on benchmark data sets to evaluate it in terms of accuracy and online performance. Our empirical outcomes verify that the combined effects of the proposed preprocessing schemes relieve scalability problems and improve accuracy significantly. © 2013 Elsevier Ltd. All rights reserved.
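The tree-building idea mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `Node`, `two_means`, `build_tree`, `find_leaf`, and the `leaf_size` stopping rule are hypothetical, the farthest-point seeding is a simplification chosen for determinism, and the sketch omits the privacy masking, the user clones, and the item features-based transformation entirely. It only shows how repeatedly splitting users with 2-means could yield a binary tree whose leaves serve as small neighborhoods.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


def sq_dist(a: List[float], b: List[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points: List[List[float]]) -> List[float]:
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(points) for col in zip(*points)]


def two_means(points: List[List[float]], iters: int = 20) -> Tuple[List[int], List[int]]:
    """Split points into two groups with plain 2-means; returns index lists."""
    # Assumed seeding: farthest point from the mean, then farthest from that.
    center = mean(points)
    c0 = max(points, key=lambda p: sq_dist(p, center))
    c1 = max(points, key=lambda p: sq_dist(p, c0))
    left: List[int] = []
    right: List[int] = []
    for _ in range(iters):
        new_left = [i for i, p in enumerate(points) if sq_dist(p, c0) <= sq_dist(p, c1)]
        new_right = [i for i, p in enumerate(points) if sq_dist(p, c0) > sq_dist(p, c1)]
        converged = new_left == left
        left, right = new_left, new_right
        if converged or not left or not right:
            break
        c0 = mean([points[i] for i in left])
        c1 = mean([points[i] for i in right])
    return left, right


@dataclass
class Node:
    centroid: List[float]
    members: List[int]  # indices of the users stored under this node
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build_tree(profiles: List[List[float]], members: List[int], leaf_size: int) -> Node:
    """Recursively bisect the given users until a node holds at most leaf_size."""
    pts = [profiles[i] for i in members]
    node = Node(mean(pts), list(members))
    if len(members) <= leaf_size:
        return node
    left, right = two_means(pts)
    if not left or not right:  # e.g. all profiles identical: keep as a leaf
        return node
    node.left = build_tree(profiles, [members[i] for i in left], leaf_size)
    node.right = build_tree(profiles, [members[i] for i in right], leaf_size)
    return node


def find_leaf(node: Node, query: List[float]) -> Node:
    """Walk from the root toward the closer child centroid until reaching a leaf."""
    while node.left is not None and node.right is not None:
        go_left = sq_dist(query, node.left.centroid) <= sq_dist(query, node.right.centroid)
        node = node.left if go_left else node.right
    return node


# Toy dense profiles (made-up values): two groups with opposite tastes.
profiles = [[5, 5, 1], [5, 4, 1], [4, 5, 1], [1, 1, 5], [1, 2, 5], [2, 1, 5]]
tree = build_tree(profiles, list(range(len(profiles))), leaf_size=3)
print(find_leaf(tree, [4, 4, 2]).members)  # [0, 1, 2]
```

In this sketch a query follows one root-to-leaf path, so the online cost depends on the tree depth and the leaf size rather than on the total number of users, which is the kind of scalability benefit the abstract attributes to the tree structure.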
Pages: 912-927
Number of pages: 16