A scalable privacy-preserving recommendation scheme via bisecting k-means clustering

被引:45
作者
Bilge, Alper [1 ]
Polat, Huseyin [1 ]
机构
[1] Anadolu Univ, Dept Comp Engn, TR-26555 Eskisehir, Turkey
关键词
Accuracy; Binary decision diagrams; Clustering methods; Data preprocessing; Data privacy; Recommender systems; SYSTEMS;
D O I
10.1016/j.ipm.2013.02.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy-preserving collaborative filtering is an emerging web-adaptation tool to cope with information overload problem without jeopardizing individuals' privacy. However, collaborative filtering with privacy schemes commonly suffer from scalability and sparseness as the content in the domain proliferates. Moreover, applying privacy measures causes a distortion in collected data, which in turn defects accuracy of such systems. In this work, we propose a novel privacy-preserving collaborative filtering scheme based on bisecting k-means clustering in which we apply two preprocessing methods. The first preprocessing scheme deals with scalability problem by constructing a binary decision tree through a bisecting k-means clustering approach while the second produces clones of users by inserting pseudo-self-predictions into original user profiles to boost accuracy of scalability-enhanced structure. Sparse nature of collections are handled by transforming ratings into item features-based profiles. After analyzing our scheme with respect to privacy and supplementary costs, we perform experiments on benchmark data sets to evaluate it in terms of accuracy and online performance. Our empirical outcomes verify that combined effects of the proposed preprocessing schemes relieve scalability and augment accuracy significantly. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:912 / 927
页数:16
相关论文
共 41 条
  • [1] A collaborative filtering method based on artificial immune network
    Acilar, A. Merve
    Arslan, Ahmet
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 8324 - 8332
  • [2] Ackerman M. S., 1999, P 1 ACM C EL COMM
  • [3] Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions
    Adomavicius, G
    Tuzhilin, A
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) : 734 - 749
  • [4] Agrawal D., 2001, PROC 20 ACM SIGMOD S, P247, DOI [10.1145/375551.375602, DOI 10.1145/375551.375602]
  • [5] ALAMBIC:: a privacy-preserving recommender system for electronic commerce
    Aimeur, Esma
    Brassard, Gilles
    Fernandez, Jose M.
    Onana, Flavien Serge Mani
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2008, 7 (05) : 307 - 334
  • [6] Fuzzy-genetic approach to recommender systems based on a novel hybrid user model
    Al-Shamri, Mohammad Yahya H.
    Bharadwaj, Kamal K.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (03) : 1386 - 1399
  • [7] Cluster searching strategies for collaborative recommendation systems
    Altingovde, Ismail Sengor
    Subakan, Oezlem Nurcan
    Ulusoy, Ozgur
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (03) : 688 - 697
  • [8] [Anonymous], 2000, KDD WORKSH TEXT MIN
  • [9] The impact of data obfuscation on the accuracy of collaborative filtering
    Berkovsky, Shlomo
    Kuflik, Tsvi
    Ricci, Francesco
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (05) : 5033 - 5042
  • [10] Bilge Alper, 2010, Proceedings 2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT), P143, DOI 10.1109/WI-IAT.2010.109