Shilling attack detection utilizing semi-supervised learning method for collaborative recommender system

被引:102
作者
Cao, Jie [1 ]
Wu, Zhiang [1 ]
Mao, Bo [1 ]
Zhang, Yanchun [2 ]
机构
[1] Nanjing Univ Finance & Econ, Jiangsu Prov Key Lab E Business, Nanjing, Peoples R China
[2] Victoria Univ, Sch Comp Sci & Math, Melbourne, Vic 8001, Australia
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2013年 / 16卷 / 5-6期
基金
对外科技合作项目(国际科技项目); 中国国家自然科学基金;
关键词
semi-supervised learning; shilling attack detection; collaborative filtering; naive Bayes; EM; CLASSIFICATION;
D O I
10.1007/s11280-012-0164-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Collaborative filtering (CF) technique is capable of generating personalized recommendations. However, the recommender systems utilizing CF as their key algorithms are vulnerable to shilling attacks which insert malicious user profiles into the systems to push or nuke the reputations of targeted items. There are only a small number of labeled users in most of the practical recommender systems, while a large number of users are unlabeled because it is expensive to obtain their identities. In this paper, Semi-SAD, a new semi-supervised learning based shilling attack detection algorithm is proposed to take advantage of both types of data. It first trains a na < ve Bayes classifier on a small set of labeled users, and then incorporates unlabeled users with EM-lambda to improve the initial na < ve Bayes classifier. Experiments on MovieLens datasets are implemented to compare the efficiency of Semi-SAD with supervised learning based detector and unsupervised learning based detector. The results indicate that Semi-SAD can better detect various kinds of shilling attacks than others, especially against obfuscated and hybrid shilling attacks.
引用
收藏
页码:729 / 748
页数:20
相关论文
共 25 条
[1]  
[Anonymous], TECHNICAL REPORT
[2]  
Bell R. M., 2007, KDD CUP WORKSH 13 AC, P7, DOI DOI 10.1007/S007790170019
[3]  
Burke Robin, 2006, P 12 ACM SIGKDD INT, P542, DOI DOI 10.1145/1150402.1150465
[4]   Comparison of Collaborative Filtering Algorithms: Limitations of Current Techniques and Proposals for Scalable, High-Performance Recommender Systems [J].
Cacheda, Fidel ;
Carneiro, Victor ;
Fernandez, Diego ;
Formoso, Vreixo .
ACM TRANSACTIONS ON THE WEB, 2011, 5 (01)
[5]   ON THE EXPONENTIAL VALUE OF LABELED SAMPLES [J].
CASTELLI, V ;
COVER, TM .
PATTERN RECOGNITION LETTERS, 1995, 16 (01) :105-111
[6]   Exploring latent browsing graph for question answering recommendation [J].
Chiang, Meng-Fen ;
Peng, Wen-Chih ;
Yu, Philip S. .
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2012, 15 (5-6) :603-630
[7]  
Chirita Paul-Alexandru, 2005, P 7 ANN ACM INT WORK, P67
[8]  
Gunawardana A., 2009, Proceedings of the Third ACM Conference on Recommender Systems, V9, P117
[9]  
Hurley N., 2009, Proceedings of the third ACM conference on Recommender systems, P149, DOI DOI 10.1145/1639714.1639740
[10]  
Lam Shyong K., 2004, P 13 INT C WORLD WID, P393, DOI DOI 10.1145/988672.988726