NetSpam: A Network-Based Spam Detection Framework for Reviews in Online Social Media

被引:98
作者
Shehnepoor, Saeedreza [1 ]
Salehi, Mostafa [1 ]
Farahbakhsh, Reza [2 ]
Crespi, Noel [2 ]
机构
[1] Univ Tehran, Tehran 1439957131, Iran
[2] Telecom Sud Paris, Inst Mines Telecom, F-91011 Paris, France
基金
美国国家科学基金会;
关键词
Social media; social network; spammer; spam review; fake review; heterogeneous information networks; FAKE;
D O I
10.1109/TIFS.2017.2675361
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nowadays, a big part of people rely on available content in social media in their decisions (e.g., reviews and feedback on a topic or product). The possibility that anybody can leave a review provides a golden opportunity for spammers to write spam reviews about products and services for different interests. Identifying these spammers and the spam content is a hot topic of research, and although a considerable number of studies have been done recently toward this end, but so far the methodologies put forth still barely detect spam reviews, and none of them show the importance of each extracted feature type. In this paper, we propose a novel framework, named NetSpam, which utilizes spam features for modeling review data sets as heterogeneous information networks to map spam detection procedure into a classification problem in such networks. Using the importance of spam features helps us to obtain better results in terms of different metrics experimented on real-world review data sets from Yelp and Amazon Web sites. The results show that NetSpam outperforms the existing methods and among four categories of features, including review-behavioral, user-behavioral, review-linguistic, and user-linguistic, the first type of features performs better than the other categories.
引用
收藏
页码:1585 / 1595
页数:11
相关论文
共 37 条
[31]  
Sun Y., 2012, P ICCCE, P159
[32]  
Sun Y.F., 2009, Proc.,71st EAGE Conference, Amsterdam, Netherlands, P1
[33]  
Viswanath B., 2014, P USENIX, P1
[34]  
Wahyuni E. D., 2016, P MATEC WEB C, P1
[35]  
Weise Karen, A Lie Detector Test for Online Reviewers-Bloomberg
[36]  
Xu C., 2014, P SIAM INT C DAT MIN, P172
[37]   Trust-Aware Review Spam Detection [J].
Xue, Hao ;
Li, Fengjun ;
Seo, Hyunjin ;
Pluretti, Roseann .
2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 1, 2015, :726-733