Understanding Large-Scale Network Effects in Detecting Review Spammers

Cited by: 4
Authors
Rout, Jitendra Kumar [1]
Sahoo, Kshira Sagar [2]
Dalmia, Anmol
Bakshi, Sambit [3]
Bilal, Muhammad [4]
Song, Houbing [5]
Affiliations
[1] Natl Inst Technol Raipur, Dept Comp Sci & Engn, Raipur 492010, India
[2] Umea Univ, Dept Comp Sci, SE-90187 Umea, Sweden
[3] Natl Inst Technol Rourkela, Dept Comp Sci & Engn, Rourkela 769008, Odisha, India
[4] Hankuk Univ Foreign Studies, Dept Comp Engn, Yongin 17035, South Korea
[5] Univ Maryland Baltimore Cty UMBC, Dept Informat Syst, Baltimore, MD 21250 USA
Keywords
Feature extraction; Behavioral sciences; Analytical models; Writing; Unsolicited e-mail; Sentiment analysis; Scalability; Online review spam; opinion spam; review graphs; spam detection; unlabeled review; FRAMEWORK;
DOI
10.1109/TCSS.2023.3243139
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Opinion spam detection is a challenge for online review systems and social forum operators. Opinion spamming costs businesses and people money because it deceives customers, as well as automated opinion mining and sentiment analysis systems, by bestowing undeserved positive opinions on target firms and/or fake negative opinions on others. One popular detection approach is to model a review system as a network of users, products, and reviews, for example using review graph models. In this article, we study the effects of network scale on network-based review spammer detection models, specifically the trust model and the SpammerRank model. We evaluate both network models on two large publicly available review datasets: the Amazon dataset (containing 6 million reviews by more than 2 million reviewers) and the UCSD dataset (containing over 82 million reviews by 21 million reviewers). We observe that the SpammerRank model offers better scaling time for applications that require reviewer indicators, whereas for the trust model the distributions flatten out, indicating variance of reviews with respect to spamming. Detailed observations on the scaling effects of these models are reported in the results section.
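The network view mentioned in the abstract (users, products, and reviews as linked nodes) can be illustrated with a minimal sketch. This is not the trust model or the SpammerRank model from the article; it only builds the underlying adjacency structure. The function name `build_review_graph`, the tuple layout, and the sample identifiers are assumptions made for illustration.

```python
from collections import defaultdict


def build_review_graph(reviews):
    """Index reviews by reviewer and by product.

    `reviews` is an iterable of (review_id, reviewer_id, product_id)
    tuples; this layout is a hypothetical one chosen for the sketch.
    Each review node links exactly one reviewer to exactly one product.
    """
    by_reviewer = defaultdict(list)  # reviewer -> reviews they wrote
    by_product = defaultdict(list)   # product  -> reviews it received
    for review_id, reviewer_id, product_id in reviews:
        by_reviewer[reviewer_id].append(review_id)
        by_product[product_id].append(review_id)
    return dict(by_reviewer), dict(by_product)


# Toy data (invented for illustration, not taken from either dataset).
sample = [
    ("r1", "alice", "p1"),
    ("r2", "alice", "p2"),
    ("r3", "bob", "p1"),
]
by_reviewer, by_product = build_review_graph(sample)
```

Network-based detectors typically propagate scores over links like these; how the scores are defined and updated is specific to each model and is not shown here.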
Pages: 4994-5004
Page count: 11