Detecting and Characterizing Extremist Reviewer Groups in Online Product Reviews

被引:14
作者
Gupta, Viresh [1 ]
Aggarwal, Aayush [1 ]
Chakraborty, Tanmoy [1 ]
机构
[1] IIIT Delhi, Dept Comp Sci & Engn, New Delhi 110020, India
关键词
Extremities; Feature extraction; Writing; Unsolicited e-mail; Annotations; Itemsets; Task analysis; Behaviour; electronic commerce; machine intelligence; machine learning; reviews; social computing; web mining; CUSTOMER REVIEWS; STRENGTH;
D O I
10.1109/TCSS.2020.2988098
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Online marketplaces often witness opinion spam in the form of reviews. People are often hired to target specific brands for promoting or impeding them by writing highly positive or negative reviews. This often is done collectively in groups. Although some previous studies attempted to identify and analyze such opinion spam groups, little has been explored to spot those groups who target a brand as a whole, instead of just products. In this article, we collected the reviews from the Amazon product review site and manually labeled a set of 923 candidate reviewer groups. The groups are extracted using frequent itemset mining over brand similarities such that users are clustered together if they have mutually reviewed (products of) a lot of brands. We hypothesize that the nature of the reviewer groups is dependent on eight features specific to a (group, brand) pair. We develop a feature-based supervised model to classify candidate groups as extremist entities. We run multiple classifiers for the task of classifying a group based on the reviews written by the users of that group to determine whether the group shows signs of extremity. A three-layer perceptron-based classifier turns out to be the best classifier. We further study behaviors of such groups in detail to understand the dynamics of brand-level opinion fraud better. These behaviors include consistency in ratings, review sentiment, verified purchase, review dates, and helpful votes received on reviews. Surprisingly, we observe that there are a lot of verified reviewers showing extreme sentiment, which, on further investigation, leads to ways to circumvent the existing mechanisms in place to prevent unofficial incentives on Amazon.
引用
收藏
页码:741 / 750
页数:10
相关论文
共 59 条
[1]  
Almahairi A., 2018, ARXIV180606875
[2]  
Amazon.in, 2018, REV COMM GUID
[3]  
[Anonymous], 2006, CIKM, DOI DOI 10.1145/1183614.1183625
[4]  
[Anonymous], 2011, P IEEE 11 INT C DAT, DOI DOI 10.1109/ICDM.2011.124
[5]  
[Anonymous], 2013, P 6 WORKSHOP PHD STU
[6]  
[Anonymous], 2011, P 49 ANN M ASS COMP
[7]  
[Anonymous], 2005, P C HUM LANG TECHN E
[8]  
[Anonymous], P 7 C INF LANG RES E
[9]  
Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
[10]   Frequent item set mining [J].
Borgelt, Christian .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (06) :437-456