Spam review detection using self attention based CNN and bi-directional LSTM

被引:40
作者
Bhuvaneshwari, P. [1 ]
Rao, A. Nagaraja [1 ]
Robinson, Y. Harold [2 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Vellore, Tamil Nadu, India
[2] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore, Tamil Nadu, India
关键词
E-commerce; Opinion spam reviews; Machine learning; Deep learning; Self attention-based CNN Bi-LSTM (ACB) model; Convolution neural network; Self-attention mechanism; Bidirectional long short term memory;
D O I
10.1007/s11042-021-10602-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Opinion reviews are a valuable source of information in e-commerce. Indeed, it benefits users in buying decisions and businesses to enhance their quality. However, various greedy organizations employ spammers to post biased spam reviews to gain an advantage or to degrade the reputation of a competitor. This results in the explosive growth of opinion spamming. Due to its nature and their increasing volume, spam reviews are a fast-growing serious issue on the internet. Until now, researchers have developed many Machine Learning (ML) based methods to identify opinion spam reviews. However, the traditional ML methods cannot effectively detect spam messages due to the limited feature representations and the data manipulations done by spammers to escape from the detection mechanism. As an alternative to ML-based detection, in this paper, we proposed a Deep Learning (DL) based novel framework called Self Attention-based CNN Bi-LSTM (ACB) model to learn document level representation for identifying the spam reviews. Our approach computes the weightage of each word present in the sentence and identifies the spamming clues exists in the document with an attention mechanism. Then the model learns sentence representation by using Convolution Neural Network (CNN) and extracts the higher-level n-gram features. Then finally, sentence vectors are combined using Bi-directional LSTM (Bi-LSTM) as document feature vectors and identify the spam reviews with contextual information. The evaluated experiment results are compared with its variants and the result shows that ACB outperforms other variants in terms of classification accuracy.
引用
收藏
页码:18107 / 18124
页数:18
相关论文
共 32 条
[1]  
Crawford Michael, 2016, 20 9 INT FLAIRS C
[2]   An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews [J].
Dong, Lu-yu ;
Ji, Shu-juan ;
Zhang, Chun-jin ;
Zhang, Qi ;
Chiu, DicksonK. W. ;
Qiu, Li-Qing ;
Li, Da .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 :210-223
[3]   Support vector machines for spam categorization [J].
Drucker, H ;
Wu, DH ;
Vapnik, VN .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) :1048-1054
[4]   An approach to the use of word embeddings in an opinion classification task [J].
Enriquez, Fernando ;
Troyano, Jose A. ;
Lopez-Solaz, Tomas .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 66 :1-6
[5]   Self Multi-Head Attention-based Convolutional Neural Networks for fake news detection [J].
Fang, Yong ;
Gao, Jian ;
Huang, Cheng ;
Peng, Hua ;
Wu, Runpu .
PLOS ONE, 2019, 14 (09)
[6]  
Feng S., 2012, P 50 ANN M ASS COMP, V2, P171, DOI DOI 10.5555/2390665.2390708
[7]   A Model of Visual Attention for Natural Image Retrieval [J].
Liu, Guanghai ;
Fan, Dengping .
PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CLOUD COMPUTING COMPANION (ISCC-C), 2014, :728-733
[8]  
Harris C.G., 2012, WORKSH 26 AAAI C ART, VWS-12-08, P87
[9]   Detection of review spam: A survey [J].
Heydari, Atefeh ;
Tavakoli, Mohammad Ali ;
Salim, Naomie ;
Heydari, Zahra .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (07) :3634-3642
[10]  
Horrigan J., 2008, Online Shopping