Spam review detection using self attention based CNN and bi-directional LSTM

被引:40
作者
Bhuvaneshwari, P. [1 ]
Rao, A. Nagaraja [1 ]
Robinson, Y. Harold [2 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Vellore, Tamil Nadu, India
[2] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore, Tamil Nadu, India
关键词
E-commerce; Opinion spam reviews; Machine learning; Deep learning; Self attention-based CNN Bi-LSTM (ACB) model; Convolution neural network; Self-attention mechanism; Bidirectional long short term memory;
D O I
10.1007/s11042-021-10602-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Opinion reviews are a valuable source of information in e-commerce. Indeed, it benefits users in buying decisions and businesses to enhance their quality. However, various greedy organizations employ spammers to post biased spam reviews to gain an advantage or to degrade the reputation of a competitor. This results in the explosive growth of opinion spamming. Due to its nature and their increasing volume, spam reviews are a fast-growing serious issue on the internet. Until now, researchers have developed many Machine Learning (ML) based methods to identify opinion spam reviews. However, the traditional ML methods cannot effectively detect spam messages due to the limited feature representations and the data manipulations done by spammers to escape from the detection mechanism. As an alternative to ML-based detection, in this paper, we proposed a Deep Learning (DL) based novel framework called Self Attention-based CNN Bi-LSTM (ACB) model to learn document level representation for identifying the spam reviews. Our approach computes the weightage of each word present in the sentence and identifies the spamming clues exists in the document with an attention mechanism. Then the model learns sentence representation by using Convolution Neural Network (CNN) and extracts the higher-level n-gram features. Then finally, sentence vectors are combined using Bi-directional LSTM (Bi-LSTM) as document feature vectors and identify the spam reviews with contextual information. The evaluated experiment results are compared with its variants and the result shows that ACB outperforms other variants in terms of classification accuracy.
引用
收藏
页码:18107 / 18124
页数:18
相关论文
共 32 条
[21]   Spam review detection using spiral cuckoo search clustering method [J].
Pandey, Avinash Chandra ;
Rajpoot, Dharmveer Singh .
EVOLUTIONARY INTELLIGENCE, 2019, 12 (02) :147-164
[22]   Collective Opinion Spam Detection: Bridging Review Networks and Metadata [J].
Rayana, Shebuti ;
Akoglu, Leman .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :985-994
[23]  
Ren Y., 2016, P COLING 2016 26 INT, P140
[24]   Neural networks for deceptive opinion spam detection: An empirical study [J].
Ren, Yafeng ;
Ji, Donghong .
INFORMATION SCIENCES, 2017, 385 :213-224
[25]  
Sedighi Z., 2019, 32 INT FLAIRS C
[26]  
Wang C.-C., 2018, P 2 INT C E COMM E B, P16, DOI DOI 10.1145/3234781.3234794
[27]  
Wang X., 2017, P NAT CCF C NAT LANG, P866
[28]   Discovering Opinion Spammer Groups by Network Footprints [J].
Ye, Junting ;
Akoglu, Leman .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT I, 2015, 9284 :267-282
[29]   Deep learning for sentiment analysis: A survey [J].
Zhang, Lei ;
Wang, Shuai ;
Liu, Bing .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (04)
[30]   DRI-RCNN: An approach to deceptive review identification using recurrent convolutional neural network [J].
Zhang, Wen ;
Du, Yuhang ;
Yoshida, Taketoshi ;
Wang, Qing .
INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (04) :576-592