Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining

被引:105
作者
Hajek, Petr [1 ]
Barushka, Aliaksandr [1 ]
Munk, Michal [2 ]
机构
[1] Univ Pardubice, Inst Syst Engn & Informat, Fac Econ & Adm, Studentska 84, Pardubice 53210, Czech Republic
[2] Constantine Philosopher Univ Nitra, Dept Comp Sci, Nitra 94974, Slovakia
关键词
Neural network; Deep learning; Fake review; Review spam; Word embedding; Emotion; OPINION SPAM DETECTION; SENTIMENT ANALYSIS; PRODUCT REVIEWS; SOCIAL NETWORKS; FRAMEWORK; TEXT;
D O I
10.1007/s00521-020-04757-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fake consumer review detection has attracted much interest in recent years owing to the increasing number of Internet purchases. Existing approaches to detect fake consumer reviews use the review content, product and reviewer information and other features to detect fake reviews. However, as shown in recent studies, the semantic meaning of reviews might be particularly important for text classification. In addition, the emotions hidden in the reviews may represent another potential indicator of fake content. To improve the performance of fake review detection, here we propose two neural network models that integrate traditional bag-of-words as well as the word context and consumer emotions. Specifically, the models learn document-level representation by using three sets of features: (1) n-grams, (2) word embeddings and (3) various lexicon-based emotion indicators. Such a high-dimensional feature representation is used to classify fake reviews into four domains. To demonstrate the effectiveness of the presented detection systems, we compare their classification performance with several state-of-the-art methods for fake review detection. The proposed systems perform well on all datasets, irrespective of their sentiment polarity and product category.
引用
收藏
页码:17259 / 17274
页数:16
相关论文
共 76 条
[11]   Spam Filtering Using Regularized Neural Networks with Rectified Linear Units [J].
Barushka, Aliaksandr ;
Hajek, Petr .
AI*IA 2016: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2016, 10037 :65-75
[12]  
Bravo-Marquez F, 2016, 2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016), P536, DOI [10.1109/WI.2016.0091, 10.1109/WI.2016.90]
[13]   Meta-level sentiment models for big social data analysis [J].
Bravo-Marquez, Felipe ;
Mendoza, Marcelo ;
Poblete, Barbara .
KNOWLEDGE-BASED SYSTEMS, 2014, 69 :86-99
[14]   Ranking learning algorithms: Using IBL and meta-learning on accuracy and time results [J].
Brazdil, PB ;
Soares, C ;
Da Costa, JP .
MACHINE LEARNING, 2003, 50 (03) :251-277
[15]  
BrightLocal, 2018, LOC CONS REV SURV 20
[16]  
Chandy R.H. Gu., 2012, P 2 JOINT WICOWAIRWE, P56
[17]   Harvesting Opinions and Emotions from Social Media Textual Resources [J].
Chatzakou, Despoina ;
Vakali, Athena .
IEEE INTERNET COMPUTING, 2015, 19 (04) :46-50
[18]   A study on real-time low-quality content detection on Twitter from the users' perspective [J].
Chen, Weiling ;
Yeo, Chai Kiat ;
Lau, Chiew Tong ;
Lee, Bu Sung .
PLOS ONE, 2017, 12 (08)
[19]  
Crawford M., 2015, Journal of Big Data, V2, P1
[20]  
ELMURNGI E, 2017, 7 INT C INN COMP TEC, P107, DOI [DOI 10.1109/INTECH.2017.8102442, 10.1109/INTECH.2017.8102442]