Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining

被引:105
作者
Hajek, Petr [1 ]
Barushka, Aliaksandr [1 ]
Munk, Michal [2 ]
机构
[1] Univ Pardubice, Inst Syst Engn & Informat, Fac Econ & Adm, Studentska 84, Pardubice 53210, Czech Republic
[2] Constantine Philosopher Univ Nitra, Dept Comp Sci, Nitra 94974, Slovakia
关键词
Neural network; Deep learning; Fake review; Review spam; Word embedding; Emotion; OPINION SPAM DETECTION; SENTIMENT ANALYSIS; PRODUCT REVIEWS; SOCIAL NETWORKS; FRAMEWORK; TEXT;
D O I
10.1007/s00521-020-04757-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fake consumer review detection has attracted much interest in recent years owing to the increasing number of Internet purchases. Existing approaches to detect fake consumer reviews use the review content, product and reviewer information and other features to detect fake reviews. However, as shown in recent studies, the semantic meaning of reviews might be particularly important for text classification. In addition, the emotions hidden in the reviews may represent another potential indicator of fake content. To improve the performance of fake review detection, here we propose two neural network models that integrate traditional bag-of-words as well as the word context and consumer emotions. Specifically, the models learn document-level representation by using three sets of features: (1) n-grams, (2) word embeddings and (3) various lexicon-based emotion indicators. Such a high-dimensional feature representation is used to classify fake reviews into four domains. To demonstrate the effectiveness of the presented detection systems, we compare their classification performance with several state-of-the-art methods for fake review detection. The proposed systems perform well on all datasets, irrespective of their sentiment polarity and product category.
引用
收藏
页码:17259 / 17274
页数:16
相关论文
共 76 条
[1]   Detecting opinion spams and fake news using text classification [J].
Ahmed, Hadeer ;
Traore, Issa ;
Saad, Sherif .
SECURITY AND PRIVACY, 2018, 1 (01)
[2]   Sentiment Analysis Over Social Networks: An Overview [J].
Ahmed, Khaled ;
El Tazi, Neamat ;
Hossny, Ahmad Hany .
2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, :2174-2179
[3]  
[Anonymous], 2011, P IEEE 11 INT C DAT, DOI DOI 10.1109/ICDM.2011.124
[4]  
[Anonymous], 2015, P INT AAAI C WEB SOC
[5]  
[Anonymous], 2018, TIMES
[6]  
Baccianella S., 2010, LREC 2010 7 INT C LA, P2200
[7]   A framework for fake review detection in online consumer electronics retailers [J].
Barbado, Rodrigo ;
Araque, Oscar ;
Iglesias, Carlos A. .
INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (04) :1234-1244
[8]  
Barushka A., 2019, IFIP International Conference on Artificial Intelligence Applications and Innovations, P340
[9]   Spam Filtering in Social Networks Using Regularized Deep Neural Networks with Ensemble Learning [J].
Barushka, Aliaksandr ;
Hajek, Petr .
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018, 2018, 519 :38-49
[10]   Spam filtering using integrated distribution-based balancing approach and regularized deep neural networks [J].
Barushka, Aliaksandr ;
Hajek, Petr .
APPLIED INTELLIGENCE, 2018, 48 (10) :3538-3556