Towards automatic filtering of fake reviews

被引:61
作者
Cardoso, Emerson F. [1 ]
Silva, Renato M. [1 ]
Almeida, Tiago A. [1 ]
机构
[1] Fed Univ Sao Carlos UFSCar, Dept Comp Sci, Sorocaba, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Fake review; Spam review; Natural language processing; Text categorization; Machine learning; SOCIAL SPAMMERS; MESSAGES;
D O I
10.1016/j.neucom.2018.04.074
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online opinions significantly influence consumer purchase decisions. Unfortunately, this has led to a dramatic increase of fake (or spam) reviews that can damage the reputation of brands and artificially manipulate users' perceptions about products and companies. Despite the efforts of several studies on fake review detection, important questions still remain open. For instance, there is no consensus if the performance of the classification methods is affected when they are used in real-world scenarios that require online learning. Moreover, it is also not known if the performance of the methods decreases due to the time-ordered nature of the reviews. To answer these and other important open questions, this work presents a comprehensive analysis of content-based classification methods for fake review detection. The experiments were performed in multiple settings, employing different types of learning and datasets. A careful analysis of the results provided sufficient evidence to respond appropriately to the open questions, which can be used as a baseline for future studies. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:106 / 116
页数:11
相关论文
共 55 条
[1]  
Al Najada H, 2014, 2014 IEEE 15TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), P553, DOI 10.1109/IRI.2014.7051938
[2]   Post or Block? Advances in Automatically Filtering Undesired Comments [J].
Alberto, Tulio C. ;
Lochter, Johannes V. ;
Almeida, Tiago A. .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2015, 80 :S245-S259
[3]  
Algur S. P., 2010, 2010 International Conference on Signal and Image Processing (ICSIP 2010), P416, DOI 10.1109/ICSIP.2010.5697509
[4]   Combating Comment Spam with Machine Learning Approaches [J].
Alsaleh, Mansour ;
Alarifi, Abdulrahman ;
Al-Quayed, Fatima ;
Al-Salman, AbdulMalik .
2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, :295-300
[5]  
[Anonymous], INT J INFORM SECURIT
[6]  
[Anonymous], 1998, P AAAI 98 WORKSH LEA, DOI DOI 10.1109/TSMC.1985.6313426
[7]  
[Anonymous], 2011, 49 ANN M ASS COMP LI, DOI DOI 10.1145/2567948.2577293
[8]  
[Anonymous], 2004, P 21 INT C MACH LEAR, DOI DOI 10.1145/1015330.1015332
[9]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32