An ensemble approach for spam detection in Arabic opinion texts

被引:31
|
作者
Saeed, Radwa M. K. [1 ]
Rady, Sherine [1 ]
Gharib, Tarek F. [1 ]
机构
[1] Ain Shams Univ, Fac Comp & Informat Sci, Informat Syst Dept, Cairo 11566, Egypt
关键词
Arabic spam reviews detection; N-gram features; Content-based features; Negation handling; Machine learning; Ensemble approach;
D O I
10.1016/j.jksuci.2019.10.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, individuals express experiences and opinions through online reviews. These has an influence on online marketing and obtaining real knowledge about products and services. However, some of the online reviews can be unreal. They may have been written to promote low-quality products/services or sabotage a product/service reputation to mislead potential customers. Such misleading reviews are known as spam reviews and require crucial attention. Prior spam detection research focused on English reviews, with less attention to other languages. The detection of spam reviews in Arabic online sources is a relatively new topic despite the relatively huge amount of data generated. Therefore, this paper contributes to such topic by presenting four different Arabic spam reviews detection methods, while putting more focus towards the construction and evaluation of an ensemble approach. The proposed ensemble method is based on integrating a rule-based classifier with machine learning techniques, while utilizing content-based features that depend on N-gram features and Negation handling. The four proposed methods are evaluated on two datasets of different sizes. The results indicate the efficiency of the ensemble approach where it achieves a classification accuracy of 95.25% and 99.98% for the two experimented datasets and outperforming existing related work by far of 25%. (c) 2019 The Authors. Production and hosting by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:1407 / 1416
页数:10
相关论文
共 50 条
  • [1] A real-time framework for opinion spam detection in Arabic social networks
    Ezzat, Cherry A.
    Alkadri, Abdullah M.
    Elkorany, Abeer
    EGYPTIAN INFORMATICS JOURNAL, 2025, 29
  • [2] A Proposed Spam Detection Approach for Arabic Social Networks Content
    Mataoui, M'hamed
    Zelmati, Omar
    Boughaci, Dalila
    Chaouche, Moncef
    Lagoug, Fatima
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MATHEMATICS AND INFORMATION TECHNOLOGY (ICMIT), 2017, : 222 - 226
  • [3] Optimizing Sentiment Classification for Arabic Opinion Texts
    Radwa M. K. Saeed
    Sherine Rady
    Tarek F. Gharib
    Cognitive Computation, 2021, 13 : 164 - 178
  • [4] Optimizing Sentiment Classification for Arabic Opinion Texts
    Saeed, Radwa M. K.
    Rady, Sherine
    Gharib, Tarek F.
    COGNITIVE COMPUTATION, 2021, 13 (01) : 164 - 178
  • [5] Opinion Spam Detection through User Behavioral Graph Partitioning Approach
    Manaskasemsak, Bundit
    Chanmakho, Chayada
    Klainongsuang, Jakapong
    Rungsawang, Arnon
    2019 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, METAHEURISTICS & SWARM INTELLIGENCE (ISMSI 2019), 2019, : 73 - 77
  • [6] An approach to spam detection by Naive Bayes ensemble based on decision induction
    Yang, Zhen
    Nie, Xiangfei
    Xu, Weiran
    Guo, Jun
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 861 - +
  • [7] Ensemble Decision for Spam Detection Using Term Space Partition Approach
    Tan, Ying
    Wang, Quanbin
    Mi, Guyue
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (01) : 297 - 309
  • [8] A Neural Network-Based Ensemble Approach for Spam Detection in Twitter
    Madisetty, Sreekanth
    Desarkar, Maunendra Sankar
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2018, 5 (04): : 973 - 984
  • [9] Ensemble Classifiers for Spam Review Detection
    Ibrahim, Alhassan J.
    Siraj, Maheyzah Md
    Din, Mazura Mat
    2017 IEEE CONFERENCE ON APPLICATION, INFORMATION AND NETWORK SECURITY (AINS), 2017, : 130 - 134
  • [10] Spam Detection Using Ensemble Learning
    Gupta, Vashu
    Mehta, Aman
    Goel, Akshay
    Dixit, Utkarsh
    Pandey, Avinash Chandra
    HARMONY SEARCH AND NATURE INSPIRED OPTIMIZATION ALGORITHMS, 2019, 741 : 661 - 668