Machine Learning-Based Opinion Spam Detection: A Systematic Literature Review

被引:0
|
作者
Qazi, Atika [1 ]
Hasan, Najmul [2 ]
Mao, Rui [3 ]
Elhag Mohamed Abo, Mohamed [4 ]
Kumar Dey, Samrat [5 ]
Hardaker, Glenn [6 ]
机构
[1] Univ Brunei Darussalam, Ctr Lifelong Learning, Bandar Seri Begawan BE1410, Brunei
[2] BRAC Univ, BRAC Business Sch, Dhaka 1212, Bangladesh
[3] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore 6397984, Singapore
[4] Future Univ, Fac Comp Sci, Khartoum 10553, Sudan
[5] Bangladesh Open Univ BOU, Sch Sci & Technol SST, Gazipur 1705, Bangladesh
[6] King Abdullah Univ Sci & Technol, Off Provost, Thuwal 23955, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Reviews; Business; Sentiment analysis; Decision making; Unsolicited e-mail; Systematics; Object recognition; Detection algorithms; Group spam; machine learning; opinion mining; opinion spam detection; spammer detection; sentiment analysis; INFORMATION; NETWORK; MODEL;
D O I
10.1109/ACCESS.2024.3399264
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The substantial upsurge in Web 2.0 deployment allows a large number of consumers to share their opinions with potential consumers and producers through product and service review platforms. Customer decision-making relies on an important aspect of review: spam-free opinions. Unfortunately, fraudulent activities produced spam reviews that misled potential buyers and traders. Subsequently, it does stop opinion-mining techniques from reaching accurate conclusions. This study aims to identify and review existing state-of-the-art methodologies for three groups: 1) spam reviews; 2) individual spammers; and 3) group spam. The machine learning (ML) and deep learning (DL) techniques for spam detection are categorized, and we establish an assessment that may be deemed appropriate in the field. The findings reveal a total of 10 metrics, with accuracy being the most often used (25%) concerning ML-based techniques in spam detection. Followed by recall in 24% of studies, and precision in 22% of studies. In addition, the F-measure, the area under the curve (AUC), and F1-score evaluation metrics reveal that the use of the Amazon dataset as a whole increased by 7%. This study concludes that the majority of SMS spam filtering strategies are leading solutions. In addition, the taxonomy of existing state-of-the-art methodologies is developed, and it is concluded that a substantial number of studies utilize these existing SMS anti-spam applications. This research uncovered previously unexplored areas of ML and DL's application to spam review and provided a new paradigm for applying these technologies to the issue. The findings provide both academics and practitioners with a deeper understanding of the barriers to spam review identification, as well as potential improvement opportunities using ML techniques. These individuals will now have the chance to consider a benchmark for future research.
引用
收藏
页码:143485 / 143499
页数:15
相关论文
共 50 条
  • [1] Machine Learning-Based Detection of Spam Emails
    Bin Siddique, Zeeshan
    Khan, Mudassar Ali
    Din, Ikram Ud
    Almogren, Ahmad
    Mohiuddin, Irfan
    Nazir, Shah
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [2] Machine and Deep Learning-based XSS Detection Approaches: A Systematic Literature Review
    Thajeel, Isam Kareem
    Samsudin, Khairulmizam
    Hashim, Shaiful Jahari
    Hashim, Fazirulhisyam
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (07)
  • [3] Machine learning-based heart disease diagnosis: A systematic literature review
    Ahsan, Md Manjurul
    Siddique, Zahed
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 128
  • [4] A Performance Evaluation of Machine Learning-Based Streaming Spam Tweets Detection
    Chen, Chao
    Zhang, Jun
    Xie, Yi
    Xiang, Yang
    Zhou, Wanlei
    Hassan, Mohammad Mehedi
    AlElaiwi, Abdulhameed
    Alrubaian, Majed
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2015, 2 (03) : 65 - 76
  • [5] Vulnerability detection through machine learning-based fuzzing: A systematic review
    Chafjiri, Sadegh Bamohabbat
    Legg, Phil
    Hong, Jun
    Tsompanas, Michail-Antisthenis
    COMPUTERS & SECURITY, 2024, 143
  • [6] Learning textual features for Twitter spam detection: A systematic literature review
    Abkenar, Sepideh Bazzaz
    Kashani, Mostafa Haghi
    Akbari, Mohammad
    Mahdipour, Ebrahim
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [7] A systematic literature review of machine learning-based disease profiling and personalized treatment
    Buettner, Ricardo
    Klenk, Florian
    Ebert, Marc
    2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 1673 - 1678
  • [8] Machine Learning-Based Resource Management in Fog Computing: A Systematic Literature Review
    Khan, Fahim Ullah
    Shah, Ibrar Ali
    Jan, Sadaqat
    Ahmad, Shabir
    Whangbo, Taegkeun
    SENSORS, 2025, 25 (03)
  • [9] MACHINE LEARNING BASED TWITTER SPAM ACCOUNT DETECTION: A REVIEW
    Gheewala, Shivangi
    Patel, Rakesh
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2018), 2018, : 79 - 84
  • [10] Spam Review Detection Techniques: A Systematic Literature Review
    Hussain, Naveed
    Mirza, Hamid Turab
    Rasool, Ghulam
    Hussain, Ibrar
    Kaleem, Mohammad
    APPLIED SCIENCES-BASEL, 2019, 9 (05):