Machine Learning-Based Opinion Spam Detection: A Systematic Literature Review

被引：0

作者：

Qazi, Atika ^{[1
]}

Hasan, Najmul ^{[2
]}

Mao, Rui ^{[3
]}

Elhag Mohamed Abo, Mohamed ^{[4
]}

Kumar Dey, Samrat ^{[5
]}

Hardaker, Glenn ^{[6
]}

机构：

[1] Univ Brunei Darussalam, Ctr Lifelong Learning, Bandar Seri Begawan BE1410, Brunei

[2] BRAC Univ, BRAC Business Sch, Dhaka 1212, Bangladesh

[3] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore 6397984, Singapore

[4] Future Univ, Fac Comp Sci, Khartoum 10553, Sudan

[5] Bangladesh Open Univ BOU, Sch Sci & Technol SST, Gazipur 1705, Bangladesh

[6] King Abdullah Univ Sci & Technol, Off Provost, Thuwal 23955, Saudi Arabia

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Reviews; Business; Sentiment analysis; Decision making; Unsolicited e-mail; Systematics; Object recognition; Detection algorithms; Group spam; machine learning; opinion mining; opinion spam detection; spammer detection; sentiment analysis; INFORMATION; NETWORK; MODEL;

D O I：

10.1109/ACCESS.2024.3399264

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The substantial upsurge in Web 2.0 deployment allows a large number of consumers to share their opinions with potential consumers and producers through product and service review platforms. Customer decision-making relies on an important aspect of review: spam-free opinions. Unfortunately, fraudulent activities produced spam reviews that misled potential buyers and traders. Subsequently, it does stop opinion-mining techniques from reaching accurate conclusions. This study aims to identify and review existing state-of-the-art methodologies for three groups: 1) spam reviews; 2) individual spammers; and 3) group spam. The machine learning (ML) and deep learning (DL) techniques for spam detection are categorized, and we establish an assessment that may be deemed appropriate in the field. The findings reveal a total of 10 metrics, with accuracy being the most often used (25%) concerning ML-based techniques in spam detection. Followed by recall in 24% of studies, and precision in 22% of studies. In addition, the F-measure, the area under the curve (AUC), and F1-score evaluation metrics reveal that the use of the Amazon dataset as a whole increased by 7%. This study concludes that the majority of SMS spam filtering strategies are leading solutions. In addition, the taxonomy of existing state-of-the-art methodologies is developed, and it is concluded that a substantial number of studies utilize these existing SMS anti-spam applications. This research uncovered previously unexplored areas of ML and DL's application to spam review and provided a new paradigm for applying these technologies to the issue. The findings provide both academics and practitioners with a deeper understanding of the barriers to spam review identification, as well as potential improvement opportunities using ML techniques. These individuals will now have the chance to consider a benchmark for future research.

引用

页码：143485 / 143499

页数：15

共 50 条

[1] Machine Learning-Based Detection of Spam Emails
Bin Siddique, Zeeshan
Khan, Mudassar Ali
Din, Ikram Ud
Almogren, Ahmad
Mohiuddin, Irfan
Nazir, Shah
SCIENTIFIC PROGRAMMING, 2021, 2021
[2] Machine and Deep Learning-based XSS Detection Approaches: A Systematic Literature Review
Thajeel, Isam Kareem
Samsudin, Khairulmizam
Hashim, Shaiful Jahari
Hashim, Fazirulhisyam
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (07)
[3] Machine learning-based heart disease diagnosis: A systematic literature review
Ahsan, Md Manjurul
Siddique, Zahed
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 128
[4] A Performance Evaluation of Machine Learning-Based Streaming Spam Tweets Detection
Chen, Chao
Zhang, Jun
Xie, Yi
Xiang, Yang
Zhou, Wanlei
Hassan, Mohammad Mehedi
AlElaiwi, Abdulhameed
Alrubaian, Majed
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2015, 2 (03) : 65 - 76
[5] Vulnerability detection through machine learning-based fuzzing: A systematic review
Chafjiri, Sadegh Bamohabbat
Legg, Phil
Hong, Jun
Tsompanas, Michail-Antisthenis
COMPUTERS & SECURITY, 2024, 143
[6] Learning textual features for Twitter spam detection: A systematic literature review
Abkenar, Sepideh Bazzaz
Kashani, Mostafa Haghi
Akbari, Mohammad
Mahdipour, Ebrahim
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
[7] A systematic literature review of machine learning-based disease profiling and personalized treatment
Buettner, Ricardo
Klenk, Florian
Ebert, Marc
2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 1673 - 1678
[8] Machine Learning-Based Resource Management in Fog Computing: A Systematic Literature Review
Khan, Fahim Ullah
Shah, Ibrar Ali
Jan, Sadaqat
Ahmad, Shabir
Whangbo, Taegkeun
SENSORS, 2025, 25 (03)
[9] MACHINE LEARNING BASED TWITTER SPAM ACCOUNT DETECTION: A REVIEW
Gheewala, Shivangi
Patel, Rakesh
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2018), 2018, : 79 - 84
[10] Spam Review Detection Techniques: A Systematic Literature Review
Hussain, Naveed
Mirza, Hamid Turab
Rasool, Ghulam
Hussain, Ibrar
Kaleem, Mohammad
APPLIED SCIENCES-BASEL, 2019, 9 (05):

← 1 2 3 4 5 →