Sentiment Analysis and Fake Amazon Reviews Classification Using SVM Supervised Machine Learning Model

Cited by: 5

Authors:

Tabany, Myasar [1]

Gueffal, Meriem [1]

Affiliations:

[1] Univ Hertfordshire, Networks Secur & Syst Res Grp, Sch Phys Engn & Comp Sci, Hatfield AL10 9AB, Herts, England

Source:

JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY | 2024 / Vol. 15 / No. 01

Keywords:

fake reviews detection; sentiment analysis; natural language processing; Machine Learning (ML) supervised learning; SPAM;

DOI:

10.12720/jait.15.1.49-58

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Subject classification code:

0812

Abstract:

This project conducts sentiment analysis of short and long Amazon reviews and reports the effect of review length on a supervised Support Vector Machine (SVM) model, as a bridge to fake review classification. First, the SVM model was evaluated against Naive Bayes, Logistic Regression, and Random Forest models and proved superior (the second assumption) on accuracy (70%), precision (63%), recall (70%), and F1-score (62%). Hyperparameter tuning then improved the SVM model for sentiment analysis (accuracy of 93%), and altering the review length changed the model's performance, which validated that review length affects the classifier (the first assumption). Second, fake review classification on the fake reviews dataset yielded 88% accuracy, while the merged subsets of the two datasets yielded 84% accuracy.
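The abstract reports four metrics without stating how the per-class scores were averaged. As an illustration only, the following minimal sketch computes accuracy together with support-weighted precision, recall, and F1 in plain Python. The weighted averaging, the function name `weighted_metrics`, and the toy labels are assumptions made for this example, not details taken from the paper.

```python
from collections import Counter


def weighted_metrics(y_true, y_pred):
    """Return accuracy and support-weighted precision, recall, and F1.

    Support-weighted averaging is an assumption for illustration; the
    abstract does not say which averaging scheme the authors used.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal length")

    n = len(y_true)
    support = Counter(y_true)      # number of true samples per class
    predicted = Counter(y_pred)    # number of predictions per class
    true_pos = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    accuracy = sum(true_pos.values()) / n
    precision = recall = f1 = 0.0
    for label, count in support.items():
        tp = true_pos[label]
        p = tp / predicted[label] if predicted[label] else 0.0
        r = tp / count
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        weight = count / n         # class weight = share of true samples
        precision += weight * p
        recall += weight * r
        f1 += weight * f

    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical toy labels, not data from the paper.
y_true = ["pos", "pos", "pos", "neg", "neg", "neu"]
y_pred = ["pos", "pos", "neg", "neg", "pos", "pos"]
scores = weighted_metrics(y_true, y_pred)
```

Under support-weighted averaging, recall always equals accuracy. That would be consistent with the identical 70% figures in the abstract, although the abstract does not confirm which averaging was used.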

Pages: 49-58

Page count: 10
