Comprehending E-commerce product reviews: a sentiment analysis approach

Cited by: 0

Authors:

Chugh, Mitali [1]

Vishwakarma, Anushtha [1]

Gupta, Pranjali [1]

Garg, Anmol [1]

Affiliation:

[1] UPES, Dehra Dun, Uttarakhand, India

Source:

PROGRESS IN ARTIFICIAL INTELLIGENCE | 2025

Keywords:

E-commerce reviews; Web scraping; Random forest classifier; Machine learning classification; Polarity classification; Natural language processing (NLP);

DOI:

10.1007/s13748-025-00382-z

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recently, with the advancement of Internet technology, users have increasingly opted for online shopping as a convenient and preferred way of buying and using products. To enhance user satisfaction, Sentiment Analysis (SA) is performed on the many user reviews posted on e-commerce platforms. However, accurately predicting the sentiment polarity of user reviews remains challenging due to variations in sequence length, textual order, and complex logic. In this paper, sentiment analysis with TextBlob is used to categorize tweets and Amazon reviews into positive and negative sentiments. The SA process has four main steps: (i) gathering data (DC), (ii) cleaning and preparing the data, (iii) extracting features (FE) or assigning weights to terms (TW) and selecting relevant features (FS), and (iv) classifying the polarity or sentiment (SC) of the data. Initially, a Web Scraping Tool (WST) was used to extract customer reviews from e-commerce websites. The collected data amounted to 9872 tweets, which were pre-processed and then passed through the TF-IDF process. 80% of the data is used for training and 20% for testing; the data is then classified using Random Forest, which labels the sentiment of customer reviews as positive or negative. The test results were as follows: 89.93% accuracy, 10.06% error, 92.05% precision, 89.18% recall, and 90.59% F1-score. Finally, the results, with 68.7% positive reviews, suggest that Amazon's dataset benefits from prebuilt security against abusive language.
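
The reported F1-score can be checked against the reported precision and recall, since F1 is their harmonic mean. The following is a minimal sketch, not the authors' code: the function names `f1_score` and `metrics_from_counts` and the confusion-matrix counts in the usage lines are hypothetical and chosen only for illustration. Only the percentages 89.93, 92.05, and 89.18 come from the abstract.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same scale in, same scale out)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def metrics_from_counts(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Derive the metrics named in the abstract from binary confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return {
        "accuracy": accuracy,
        "error": 1.0 - accuracy,  # error rate is the complement of accuracy
        "precision": precision,
        "recall": recall,
        "f1": f1_score(precision, recall),
    }


# Values reported in the abstract, in percent.
reported_accuracy = 89.93
reported_precision = 92.05
reported_recall = 89.18

print(round(f1_score(reported_precision, reported_recall), 2))  # 90.59
print(round(100 - reported_accuracy, 2))                        # 10.07

# Hypothetical counts, for illustration only (not taken from the paper).
print(metrics_from_counts(tp=8, fp=2, fn=1, tn=9))
```

The recomputed F1-score agrees with the reported 90.59%. The complement of the rounded accuracy is 10.07%, so the reported 10.06% error presumably comes from unrounded values.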


Pages: 13
