Comprehending E-commerce product reviews: a sentiment analysis approach

被引:0
作者
Chugh, Mitali [1 ]
Vishwakarma, Anushtha [1 ]
Gupta, Pranjali [1 ]
Garg, Anmol [1 ]
机构
[1] UPES, Dehra Dun, Uttarakhand, India
关键词
E-commerce reviews; Web scraping; Random forest classifier; Machine learning classification; Polarity classification; Natural language processing (NLP);
D O I
10.1007/s13748-025-00382-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, with the advancement of Internet technology, users have increasingly opted for online shopping as a convenient and preferred mode of buying and using products. To enhance user satisfaction, Sentiment Analysis (SA) is performed on many user reviews on e-commerce platforms. However, accurately predicting the sentiment polarities of user reviews is still challenging due to variations in sequence length, textual order, and complex logic. In this paper, by employing sentiment analysis through TextBlob, the study categorized analyzed tweets and Amazon reviews positive and negative sentiments. The SA process has four main steps: (i) Gathering data (DC), (ii) cleaning and preparing the data, (iii) extracting features (FE) or assigning weights to terms (TW), selecting relevant features (FS), and iv) classifying the polarity or sentiments (SC) of the data. Initially, the Web Scrapping Tool (WST) was used to extract customer reviews from E-commerce websites. The data taken amounted to 9872 tweets, then pre-processed and went through the TF-IDF process. 80% of the data are used for training, and 20% are used for testing, which is then classified using Random Forest, which classifies customer reviews' sentiment as positive and negative. Based on the test results, the values were calculated and obtained, and they were as follows: 89.93% accuracy, 10.06% error, 92.05% precision, 89.18% recall, and 90.59% F1-score. Finally, the data results, with 68.7% positive reviews, suggest that Amazon's dataset benefits from prebuilt security against abusive language.
引用
收藏
页数:13
相关论文
共 51 条
[11]  
Hidayah I, 2019, P 2019 5 INT C SCI T, P41
[12]  
Ismet HT., 2022, Sci. J. Inform, V9, P73
[13]  
Jadhav H.B., INT C COMP NETW BIG
[14]   Interaction between extrinsic and intrinsic online review cues: perspectives from cue utilization theory [J].
Kakaria, Shobhit ;
Simonetti, Aline ;
Bigne, Enrique .
ELECTRONIC COMMERCE RESEARCH, 2024, 24 (04) :2469-2497
[15]  
Karani KP., 2020, Int. J. Appl. Eng. Manag. Lett, V4, P2581
[16]  
Karim Mirsa, 2018, IOP Conference Series: Materials Science and Engineering, V396, DOI 10.1088/1757-899X/396/1/012020
[17]  
Kawale R., 2024, J. Oper Syst. Dev. Trends, V11, P25
[18]   Sentiment analysis and opinion mining applied to scientific paper reviews [J].
Keith Norambuena, Brian ;
Fuentes Lettura, Exequiel ;
Meneses Villegas, Claudio .
INTELLIGENT DATA ANALYSIS, 2019, 23 (01) :191-214
[19]   Sentiment and Context-Aware Hybrid DNN With Attention for Text Sentiment Classification [J].
Khan, Jawad ;
Ahmad, Niaz ;
Khalid, Shah ;
Ali, Farman ;
Lee, Youngmoon .
IEEE ACCESS, 2023, 11 :28162-28179
[20]   DNNRec: A novel deep learning based hybrid recommender system [J].
Kiran, R. ;
Kumar, Pradeep ;
Bhasker, Bharat .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 144