Aspect Based Sentiment Analysis in E-Commerce User Reviews Using Latent Dirichlet Allocation (LDA) and Sentiment Lexicon

Cited by: 2
Authors
Wahyudi, Eko [1]
Kusumaningrum, Retno [1]
Affiliations
[1] Diponegoro Univ, Dept Informat, Semarang, Indonesia
Source
2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019) | 2019
Keywords
latent dirichlet allocation; sentiment analysis; e-commerce; user reviews; product quality;
DOI
10.1109/icicos48119.2019.8982522
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
User ratings of products sold on e-commerce platforms strongly influence the number of purchases. Positive ratings encourage other buyers to purchase a product, while negative ratings reduce interest in it. However, a rating does not always agree with the accompanying review, which can lead to a wrong assessment of the product. This happens because buyers also review the quality of the e-commerce delivery service, not only the product itself. To address this issue, Latent Dirichlet Allocation (LDA) can be applied to sentiment analysis of user reviews. Such analysis helps e-commerce platforms report product quality as a complement to the ratings given by users. This research evaluates the classification performance of LDA-based sentiment analysis on e-commerce user reviews, and then compares training on general data with training on per-category data. In the first iteration, the best architecture was obtained with alpha 0.001, beta 0.001, and 15 topics, reaching an accuracy of 67.5%. Using this best architecture, training data was then supplied per product review category. Combining general and per-category data increased the average accuracy by 0.82% across the three test data sets. Therefore, to obtain the best-performing classification model for sentiment analysis of user reviews, LDA should be applied with a combination of general and per-category training data.
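To make the reported hyperparameters concrete, the following is a minimal sketch of LDA fitted by collapsed Gibbs sampling with alpha 0.001, beta 0.001, and 15 topics. The abstract does not state which inference method or library the authors used, so the sampler, the function name `lda_gibbs`, and the toy English reviews are all assumptions made for illustration; the sentiment-lexicon step is not shown.

```python
import random

def lda_gibbs(docs, num_topics=15, alpha=0.001, beta=0.001, iters=50, seed=0):
    """Collapsed Gibbs sampling for LDA (illustrative, not the paper's code).

    docs: list of tokenized documents (lists of strings).
    Returns (doc_topic counts, topic_word counts, vocabulary).
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    V = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]          # tokens per (doc, topic)
    topic_word = [[0] * V for _ in range(num_topics)]     # tokens per (topic, word)
    topic_total = [0] * num_topics                        # tokens per topic
    assignments = []

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            z = rng.randrange(num_topics)
            z_d.append(z)
            doc_topic[d][z] += 1
            topic_word[z][word_id[w]] += 1
            topic_total[z] += 1
        assignments.append(z_d)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                z = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][z] -= 1
                topic_word[z][v] -= 1
                topic_total[z] -= 1
                # Unnormalized conditional probability of each topic.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][v] + beta)
                    / (topic_total[k] + V * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][v] += 1
                topic_total[z] += 1

    return doc_topic, topic_word, vocab

# Hypothetical toy reviews: two about delivery, two about product quality.
reviews = [
    ["fast", "delivery", "courier", "fast"],
    ["late", "delivery", "courier"],
    ["good", "quality", "product"],
    ["bad", "quality", "product", "broken"],
]
doc_topic, topic_word, vocab = lda_gibbs(reviews)
for d, counts in enumerate(doc_topic):
    print(d, max(range(len(counts)), key=counts.__getitem__))  # dominant topic per review
```

Very small alpha and beta, as in the reported configuration, push each review toward few topics and each topic toward few words, which is one plausible reason such values suit short reviews.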
Pages: 6