Multi-Level Sentiment Analysis of Product Reviews Based on Grammar Rules

被引:2
作者
Hien D Nguyen [1 ,2 ]
Thanh Le [1 ,2 ]
Khiem Tran [1 ,2 ]
Son T Luu [1 ,2 ]
Suong N Hoang [3 ]
Hieu T Phan [1 ,2 ]
机构
[1] Univ Informat Technol, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] Olli Technol, Ho Chi Minh City, Vietnam
来源
NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES | 2021年 / 337卷
关键词
Sentiment analysis; Sentiment Classification; Vietnamese corpus; dataset; Product Review; Grammar Rules; Multitask learning;
D O I
10.3233/FAIA210043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vietnamese is a tonal and isolated language. Its highly ambiguity makes the designing of methods for sentiment analysis being difficult. For getting the most effectiveness, the designed method has to analyze sentiment of sentences based on combining the grammar and syllable structures of Vietnamese. In this paper, a method to build a Vietnamese dataset of product reviews with many sentiment levels, including very negative, negative, neutral, positive and very positive, is proposed. This method can be scaled to a large dataset using for analyzing sentiment of product reviews. Moreover, a solution to add more grammar rules of Vietnamese into the pre-processing of sentiment analysis is also constructed. Those rules simulate the sentiment recognition of humans and help to increase the accuracy of sentiment determination. The combination of grammar rules and some methods for sentiment analysis are experimented on the Vietnamese dataset of product reviews to classify them into sentiment-levels. The testing results show that their accuracy and F-measure are improved and suitable to apply in the practical business analyzing of customer behaviors.
引用
收藏
页码:444 / 456
页数:13
相关论文
共 42 条
[41]  
Pham XT, 2020, INT CONF KNOWL SYS, P207, DOI [10.1109/kse50997.2020.9287775, 10.1109/KSE50997.2020.9287775]
[42]  
Zainuddin Nurulhuda, 2019, Advancing Technology Industrialization Through Intelligent Software Methodologies, Tools and Techniques, P284