Sentiment Analysis of User Reviews Integrating Margin Sampling and Tri-training

被引:0
作者
Jiang Y. [1 ]
Zhang T. [1 ]
Xia Z. [1 ]
Li Y. [2 ]
Zhang Z. [1 ]
机构
[1] College of Information Management, Nanjing Agricultural University, Nanjing
[2] College of Artificial Intelligence, Nanjing Agricultural University, Nanjing
关键词
Margin Sampling; Sentiment Analysis; Tri-Training; User Reviews;
D O I
10.11925/infotech.2096-3467.2023.0519
中图分类号
学科分类号
摘要
[Objective] This paper proposes a sentiment analysis method for user reviews integrating margin sampling and tri-training. It addresses the issues of the large volume of user reviews, ambiguous sentiment tendencies, and short content. [Methods] First, we constructed a multi-class support vector machine based on a one-vs-all decomposition strategy. Then, we integrated a margin sampling strategy considering cosine similarity to create an initial set. Finally, we proposed a Tri-training algorithm combining a soft voting mechanism. [Results] The proposed algorithm improved the voting mechanism in the Tri-training algorithm, which further reduced the probability of misjudgment in sample classification by multiple classifiers. All categories achieved precision rates above 79%. [Limitations] The proposed method does not consider extracting information from multimedia data. [Conclusions] Compared with traditional and recently improved semi-supervised learning algorithms, the proposed algorithm demonstrates classification accuracy and efficiency superiority. © 2024 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:102 / 112
页数:10
相关论文
共 38 条
  • [1] The“14th Five-Year”National Informatization Planning
  • [2] Zhou Jian, Liu Yanbao, Liu Jiajia, Exploration of Intellectual Structure and Hot Issues in Sentiment Analysis Research, Journal of the China Society for Scientific and Technical Information, 39, 1, pp. 111-124, (2020)
  • [3] Liu Tong, Liu Chen, Ni Weijian, A Semi-Supervised Sentiment Analysis Method for Chinese Based on Multi-Level Data Augmentation, Data Analysis and Knowledge Discovery, 5, 5, pp. 51-58, (2021)
  • [4] Li Lei, Song Jianwei, Liu Ji, Analyzing the Effect of Reputation Based on Sentiment Analysis of Online Comment Texts, Chinese Journal of Management, 17, 4, pp. 583-591, (2020)
  • [5] Ma Fengcai, Li Chunyue, Research on E-Commerce Consumer Satisfaction Measurement of Fresh Products—Analysis Based on Online Reviews of JD Fresh, Price: Theory & Practice, 5, pp. 117-120, (2020)
  • [6] Liu Yulin, Jian Lirong, Data Mining of E-Commerce Online Reviews Based on Sentiment Analysis, Statistics & Information Forum, 33, 12, pp. 119-124, (2018)
  • [7] Lu Weicong, Xu Jian, Sentiment Analysis of Network Users’ Reviews Based on Bipartite Network, Information Studies: Theory & Application, 41, 2, pp. 121-126, (2018)
  • [8] Chang C H, Hwang S Y, Wu M L., Learning Bilingual Sentiment Lexicon for Online Reviews, Electronic Commerce Research and Applications, 47, (2021)
  • [9] Zhang J, Lu X C, Liu D., Deriving Customer Preferences for Hotels Based on Aspect-Level Sentiment Analysis of Online Reviews, Electronic Commerce Research and Applications, 49, (2021)
  • [10] Li H, Chen Q, Zhong Z, Et al., E-Word of Mouth Sentiment Analysis for User Behavior Studies, Information Processing and Management, 59, 1, (2022)