A classified feature representation three-way decision model for sentiment analysis

被引:0
作者
Jie Chen
Yue Chen
Yechen He
Yang Xu
Shu Zhao
Yanping Zhang
机构
[1] Ministry of Education,Key Laboratory of Intelligent Computing and Signal Processing
[2] Anhui University,School of Computer Science and Technology
[3] Beijing University of Posts and Telecommunications,undefined
来源
Applied Intelligence | 2022年 / 52卷
关键词
Sentiment analysis; Feature selection; A classified feature representation; Three-way decision;
D O I
暂无
中图分类号
学科分类号
摘要
Binary sentiment analysis uses sentiment dictionaries, TF-IDF, word2vec, and BERT to convert text documents such as product and movie reviews into vectors. Dimensionality reduction by feature selection can effectively reduce the complexity of sentiment analysis. Existing feature selection methods put all samples together and ignore the difference in the feature representation between different categories. For binary sentiment analysis, there are some reviews with uncertain sentiment polarity, three-way decision divides samples into positive (POS) region, negative (NEG) region, and uncertain region (UNC). The model based on the three-way decision is beneficial to process the UNC and improve the effect of binary sentiment analysis. However, how to obtain the optimal feature representation in certain regions respectively to process the uncertain samples is a challenge. In this paper, a classified feature representation three-way decision model is proposed to obtain the optimal feature representation of the positive and negative domains for sentiment analysis. In the positive domain and the negative domain, m- and n-layer feature representations are obtained. The optimal layer with the best performance is selected as the optimal feature representation. The POS region and the NEG region in the testing set are processed by the optimal feature representation, the UNC region is processed by the original feature representation. Experiments on IMDB and Amazon show that the performance of our proposed method in terms of classification accuracy in sentiment analysis is significantly higher than that of the chi-square, principal component analysis, and mutual information methods.
引用
收藏
页码:7995 / 8007
页数:12
相关论文
共 80 条
  • [1] Mehta P(2019)NICFS: A novel feature selection method applied to lexicon based sentiment analysis Intell Decis Technol 13 41-48
  • [2] Chandra S(2020)Optimal feature selection for Learning-Based algorithms for sentiment classification Cogn Comput 12 238-248
  • [3] Wang Z(2020)Efficient Feature Selection techniques for Sentiment Analysis Multimed Tools Appl 79 6313-6335
  • [4] Lin Z(2019)A new feature selection method for sentiment analysis in short text J Intell Syst 29 1122-1134
  • [5] Madasu A(2016)Classification of sentiment reviews using n-gram machine learning approach Expert Syst Appl 57 117-126
  • [6] Sivasankar E(2019)Hybrid attribute based sentiment classification of online reviews for consumer intelligence Appl Intell 49 137-149
  • [7] Kumar HMK(1997)Long Short-Term memory Neural Comput 9 1735-1780
  • [8] Harish BS(2020)Exploration of social media for sentiment analysis using deep learning Soft Comput 24 8187-8197
  • [9] Tripathy A(2020)Hypotheses analysis and assessment in counterterrorism activities: a method based on OWA and fuzzy probabilistic rough sets IEEE Trans Fuzzy Syst 28 831-845
  • [10] Agrawal A(2018)Three-way decision and granular computing Int J Approx Reason 103 107-123