A classified feature representation three-way decision model for sentiment analysis

被引:0
作者
Jie Chen
Yue Chen
Yechen He
Yang Xu
Shu Zhao
Yanping Zhang
机构
[1] Ministry of Education,Key Laboratory of Intelligent Computing and Signal Processing
[2] Anhui University,School of Computer Science and Technology
[3] Beijing University of Posts and Telecommunications,undefined
来源
Applied Intelligence | 2022年 / 52卷
关键词
Sentiment analysis; Feature selection; A classified feature representation; Three-way decision;
D O I
暂无
中图分类号
学科分类号
摘要
Binary sentiment analysis uses sentiment dictionaries, TF-IDF, word2vec, and BERT to convert text documents such as product and movie reviews into vectors. Dimensionality reduction by feature selection can effectively reduce the complexity of sentiment analysis. Existing feature selection methods put all samples together and ignore the difference in the feature representation between different categories. For binary sentiment analysis, there are some reviews with uncertain sentiment polarity, three-way decision divides samples into positive (POS) region, negative (NEG) region, and uncertain region (UNC). The model based on the three-way decision is beneficial to process the UNC and improve the effect of binary sentiment analysis. However, how to obtain the optimal feature representation in certain regions respectively to process the uncertain samples is a challenge. In this paper, a classified feature representation three-way decision model is proposed to obtain the optimal feature representation of the positive and negative domains for sentiment analysis. In the positive domain and the negative domain, m- and n-layer feature representations are obtained. The optimal layer with the best performance is selected as the optimal feature representation. The POS region and the NEG region in the testing set are processed by the optimal feature representation, the UNC region is processed by the original feature representation. Experiments on IMDB and Amazon show that the performance of our proposed method in terms of classification accuracy in sentiment analysis is significantly higher than that of the chi-square, principal component analysis, and mutual information methods.
引用
收藏
页码:7995 / 8007
页数:12
相关论文
共 80 条
  • [11] Rath SK(2020)Variable-precision three-way concepts in L-contexts Int J Approx Reason 130 107-125
  • [12] Barkha B(2020)Local temporal-spatial multi-granularity learning for sequential three-way granular computing Inf Sci 541 75-97
  • [13] Sangeet S(2020)On modeling similarity and three-way decision under incomplete information in rough set theory Knowl Based Syst 191 105251-325
  • [14] Hochreiter S(2020)Partial-overall dominance three-way decision models in interval-valued decision systems Int J Approx Reason 126 308-808
  • [15] Schmidhuber J(2020)Fuzzy neighborhood covering for three-way classification Inf Sci 507 795-78
  • [16] Chen L-C(2017)Cost-sensitive sequential three-way decision modeling using a deep neural network Int J Approx Reason 85 68-30
  • [17] Lee C-M(2018)Three-way decisions based on neutrosophic sets and AHP-QFD framework for supplier selection problem Futur Gener Comput Syst 89 19-144
  • [18] Chen M-Y(2018)An oversampling method for imbalance data based on Three-Way decision model Acta Electron Sin 46 135-1848
  • [19] Fujita H(2019)Resilience analysis of critical infrastructures: a cognitive approach based on granular computing IEEE Trans Cybern 49 1835-188
  • [20] Gaeta A(2017)A unified model of sequential three-way decisions and multilevel incremental processing Knowl-Based Syst 134 172-92