LISA: Language-Independent Method for Aspect-Based Sentiment Analysis

被引:20
作者
Shams, Mohammadreza [1 ]
Khoshavi, Navid [2 ,3 ]
Baraani-Dastjerdi, Ahmad [4 ]
机构
[1] Univ Shahreza, Dept Comp Engn, Fac Engn, Shahreza 8648141143, Iran
[2] Florida Polytech Univ, Dept Comp Sci, Lakeland, FL 33805 USA
[3] Florida Polytech Univ, Dept Elect & Comp Engn, Lakeland, FL 33805 USA
[4] Univ Isfahan, Fac Comp Engn, Dept Software Engn, Esfahan 8174673441, Iran
关键词
Aspect-based sentiment analysis; aspect extraction; polarity classification; topic modeling; RESOURCES; CLASSIFICATION; MACHINE;
D O I
10.1109/ACCESS.2020.2973587
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding "what others think" is one of the most eminent pieces of knowledge in the decision-making process required in a wide spectrum of applications. The procedure of obtaining knowledge from each aspect (property) of users' opinions is called aspect-based sentiment analysis which consists of three core sub-tasks: aspect extraction, aspect and opinion-words separation, and aspect-level polarity classification. Most successful approaches proposed in this area require a set of primary training or extensive linguistic resources, which makes them relatively costly and time consuming in different languages. To overcome the aforementioned challenges, we propose an unsupervised paradigm for aspect-based sentiment analysis, which is not only simple to use in different languages, but also holistically performs the subtasks for aspect-based sentiment analysis. Our methodology relies on three coarse-grained phases which are partitioned to manifold fine-grained operations. The first phase extracts the prior domain knowledge from dataset through selecting the preliminary polarity lexicon and aspect word sets, as representative of aspects. These two resources, as primitive knowledge, are assigned to an expectation-maximization algorithm to identify the probability of any word based on the aspect and sentiment. To determine the polarity of any aspect in the final phase, the document is firstly broken down to its constituting aspects and the probability of each aspect/polarity based on the document is calculated. To evaluate this method, two datasets in the English and Persian languages are used and the results are compared with various baselines. The experimental results show that the proposed method outperforms the baselines in terms of aspect, opinion-word extraction and aspect-level polarity classification.
引用
收藏
页码:31034 / 31044
页数:11
相关论文
共 34 条
  • [1] Arabic senti-lexicon: Constructing publicly available language resources for Arabic sentiment analysis
    Al-Moslmi, Tareq
    Albared, Mohammed
    Al-Shabi, Adel
    Omar, Nazlia
    Abdullah, Salwani
    [J]. JOURNAL OF INFORMATION SCIENCE, 2018, 44 (03) : 345 - 362
  • [2] Deep Recurrent neural network vs. support vector machine for aspect-based sentiment analysis of Arabic hotels' reviews
    Al-Smadi, Mohammad
    Qawasmeh, Omar
    Al-Ayyoub, Mahmoud
    Jararweh, Yaser
    Gupta, Brij
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 27 : 386 - 393
  • [3] [Anonymous], 2011, Comprehensive Review Of Opinion Summarization (Survey)
  • [4] [Anonymous], 2006, P 5 INT C LANG RES E
  • [5] [Anonymous], IEEE T KNOWL DATA EN
  • [6] Baccianella S, 2010, LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
  • [7] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [8] Bollacker Kurt, 2008, P 2008 ACM SIGMOD IN, P1247, DOI DOI 10.1145/1376616.1376746
  • [9] A sentiment classification model based on multiple classifiers
    Catal, Cagatary
    Nangir, Mehmet
    [J]. APPLIED SOFT COMPUTING, 2017, 50 : 135 - 141
  • [10] Experimental explorations on short text topic mining between LDA and NMF based Schemes
    Chen, Yong
    Zhang, Hui
    Liu, Rui
    Ye, Zhiwen
    Lin, Jianying
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 1 - 13