LISA: Language-Independent Method for Aspect-Based Sentiment Analysis

被引:20
作者
Shams, Mohammadreza [1 ]
Khoshavi, Navid [2 ,3 ]
Baraani-Dastjerdi, Ahmad [4 ]
机构
[1] Univ Shahreza, Dept Comp Engn, Fac Engn, Shahreza 8648141143, Iran
[2] Florida Polytech Univ, Dept Comp Sci, Lakeland, FL 33805 USA
[3] Florida Polytech Univ, Dept Elect & Comp Engn, Lakeland, FL 33805 USA
[4] Univ Isfahan, Fac Comp Engn, Dept Software Engn, Esfahan 8174673441, Iran
关键词
Aspect-based sentiment analysis; aspect extraction; polarity classification; topic modeling; RESOURCES; CLASSIFICATION; MACHINE;
D O I
10.1109/ACCESS.2020.2973587
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding "what others think" is one of the most eminent pieces of knowledge in the decision-making process required in a wide spectrum of applications. The procedure of obtaining knowledge from each aspect (property) of users' opinions is called aspect-based sentiment analysis which consists of three core sub-tasks: aspect extraction, aspect and opinion-words separation, and aspect-level polarity classification. Most successful approaches proposed in this area require a set of primary training or extensive linguistic resources, which makes them relatively costly and time consuming in different languages. To overcome the aforementioned challenges, we propose an unsupervised paradigm for aspect-based sentiment analysis, which is not only simple to use in different languages, but also holistically performs the subtasks for aspect-based sentiment analysis. Our methodology relies on three coarse-grained phases which are partitioned to manifold fine-grained operations. The first phase extracts the prior domain knowledge from dataset through selecting the preliminary polarity lexicon and aspect word sets, as representative of aspects. These two resources, as primitive knowledge, are assigned to an expectation-maximization algorithm to identify the probability of any word based on the aspect and sentiment. To determine the polarity of any aspect in the final phase, the document is firstly broken down to its constituting aspects and the probability of each aspect/polarity based on the document is calculated. To evaluate this method, two datasets in the English and Persian languages are used and the results are compared with various baselines. The experimental results show that the proposed method outperforms the baselines in terms of aspect, opinion-word extraction and aspect-level polarity classification.
引用
收藏
页码:31034 / 31044
页数:11
相关论文
共 34 条
  • [11] Chen ZY, 2014, PR MACH LEARN RES, V32, P703
  • [12] Clematide S., 2010, Proceedings of WASSA, P7
  • [13] W2VLDA: Almost unsupervised system for Aspect Based Sentiment Analysis
    Garcia-Pablos, Aitor
    Cuadros, Montse
    Rigau, German
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 91 : 127 - 137
  • [14] Big Data Software Engineering: Analysis of Knowledge Domains and Skill Sets Using LDA-Based Topic Modeling
    Gurcan, Fatih
    Cagiltay, Nergiz Ercil
    [J]. IEEE ACCESS, 2019, 7 : 82541 - 82552
  • [15] Probabilistic latent semantic indexing
    Hofmann, T
    [J]. SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 50 - 57
  • [16] Hu MQ, 2004, PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, P755
  • [17] Automatic construction of domain-specific sentiment lexicon based on constrained label propagation
    Huang, Sheng
    Niu, Zhendong
    Shi, Chongyang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2014, 56 : 191 - 200
  • [18] Combining resources to improve unsupervised sentiment analysis at aspect-level
    Jimenez-Zafra, Salud M.
    Teresa Martin-Valdivia, M.
    Martinez-Camara, Eugenio
    Alfonso Urena-Lopez, L.
    [J]. JOURNAL OF INFORMATION SCIENCE, 2016, 42 (02) : 213 - 229
  • [19] Jo Y., 2011, P 4 ACM INT C WEB SE, P815, DOI DOI 10.1145/1935826.1935932
  • [20] Kamps J., 2004, In Lrec, V4, P1115