Sentiment Difficulty in Aspect-Based Sentiment Analysis

被引:6
作者
Chifu, Adrian-Gabriel [1 ]
Fournier, Sebastien [1 ]
机构
[1] Univ Toulon & Var, Aix Marseille Univ, CNRS, LIS, F-13007 Marseille, France
基金
英国科研创新办公室;
关键词
sentiment analysis; aspect-based sentiment analysis; difficulty; sentiment polarity; text representation; MODEL; LSTM;
D O I
10.3390/math11224647
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Subjectivity is a key aspect of natural language understanding, especially in the context of user-generated text and conversational systems based on large language models. Natural language sentences often contain subjective elements, such as opinions and emotions, that make them more nuanced and complex. The level of detail at which the study of the text is performed determines the possible applications of sentiment analysis. The analysis can be done at the document or paragraph level, or, even more granularly, at the aspect level. Many researchers have studied this topic extensively. The field of aspect-based sentiment analysis has numerous data sets and models. In this work, we initiate the discussion around the definition of sentence difficulty in this context of aspect-based sentiment analysis. To assess and quantify the difficulty of the aspect-based sentiment analysis, we conduct an experiment using three data sets: "Laptops", "Restaurants", and "MTSC" (Multi-Target-dependent Sentiment Classification), along with 21 learning models from scikit-learn. We also use two textual representations, TF-IDF (Terms frequency-inverse document frequency) and BERT (Bidirectional Encoder Representations from Transformers), to analyze the difficulty faced by these models in performing aspect-based sentiment analysis. Additionally, we compare the models with a fine-tuned version of BERT on the three data sets. We identify the most challenging sentences using a combination of classifiers in order to better understand them. We propose two strategies for defining sentence difficulty. The first strategy is binary and considers sentences as difficult when the classifiers are unable to correctly assign the sentiment polarity. The second strategy uses a six-level difficulty scale based on how many of the top five best-performing classifiers can correctly identify sentiment polarity. These sentences with assigned difficulty classes are then used to create predictive models for early difficulty detection. The purpose of estimating the difficulty of aspect-based sentiment analysis is to enhance performance while minimizing resource usage.
引用
收藏
页数:33
相关论文
共 71 条
[51]  
Mutlu MM, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, P467
[52]  
Pennington Jeffrey, 2014, P 2014 EMNLP ACL, P1532, DOI DOI 10.3115/V1/D14-1162
[53]  
Pontiki M, 2014, P INT WORKSHOP SEMAN, DOI [10.3115/v1/s14-2004, 10.3115/v1/S14-2004, DOI 10.3115/V1/S14-2004]
[54]   A Hybrid CNN-LSTM Model for Improving Accuracy of Movie Reviews Sentiment Analysis [J].
Rehman, Anwar Ur ;
Malik, Ahmad Kamran ;
Raza, Basit ;
Ali, Waqar .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) :26597-26613
[55]   Ensemble learning: A survey [J].
Sagi, Omer ;
Rokach, Lior .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (04)
[56]   VECTOR-SPACE MODEL FOR AUTOMATIC INDEXING [J].
SALTON, G ;
WONG, A ;
YANG, CS .
COMMUNICATIONS OF THE ACM, 1975, 18 (11) :613-620
[57]   Query association surrogates for Web search [J].
Scholer, F ;
Williams, HE ;
Turpin, A .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (07) :637-650
[58]   Predicting Query Performance by Query-Drift Estimation [J].
Shtok, Anna ;
Kurland, Oren ;
Carmel, David ;
Raiber, Fiana ;
Markovits, Gad .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2012, 30 (02)
[59]  
Tao Yongquan, 2014, CIKM, P1891
[60]   A systematic review of social network sentiment analysis with comparative study of ensemble-based techniques [J].
Tiwari, Dimple ;
Nagpal, Bharti ;
Bhati, Bhoopesh Singh ;
Mishra, Ashutosh ;
Kumar, Manoj .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (11) :13407-13461