Sentiment Difficulty in Aspect-Based Sentiment Analysis

被引:6
作者
Chifu, Adrian-Gabriel [1 ]
Fournier, Sebastien [1 ]
机构
[1] Univ Toulon & Var, Aix Marseille Univ, CNRS, LIS, F-13007 Marseille, France
基金
英国科研创新办公室;
关键词
sentiment analysis; aspect-based sentiment analysis; difficulty; sentiment polarity; text representation; MODEL; LSTM;
D O I
10.3390/math11224647
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Subjectivity is a key aspect of natural language understanding, especially in the context of user-generated text and conversational systems based on large language models. Natural language sentences often contain subjective elements, such as opinions and emotions, that make them more nuanced and complex. The level of detail at which the study of the text is performed determines the possible applications of sentiment analysis. The analysis can be done at the document or paragraph level, or, even more granularly, at the aspect level. Many researchers have studied this topic extensively. The field of aspect-based sentiment analysis has numerous data sets and models. In this work, we initiate the discussion around the definition of sentence difficulty in this context of aspect-based sentiment analysis. To assess and quantify the difficulty of the aspect-based sentiment analysis, we conduct an experiment using three data sets: "Laptops", "Restaurants", and "MTSC" (Multi-Target-dependent Sentiment Classification), along with 21 learning models from scikit-learn. We also use two textual representations, TF-IDF (Terms frequency-inverse document frequency) and BERT (Bidirectional Encoder Representations from Transformers), to analyze the difficulty faced by these models in performing aspect-based sentiment analysis. Additionally, we compare the models with a fine-tuned version of BERT on the three data sets. We identify the most challenging sentences using a combination of classifiers in order to better understand them. We propose two strategies for defining sentence difficulty. The first strategy is binary and considers sentences as difficult when the classifiers are unable to correctly assign the sentiment polarity. The second strategy uses a six-level difficulty scale based on how many of the top five best-performing classifiers can correctly identify sentiment polarity. These sentences with assigned difficulty classes are then used to create predictive models for early difficulty detection. The purpose of estimating the difficulty of aspect-based sentiment analysis is to enhance performance while minimizing resource usage.
引用
收藏
页数:33
相关论文
共 71 条
[1]  
Ahmad M, 2018, INT J ADV COMPUT SC, V9, P182
[2]  
Ahmed Afrin, 2021, Proceedings of International Conference on Trends in Computational and Cognitive Engineering. Proceedings of TCCE 2020. Advances in Intelligent Systems and Computing (AISC 1309), P181, DOI 10.1007/978-981-33-4673-4_16
[3]  
[Anonymous], 2009, WebDB
[4]   BiasFinder: Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems [J].
Asyrofi, Muhammad Hilmi ;
Yang, Zhou ;
Yusuf, Imam Nur Bani ;
Kang, Hong Jin ;
Thung, Ferdian ;
Lo, David .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (12) :5087-5101
[5]   ABCDM: An Attention-based Bidirectional CNN-RNN Deep Model for sentiment analysis [J].
Basiri, Mohammad Ehsan ;
Nemati, Shahla ;
Abdar, Moloud ;
Cambria, Erik ;
Acharya, U. Rajendra .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 115 :279-294
[6]   Sentiment analysis: A survey on design framework, applications and future scopes [J].
Bordoloi, Monali ;
Biswas, Saroj Kumar .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (11) :12505-12560
[7]   A Survey on Aspect-Based Sentiment Classification [J].
Brauwers, Gianni ;
Frasincar, Flavius .
ACM COMPUTING SURVEYS, 2023, 55 (04)
[8]  
Brown TB, 2020, ADV NEUR IN, V33
[9]   Knowledge-Based Approaches to Concept-Level Sentiment Analysis INTRODUCTION [J].
Cambria, Erik ;
Schuller, Bjoern ;
Liu, Bing ;
Wang, Haixun ;
Havasi, Catherine .
IEEE INTELLIGENT SYSTEMS, 2013, 28 (02) :12-14
[10]  
Carmel D, 2010, SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, P911