Explainable Sentiment Analysis: A Hierarchical Transformer-Based Extractive Summarization Approach

被引:18
作者
Bacco, Luca [1 ,2 ]
Cimino, Andrea [2 ]
Dell'Orletta, Felice [2 ]
Merone, Mario [1 ]
机构
[1] Univ Campus Biomed Roma, Unit Comp Syst & Bioinformat, Dept Engn, I-00128 Rome, Italy
[2] Ist Linguist Computaz Antonio Zampolli ILC CNR, ItaliaNLP Lab, I-56124 Pisa, Italy
关键词
sentiment analysis; explainability; hierarchical transformers; extractive summarization;
D O I
10.3390/electronics10182195
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the explainable artificial intelligence (XAI) paradigm is gaining wide research interest. The natural language processing (NLP) community is also approaching the shift of paradigm: building a suite of models that provide an explanation of the decision on some main task, without affecting the performances. It is not an easy job for sure, especially when very poorly interpretable models are involved, like the almost ubiquitous (at least in the NLP literature of the last years) transformers. Here, we propose two different transformer-based methodologies exploiting the inner hierarchy of the documents to perform a sentiment analysis task while extracting the most important (with regards to the model decision) sentences to build a summary as the explanation of the output. For the first architecture, we placed two transformers in cascade and leveraged the attention weights of the second one to build the summary. For the other architecture, we employed a single transformer to classify the single sentences in the document and then combine the probability scores of each to perform the classification and then build the summary. We compared the two methodologies by using the IMDB dataset, both in terms of classification and explainability performances. To assess the explainability part, we propose two kinds of metrics, based on benchmarking the models' summaries with human annotations. We recruited four independent operators to annotate few documents retrieved from the original dataset. Furthermore, we conducted an ablation study to highlight how implementing some strategies leads to important improvements on the explainability performance of the cascade transformers model.
引用
收藏
页数:19
相关论文
共 63 条
[31]  
Letarte G., 2018, P 2018 EMNLP WORKSHO, P267, DOI DOI 10.18653/V1/W18-5429
[32]  
Liu B, 2011, DATA CENTRIC SYST AP, P459, DOI 10.1007/978-3-642-19460-3_11
[33]  
Liu Yinhan, 2019, CoRR, DOI [DOI 10.48550/ARXIV.1907.11692, 10.48550/arXiv.1907.11692]
[34]   Black-Box vs. White-Box: Understanding Their Advantages and Weaknesses From a Practical Point of View [J].
Loyola-Gonzalez, Octavio .
IEEE ACCESS, 2019, 7 :154096-154113
[35]  
Maas Andrew, 2011, P 49 ANN M ASS COMP, P142
[36]  
McGlohon M., 2010, P 4 INT C WEBL SOC M
[37]   A Survey on Transfer Learning [J].
Pan, Sinno Jialin ;
Yang, Qiang .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (10) :1345-1359
[38]  
Pappagari R, 2019, 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), P838, DOI [10.1109/ASRU46091.2019.9003958, 10.1109/asru46091.2019.9003958]
[39]   Zero-Shot Learning for Cross-Lingual News Sentiment Classification [J].
Pelicon, Andraz ;
Pranjic, Marko ;
Miljkovic, Dragana ;
Skrlj, Blaz ;
Pollak, Senja .
APPLIED SCIENCES-BASEL, 2020, 10 (17)
[40]  
Radford A., 2019, Language Models are Few-Shot Learners