An Italian Lexicon-based Sentiment Analysis approach for medical applications

被引:1
作者
Martinis, Maria Chiara [1 ]
Zucco, Chiara [1 ]
Cannataro, Mario [1 ]
机构
[1] Magna Graecia Univ Catanzaro, Data Analyt Res Ctr, Dept Med & Surg Sci, Catanzaro, Italy
来源
13TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, BCB 2022 | 2022年
关键词
Sentiment Analysis; Lexicon-based approaches; Healthcare applications; VADER; VADER-IT;
D O I
10.1145/3535508.3545594
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis aims at extracting opinions and or emotions mainly from written text. The most popular problem in sentiment analysis certainly is polarity detection, which falls into the broader class of Natural Language Processing (NLP) problems of text classification. To date, state-of-the-art approaches to text classification use neural language models built on popular architectures such as Transformers. However, these approaches are difficult to apply in low-resource languages and domains, as for instance the Italian language or small clinical trials. Motivated by this, this paper presents VADER-IT, a lexicon-based algorithm for polarity prediction in written text, that is an adaptation to the Italian language of the popular VADER. Unlike VADER, our system also predicts a polarity class (i.e. positive, negative or neutral). The system was tested on a dataset of 5495 healthcare related reviews from QSalute https://www.qsalute.it/, reaching a micro averaged F1-score = 81% and a micro averaged Jaccard - score = 73%.
引用
收藏
页数:4
相关论文
共 21 条
  • [1] Bacco L., 2020, Computational Linguistics CLiCit 2020, V630, P16
  • [2] Basile V., 2013, Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, P100
  • [3] Bradley M. M., 1999, AFFECTIVE NORMS ENGL
  • [4] Lexicon-Based vs. Bert-Based Sentiment Analysis: A Comparative Study in Italian
    Catelli, Rosario
    Pelosi, Serena
    Esposito, Massimo
    [J]. ELECTRONICS, 2022, 11 (03)
  • [5] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [6] Sentiment Analysis of Twitter Data
    El Rahman, Sahar A.
    AlOtaibi, Feddah Alhumaidi
    AlShehri, Wejdan Abdullah
    [J]. 2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 336 - 339
  • [7] Elbagir Shihab, 2019, P INT MULTICONFERENC, V122
  • [8] Huang KX, 2020, Arxiv, DOI arXiv:1904.05342
  • [9] Hutto C., 2015, P 8 INT AAAI C WEBL, P216
  • [10] AMMU: A survey of transformer-based biomedical pretrained language models
    Kalyan, Katikapalli Subramanyam
    Rajasekharan, Ajit
    Sangeetha, Sivanesan
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 126