How do we talk about doctors and drugs? Sentiment analysis in forums expressing opinions for medical domain

被引:37
作者
Maria Jimenez-Zafra, Salud [1 ]
Teresa Martin-Valdivia, M. [1 ]
Dolores Molina-Gonzalez, M. [1 ]
Alfonso Urena-Lopez, L. [1 ]
机构
[1] Univ Jaen, Dept Comp Sci, Adv Studies Ctr Informat & Commun Technol CEATIC, Campus Las Lagunillas, E-23071 Jaen, Spain
关键词
Spanish corpus; Patient opinions; Medical domain; Sentiment analysis; SEMANTIC ORIENTATION; SOCIAL MEDIA; CLASSIFICATION;
D O I
10.1016/j.artmed.2018.03.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: The main goal of this study is to examine how people express their opinion in medical forums. We analyze the language used in order to determine the best way to tackle sentiment analysis in this domain. Methods: We have applied supervised learning and lexicon-based sentiment analysis approaches over two different corpora extracted from social web. Specifically, we have focused on two aspects: drugs and doctors. We have selected two forums and we have collected corpora for each one: (i) DOS, a Spanish corpus of drug reviews and (ii) COPOS, a Spanish corpus of patients' opinions about physicians. Results: The classification results show that drug reviews are more difficult to classify than those about physicians. In order to understand the difference in the results, we have studied the linguistic features of both corpora. Conclusions: Although opinions about physicians and drugs are written in most cases by non-professional users, reviews about physicians are characterized by the use of an informal language while reviews about drugs are characterized by a combination of informal language with specific terminology (e.g. adverse effects, drug names) with greater lexical diversity, making the task of sentiment analysis difficult. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:50 / 57
页数:8
相关论文
共 4 条
  • [1] How do tweeters feel about scientific misinformation: an infoveillance sentiment analysis of tweets on retraction notices and retracted papers
    Amiri, Mahsa
    Yaghtin, Maryam
    Sotudeh, Hajar
    SCIENTOMETRICS, 2024, 129 (01) : 261 - 287
  • [2] How People With a Bipolar Disorder Diagnosis Talk About Personal Recovery in Peer Online Support Forums: Corpus Framework Analysis Using the POETIC Framework
    Jagfeld, Glorianna
    Lobban, Fiona
    Humphreys, Chloe
    Rayson, Paul
    Jones, Steven Huntley
    JMIR MEDICAL INFORMATICS, 2023, 11
  • [3] How do tweeters feel about scientific misinformation: an infoveillance sentiment analysis of tweets on retraction notices and retracted papers
    Mahsa Amiri
    Maryam Yaghtin
    Hajar Sotudeh
    Scientometrics, 2024, 129 : 261 - 287
  • [4] How Do People View COVID-19 Vaccines: Analyses on Tweets About COVID-19 Vaccines Using Natural Language Processing and Sentiment Analysis
    Chang, Victor
    Ng, Chun Yu
    Xu, Qianwen Ariel
    Guizani, Mohsen
    Hossain, M. A.
    JOURNAL OF GLOBAL INFORMATION MANAGEMENT, 2022, 30 (10)