TA-WHI : Text Analysis of Web-Based Health Information

被引:0
作者
Bagla, Piyush [1 ]
Kumar, Kuldeep [2 ]
机构
[1] Dr B R Ambedkar Natl Inst Technol, Jalandhar, India
[2] Natl Inst Technol Kurukshetra, Kurukshetra, India
来源
INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI | 2023年 / 15卷 / 01期
关键词
Credibility; Data Mining; Health information; Machine Learning; NLP; Online Health; Text Mining; WHI; MEDIA;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The healthcare data available on social media has exploded in recent years. The cures and treatments suggested by non-medical experts can lead to more damage than expected. Assuring the credibility of the information conveyed is an enormous challenge. This study aims to categorize the credibility of online health information into multiple classes. This paper proposes a model named Text Analysis of Web-based Health Information (TA-WHI), based on an algorithm designed for this. It categorizes health-related social media feeds into five categories: sufficient, fabricated, meaningful, advertisement, and misleading. The authors have created their own labeled dataset for this model. For data cleaning, they have designed a dictionary having nouns, adverbs, adjectives, negative words, positive words, and medical terms named MeDF. Using polarity and conditional procedure, the data is ranked and classified into multiple classes. The authors evaluate the performance of the model using deep-learning classifiers such as CNN, LSTM, and CatBoost. The suggested model has attained an accuracy of 98% with CatBoost.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [1] Misinformation about COVID-19: evidence for differential latent profiles and a strong association with trust in science
    Agley, Jon
    Xiao, Yunyu
    [J]. BMC PUBLIC HEALTH, 2021, 21 (01)
  • [2] Ahmed AA, 2021, Arxiv, DOI [arXiv:2102.04458, DOI 10.48550/ARXIV.2102.04458, 10.48550/arXiv.2102.04458]
  • [3] Hybrid Approach for Sentiment Analysis of Twitter Posts Using a Dictionary-based Approach and Fuzzy Logic Methods: Study Case on Cloud Service Providers
    Alharbi, Jamilah Rabeh
    Alhalabi, Wadee S.
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2020, 16 (01) : 116 - 145
  • [4] Credibility in Online Social Networks: A Survey
    Alrubaian, Majed
    Al-Qurishi, Muhammad
    Alamri, Atif
    Al-Rakhami, Mabrook
    Hassan, Mohammad Mehedi
    Fortino, Giancarlo
    [J]. IEEE ACCESS, 2019, 7 : 2828 - 2855
  • [5] Andrea K., 2022, TEXT ANAL NLP HEALTH
  • [6] [Anonymous], 2022, The Washington Post
  • [7] Automatic Fact-Checking Using Context and Discourse Information
    Atanasova, Pepa
    Nakov, Preslav
    Marquez, Lluis
    Barron-Cedeno, Alberto
    Karadzhov, Georgi
    Mihaylova, Tsvetomila
    Mohtarami, Mitra
    Glass, James
    [J]. ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2019, 11 (03):
  • [8] Baresi L., 2018, Encyclopedia of Database Systems, VSecond, DOI DOI 10.1007/978-1-4614-8265-9
  • [9] Bhattarai B, 2021, Arxiv, DOI arXiv:2105.09114
  • [10] Demographics and topics impact on the co-spread of COVID-19 misinformation and fact-checks on Twitter
    Burel, Gregoire
    Farrell, Tracie
    Alani, Harith
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (06)