Disease outbreak prediction using natural language processing: a review

被引:0
作者
Gautam, Avneet Singh [1 ]
Raza, Zahid [1 ]
机构
[1] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, JNU Ring Rd, New Delhi 110067, India
关键词
Disease outbreak prediction; Natural language processing; Text analysis; Clustering; Machine learning; News data; Search data; Twitter data; EAST RESPIRATORY SYNDROME; SOCIAL MEDIA; SOUTH-KOREA; SURVEILLANCE; INTELLIGENCE; TWITTER; EBOLA; COVID-19; SYSTEMS;
D O I
10.1007/s10115-024-02192-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research on disease outbreak prediction has suddenly received an enormous interest owing to the COVID-19 pandemic. Natural language processing using user-generated text data has proven to be quite effective for the same. Disease outbreaks that occur frequently can be easily predicted, but novel disease outbreaks are difficult to predict. This review work attempts to summarize the research concerning disease outbreaks and the use of datasets such as news headlines, tweets, and search engine queries using natural language processing techniques. Existing state-of-the-art systems have been analytically discussed with their contributions and limitations. This work is an insight into the existing research in the domain of disease outbreak prediction. A total of 146 articles were reviewed in this study, and results show that news and Twitter datasets are being used most to predict disease outbreaks. This research underlines the fact that numerous works are available in the literature based on specific outbreak-related Internet-sourced text data, viz. news, tweets, and search engine queries. However, this becomes a limitation for any disease outbreak prediction system as it can predict only specific disease outbreaks and motivates the development of systems capable of disease outbreak prediction without any bias.
引用
收藏
页码:6561 / 6595
页数:35
相关论文
共 186 条
  • [81] Early warning systems (EWSs) for chikungunya, dengue, malaria, yellow fever, and Zika outbreaks: What is the evidence? A scoping review
    Hussain-Alkhateeb, Laith
    Rivera Ramirez, Tatiana
    Kroeger, Axel
    Gozzer, Ernesto
    Runge-Ranzinger, Silvia
    [J]. PLOS NEGLECTED TROPICAL DISEASES, 2021, 15 (09):
  • [82] International Health Regulations (IHR), About us
  • [83] Using twitter and web news mining to predict COVID-19 outbreak
    Jahanbin, Kia
    Rahmanian, Vahid
    [J]. ASIAN PACIFIC JOURNAL OF TROPICAL MEDICINE, 2020, 13 (08) : 378 - 380
  • [84] EagleEye: A Worldwide Disease-Related Topic Extraction System Using a Deep Learning Based Ranking Algorithm and Internet-Sourced Data
    Jang, Beakcheol
    Kim, Myeonghwi
    Kim, Inhwan
    Kim, Jong Wook
    [J]. SENSORS, 2021, 21 (14)
  • [85] Jang Beakcheol, 2021, JMIR Med Inform, V9, pe23305, DOI 10.2196/23305
  • [86] Word2vec convolutional neural networks for classification of news articles and tweets
    Jang, Beakcheol
    Kim, Inhwan
    Kim, Jong Wook
    [J]. PLOS ONE, 2019, 14 (08):
  • [87] PEACOCK: A Map-Based Multitype Infectious Disease Outbreak Information System
    Jang, Beakcheol
    Lee, Miran
    Kim, Jong Wook
    [J]. IEEE ACCESS, 2019, 7 : 82956 - 82969
  • [88] Scoping future outbreaks: a scoping review on the outbreak prediction of the WHO Blueprint list of priority diseases
    Jonkmans, Nils
    D'Acremont, Valerie
    Flahault, Antoine
    [J]. BMJ GLOBAL HEALTH, 2021, 6 (09):
  • [89] Automated monitoring of tweets for early detection of the 2014 Ebola epidemic
    Joshi, Aditya
    Sparks, Ross
    Karimi, Sarvnaz
    Yan, Sheng-Lun Jason
    Chughtai, Abrar Ahmad
    Paris, Cecile
    MacIntyre, C. Raina
    [J]. PLOS ONE, 2020, 15 (03):
  • [90] Comparison of ARIMA and Random Forest time series models for prediction of avian influenza H5N1 outbreaks
    Kane, Michael J.
    Price, Natalie
    Scotch, Matthew
    Rabinowitz, Peter
    [J]. BMC BIOINFORMATICS, 2014, 15