PAN-LDA: A latent Dirichlet allocation based novel feature extraction model for COVID-19 data using machine learning

被引:21
|
作者
Gupta, Aakansha [1 ]
Katarya, Rahul [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Big Data Analyt & Web Intelligence Lab, New Delhi, India
关键词
COVID-19; Latent dirichlet allocation; Collapsed gibbs sampling; Data mining; Feature extraction; Backpropagation;
D O I
10.1016/j.compbiomed.2021.104920
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The recent outbreak of novel Coronavirus disease or COVID-19 is declared a pandemic by the World Health Organization (WHO). The availability of social media platforms has played a vital role in providing and obtaining information about any ongoing event. However, consuming a vast amount of online textual data to predict an event's trends can be troublesome. To our knowledge, no study analyzes the online news articles and the disease data about coronavirus disease. Therefore, we propose an LDA-based topic model, called PAN-LDA (Pandemic Latent Dirichlet allocation), that incorporates the COVID-19 cases data and news articles into common LDA to obtain a new set of features. The generated features are introduced as additional features to Machine learning (ML) algorithms to improve the forecasting of time series data. Furthermore, we are employing collapsed Gibbs sampling (CGS) as the underlying technique for parameter inference. The results from experiments suggest that the obtained features from PAN-LDA generate more identifiable topics and empirically add value to the outcome.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Efficient analysis of COVID-19 clinical data using machine learning models
    Sarwan Ali
    Yijing Zhou
    Murray Patterson
    Medical & Biological Engineering & Computing, 2022, 60 : 1881 - 1896
  • [22] Efficient analysis of COVID-19 clinical data using machine learning models
    Ali, Sarwan
    Zhou, Yijing
    Patterson, Murray
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (07) : 1881 - 1896
  • [23] Predictive analysis and survey of COVID-19 using machine learning and big data
    Sharma, Shruti
    Gupta, Yogesh Kumar
    JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2021, 24 (01) : 175 - 195
  • [24] COVID-19 Data Analysis and Appropriate Vaccine Prediction using Machine Learning
    Ullah, Md. Oli
    Nobel, S. M. Nuruzzaman
    2022 IEEE 13TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2022, : 496 - 504
  • [25] Automated COVID-19 diagnosis and classification using convolutional neural network with fusion based feature extraction model
    K. Shankar
    Sachi Nandan Mohanty
    Kusum Yadav
    T. Gopalakrishnan
    Ahmed M. Elmisery
    Cognitive Neurodynamics, 2023, 17 : 1 - 14
  • [26] An automated COVID-19 detection based on fused dynamic exemplar pyramid feature extraction and hybrid feature selection using deep learning
    Ozyurt, Fatih
    Tuncer, Turker
    Subasi, Abdulhamit
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 132
  • [27] An automated COVID-19 detection based on fused dynamic exemplar pyramid feature extraction and hybrid feature selection using deep learning
    Ozyurt, Fatih
    Tuncer, Turker
    Subasi, Abdulhamit
    Computers in Biology and Medicine, 2021, 132
  • [28] Automated COVID-19 diagnosis and classification using convolutional neural network with fusion based feature extraction model
    Shankar, K.
    Mohanty, Sachi Nandan
    Yadav, Kusum
    Gopalakrishnan, T.
    Elmisery, Ahmed M.
    COGNITIVE NEURODYNAMICS, 2023, 17 (03) : 1 - 14
  • [29] An approach to forecast impact of Covid-19 using supervised machine learning model
    Mohan, Senthilkumar
    John, A.
    Abugabah, Ahed
    Adimoolam, M.
    Kumar Singh, Shubham
    Kashif Bashir, Ali
    Sanzogni, Louis
    SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (04) : 824 - 840
  • [30] SENTIMENTAL ANALYSIS OF COVID-19 TWITTER DATA USING DEEP LEARNING AND MACHINE LEARNING MODELS
    Darad, Simran
    Krishnan, Sridhar
    INGENIUS-REVISTA DE CIENCIA Y TECNOLOGIA, 2023, (29): : 108 - 116