PAN-LDA: A latent Dirichlet allocation based novel feature extraction model for COVID-19 data using machine learning

Cited by: 21
Authors
Gupta, Aakansha [1]
Katarya, Rahul [1]
Affiliation
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Big Data Analyt & Web Intelligence Lab, New Delhi, India
Keywords
COVID-19; Latent Dirichlet allocation; Collapsed Gibbs sampling; Data mining; Feature extraction; Backpropagation
DOI
10.1016/j.compbiomed.2021.104920
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
The recent outbreak of the novel coronavirus disease, COVID-19, was declared a pandemic by the World Health Organization (WHO). Social media platforms have played a vital role in providing and obtaining information about ongoing events. However, consuming a vast amount of online textual data to predict an event's trends can be troublesome. To our knowledge, no prior study jointly analyzes online news articles and disease data for COVID-19. Therefore, we propose an LDA-based topic model, called PAN-LDA (Pandemic Latent Dirichlet Allocation), that incorporates COVID-19 case data and news articles into standard LDA to obtain a new set of features. The generated features are supplied as additional inputs to machine learning (ML) algorithms to improve the forecasting of time series data. Furthermore, we employ collapsed Gibbs sampling (CGS) as the underlying technique for parameter inference. Experimental results suggest that the features obtained from PAN-LDA yield more identifiable topics and empirically add value to the outcome.
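To make the inference step concrete, the following is a minimal sketch of collapsed Gibbs sampling for standard LDA, not the PAN-LDA model itself; the record above does not describe how case data enters the model, so that part is omitted. The function name `lda_collapsed_gibbs`, the hyperparameter defaults, and the toy corpus are all assumptions made for illustration.

```python
import random


def lda_collapsed_gibbs(docs, n_topics, vocab_size,
                        alpha=0.1, beta=0.01, n_iters=200, seed=0):
    """Collapsed Gibbs sampling for plain LDA (illustrative sketch only).

    docs is a list of documents, each a list of integer word ids in
    range(vocab_size). Returns per-document topic proportions (theta)
    and the topic-word count matrix.
    """
    rng = random.Random(seed)
    doc_topic = [[0] * n_topics for _ in docs]                # n_{d,k}
    topic_word = [[0] * vocab_size for _ in range(n_topics)]  # n_{k,w}
    topic_total = [0] * n_topics                              # n_k
    assignments = []

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current token from all counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1

                # Unnormalised conditional p(z = t | all other assignments).
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]

                # Add the token back under its newly sampled topic.
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1

    # Smoothed point estimate of each document's topic proportions.
    theta = [
        [(doc_topic[d][t] + alpha) / (len(doc) + n_topics * alpha)
         for t in range(n_topics)]
        for d, doc in enumerate(docs)
    ]
    return theta, topic_word


# Hypothetical toy corpus: word ids 0-2 and 3-5 tend to co-occur.
toy_docs = [[0, 1, 2, 0, 1], [3, 4, 5, 3, 4], [0, 2, 1, 2], [5, 4, 3, 5]]
theta, topic_word = lda_collapsed_gibbs(toy_docs, n_topics=2, vocab_size=6)
```

Each row of `theta` could then be appended to a document's or time step's feature vector before fitting a forecasting model, which is one plausible reading of how topic features are "introduced as additional features" to the ML algorithms.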
Pages: 13