Evaluating the plausible application of advanced machine learnings in exploring determinant factors of present pandemic: A case for continent specific COVID-19 analysis

被引:22
作者
Chakraborti, Suman [1 ]
Maiti, Arabinda [2 ]
Pramanik, Suvamoy [1 ]
Sannigrahi, Srikanta [3 ]
Pilla, Francesco [3 ]
Banerjee, Anushna [1 ]
Das, Dipendra Nath [1 ]
机构
[1] Jawaharlal Nehru Univ, Ctr Study Reg Dev, Delhi 110067, India
[2] Vidyasagar Univ, Geog & Environm Management, Midnapore, W Bengal, India
[3] Univ Coll Dublin Richview, Sch Architecture Planning & Environm Policy, Dublin D14 E099, Ireland
关键词
Air pollution; Machine learning; Pandemic; Socioeconomic; COVID-19;
D O I
10.1016/j.scitotenv.2020.142723
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Coronavirus disease, a novel severe acute respiratory syndrome (SARS COVID-19), has become a global health concern due to its unpredictable nature and lack of adequate medicines. Machine Learning (ML) models could be effective in identifying the most critical factors which are responsible for the overall fatalities caused by COVID-19. The functional capabilities of ML models in epidemiological research, especially for COVID-19, are not substantially explored. To bridge this gap, this study has adopted two advanced ML models, viz. Random Forest (RF) and Gradient Boosted Machine (GBM), to perform the regression modelling and provide subsequent interpretation. Five successive steps were followed to carry out the analysis: (1) identification of relevant key explanatory variables; (2) application of data dimensionality reduction for eliminating redundant information; (3) utilizing ML models for measuring relative influence (RI) of the explanatory variables; (4) evaluating interconnections between and among the key explanatory variables and COVID-19 case and death counts; (5) time series analysis for examining the rate of incidences of COVID-19 cases and deaths. Among the explanatory variables considered in this study, air pollution, migration, economy, and demographic factor were found to be the most significant controlling factors. Since a very limited research is available to discuss the superiority of ML models for identifying the key determinants of COVID-19, this study could be a reference for future public health research. Additionally, all the models and data used in this study are open source and freely available, thereby, reproducibility and scientific replication will be achievable easily. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 71 条
[1]   Obese communities among the best predictors of COVID-19-related deaths [J].
AbdelMassih, Antoine Fakhry ;
Ghaly, Ramy ;
Amin, Abeer ;
Gaballah, Amr ;
Kamel, Aya ;
Heikal, Bassant ;
Menshawey, Esraa ;
Ismail, Habiba-Allah ;
Hesham, Hend ;
Attallah, Josephine ;
Eshak, Kirollos ;
Moursi, Mai ;
Khaled-Ibn-ElWalid, Mariam ;
Tawfik, Marwa ;
Tarek, Mario ;
Mohy-El-Din, Mayan ;
Habib, Menna ;
Hafez, Nada ;
Bahnan, Odette ;
Ismail, Passant ;
Senoussy, Sara ;
Ghaly, Sherry ;
Farah, Sousanna ;
Hozaien, Rafeef ;
Adel, Veronia ;
Khaled, Mariam .
CARDIOVASCULAR ENDOCRINOLOGY & METABOLISM, 2020, 9 (03) :102-107
[2]   Modelling spatial variations of coronavirus disease (COVID-19) in Africa [J].
Adekunle, Ibrahim Ayoade ;
Onanuga, Abayomi Toyin ;
Akinola, Olanrewaju Olugbenga ;
Ogunbanjo, Olakitan Wahab .
SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 729
[3]   Investigation of effective climatology parameters on COVID-19 outbreak in Iran [J].
Ahmadi, Mohsen ;
Sharifi, Abbas ;
Dorosti, Shadi ;
Ghoushchi, Saeid Jafarzadeh ;
Ghanbari, Negar .
SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 729
[4]   Prediction of Epidemic Peak and Infected Cases for COVID-19 Disease in Malaysia, 2020 [J].
Alsayed, Abdallah ;
Sadir, Hayder ;
Kamil, Raja ;
Sari, Hasan .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (11) :1-15
[5]  
[Anonymous], 2020, GOOGLE COVID 19 COMM
[6]   Geostatistical COVID-19 infection risk maps for Portugal [J].
Azevedo, Leonardo ;
Pereira, Maria Joao ;
Ribeiro, Manuel C. ;
Soares, Amilcar .
INTERNATIONAL JOURNAL OF HEALTH GEOGRAPHICS, 2020, 19 (01)
[7]   Does lockdown reduce air pollution? Evidence from 44 cities in northern China [J].
Bao, Rui ;
Zhang, Acheng .
SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 731
[8]   Correlation between climate indicators and COVID-19 pandemic in New York, USA [J].
Bashir, Muhammad Farhan ;
Ma, Benjiang ;
Bilal ;
Komal, Bushra ;
Bashir, Muhammad Adnan ;
Tan, Duojiao ;
Bashir, Madiha .
SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 728
[9]   Spread of SARS-CoV-2 through Latin America and the Caribbean region: A look from its economic conditions, climate and air pollution indicators [J].
Bolano-Ortiz, Tomas R. ;
Camargo-Caicedo, Yiniva ;
Enrique Puliafito, Salvador ;
Florencia Ruggeri, Maria ;
Bolano-Diaz, Sindy ;
Pascual-Flores, Romina ;
Saturno, Jorge ;
Ibarra-Espinosa, Sergio ;
Mayol-Bracero, Olga L. ;
Torres-Delgado, Elvis ;
Cereceda-Balic, Francisco .
ENVIRONMENTAL RESEARCH, 2020, 191
[10]  
Borghi J., 2020, WE NEED GLOBAL HLTH