Prediction of new active cases of coronavirus disease (COVID-19) pandemic using multiple linear regression model

被引:110
作者
Rath, Smita [1 ]
Tripathy, Alakananda [1 ]
Tripathy, Alok Ranjan [2 ]
机构
[1] Siksha O Anusandhan Deemed Be Univ, Dept Comp Sci & Engn, Bhubaneswar, Odisha, India
[2] Ravenshaw Univ, Dept Comp Sci, Cuttack, India
关键词
Coronavirus; India; Odisha; Correlation coefficient; Linear regression; Multiple linear regression;
D O I
10.1016/j.dsx.2020.07.045
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Introduction and Aims: The COVID-19 pandemic originated from the city of Wuhan of China has highly affected the health, socio-economic and financial matters of the different countries of the world. India is one of the countries which is affected by the disease and thousands of people on daily basis are getting infected. In this paper, an analysis of daily statistics of people affected by the disease are taken into account to predict the next days trend in the active cases in Odisha as well as India. Material and methods: A valid global data set is collected from the WHO daily statistics and correlation among the total confirmed, active, deceased, positive cases are stated in this paper. Regression model such as Linear and Multiple Linear Regression techniques are applied to the data set to visualize the trend of the affected cases. Results: Here a comparison of Linear Regression and Multiple Linear Regression model is performed where the score of the model R-2 tends to be 0.99 and 1.0 which indicates a strong prediction model to forecast the next coming days active cases. Using the Multiple Linear Regression model as on July month, the forecast value of 52,290 active cases are predicted towards the next month of 15th August in India and 9,358 active cases in Odisha if situation continues like this way. Conclusion: These models acquired remarkable accuracy in COVID-19 recognition. A strong correlation factor determines the relationship among the dependent (active) with the independent variables (positive, deceased, recovered). (c) 2020 Diabetes India. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1467 / 1474
页数:8
相关论文
共 17 条
[1]  
[Anonymous], 2020, YouGov International COVID-19 tracker update
[2]   Sentiment analysis of nationwide lockdown due to COVID 19 outbreak: Evidence from India [J].
Barkur, Gopalkrishna ;
Vibha ;
Kamath, Giridhar B. .
ASIAN JOURNAL OF PSYCHIATRY, 2020, 51
[3]  
Center for Disease Control and Prevention, 2020, SYMPT COR
[4]   Analysis of mechanical properties for tea stem using grey relational analysis coupled with multiple linear regression [J].
Du, Zhe ;
Hu, Yongguang ;
Buttar, Noman Ali .
SCIENTIA HORTICULTURAE, 2020, 260
[5]   Application of artificial neural network and multiple linear regression in modeling nutrient recovery in vermicompost under different conditions [J].
Hosseinzadeh, Ahmad ;
Baziar, Mansour ;
Alidadi, Hossein ;
Zhou, John L. ;
Altaee, Ali ;
Najafpoor, Ali Asghar ;
Jafarpour, Salman .
BIORESOURCE TECHNOLOGY, 2020, 303
[6]   Forecasting Road Traffic Deaths in Thailand: Applications of Time-Series, Curve Estimation, Multiple Linear Regression, and Path Analysis Models [J].
Jomnonkwao, Sajjakaj ;
Uttra, Savalee ;
Ratanavaraha, Vatanavongs .
SUSTAINABILITY, 2020, 12 (01)
[7]   Prediction of water quality index using artificial neural network and multiple linear regression modelling approach in Shivganga River basin, India [J].
Kadam, A. K. ;
Wagh, V. M. ;
Muley, A. A. ;
Umrikar, B. N. ;
Sankhua, R. N. .
MODELING EARTH SYSTEMS AND ENVIRONMENT, 2019, 5 (03) :951-962
[8]  
Khademi Faezehossadat, 2016, International Journal of Sustainable Built Environment, V5, P355, DOI 10.1016/j.ijsbe.2016.09.003
[9]   Analyzing Vietnam's national disaster loss database for flood risk assessment using multiple linear regression-TOPSIS [J].
Luu, Chinh ;
von Meding, Jason ;
Mojtahedi, Mohammad .
INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION, 2019, 40
[10]  
Salleh Faridah Hani Mohamed, 2017, Advances in Bioinformatics, V2017, P4827171, DOI 10.1155/2017/4827171