Design of novel etl model to analyse corona virus data

被引:0
|
作者
Dewangan A.K. [1 ]
Ghosh S.M. [2 ]
Shrivas A.K. [3 ]
机构
[1] Department of Information Technology, Guru Ghasidas Vishwavidyalaya, Bilaspur
[2] Department of Computer Science and Engineering, Dr. C.V. Raman University, Kota, Bilaspur
[3] Department of Computer Science and Information Technology, Guru Ghasidas Vishwavidyalaya, Bilaspur
关键词
Corona Virus; Covid-19; Data Analytics; ETL; Pandemic; Text Mining;
D O I
10.4108/eai.13-7-2018.165671
中图分类号
学科分类号
摘要
INTRODUCTION: The corona disease was first recognized in 2019 in Wuhan, which is the capital of China’s Hubei-province, and from then it continued spreading and as a result declared as a pandemic by all nations. The COVID-19 virus has different effects on people in various ways. It is a kind of respiratory disease. The confirmed cases are increasing day to day in India, which leads to complete lockdown throughout the nation. OBJECTIVE: The objective of this research is to design a novel Extract-Trandform and Load NETL model to analyse covid-19 data in india. METHODS: The extraction of useful information from a large database is a well-connected research field of text mining. This paper is proposed a novel extract-transform-load ETL model to process the COVID-19 data of India to get the exact recovery data from the multiple data sources from different states of India. In this, a knowledge-based model that generate knowledge based on three different module split, validation, and join is discussed. RESULTS: The outcomes of the proposed NETL process are, output file which has the description of total positive cases, active cases, recovery cases, and death rate, based on different regions. The analysis of NETL is done based on accuracy, failure count, and execution time. The proposed NETL process is more accurate and taking less compilation time with minimum failure count as compared with existing models. CONCLUSION: To analyze the coronavirus data in India, a novel ETL (NETL) model is proposed. In this model, a total of 9 CSV files is processed as input files to get different results in different categories. This model is having three modules namely splitting, verification, and join. The dataset is split into based on its coupling attributes and then joined with a single value to produce the updated results as per the current dataset. The last stage of this process is to join the data which is generated through splitting. The proposed NETL model is more accurate as compared with existing ETM models. © 2020 Amit Kumar Dewangan et al., licensed to EAI.
引用
收藏
页码:1 / 11
页数:10
相关论文
共 50 条
  • [1] A UNIFIED MODEL DRIVEN METHODOLOGY FOR DATA WAREHOUSES AND ETL DESIGN
    Atigui, Faten
    Ravat, Franck
    Tournier, Ronan
    Zurfluh, Gilles
    ICEIS 2011: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2011, : 247 - 252
  • [2] A fractional complex network model for novel corona virus in China
    H. A. A. El-Saka
    I. Obaya
    H. N. Agiza
    Advances in Difference Equations, 2021
  • [3] A fractional complex network model for novel corona virus in China
    El-Saka, H. A. A.
    Obaya, I
    Agiza, H. N.
    ADVANCES IN DIFFERENCE EQUATIONS, 2021, 2021 (01)
  • [4] A Novel Processing Model For Scds In ETL
    Sun, Li
    Zhang, Jiaoyan
    Li, Jiyun
    PROCEEDINGS OF THE 2017 2ND JOINT INTERNATIONAL INFORMATION TECHNOLOGY, MECHANICAL AND ELECTRONIC ENGINEERING CONFERENCE (JIMEC 2017), 2017, 62 : 133 - 136
  • [5] Novel Corona: Posthuman Virus
    Hayles, N. Katherine
    CRITICAL INQUIRY, 2021, 47 : S68 - S72
  • [6] Design of Marine Data Warehouse ETL System
    Wen WeiJun
    MECHANICAL COMPONENTS AND CONTROL ENGINEERING III, 2014, 668-669 : 1374 - 1377
  • [7] RESEARCH ON THE DESIGN OF WEB DATA WAREHOUSE BASED ON ETL META DATA MODEL AND PARTICLE SWARM OPTIMISATION
    Jun-Zhou, Li
    Nan, Yu
    JOURNAL OF THE BALKAN TRIBOLOGICAL ASSOCIATION, 2016, 22 (02): : 1184 - 1192
  • [8] A proposed model for data warehouse ETL processes
    El-Sappagh, Shaker H. Ali
    Hendawi, Abdeltawab M. Ahmed
    El Bastawissy, Ali Hamed
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2011, 23 (02) : 91 - 104
  • [9] Research and design of data processing based on ETL framework
    Guo, Xiao-Li
    Chen, Bo
    MODERN TECHNOLOGIES IN MATERIALS, MECHANICS AND INTELLIGENT SYSTEMS, 2014, 1049 : 1966 - +
  • [10] Sensitivity and elasticity analysis of novel corona virus transmission model: A mathematical approach
    Das K.
    Ranjith Kumar G.
    Madhusudhan Reddy K.
    Lakshminarayan K.
    Sensors International, 2021, 2