COVID-WAREHOUSE: A Data Warehouse of Italian COVID-19, Pollution, and Climate Data

被引:17
|
作者
Agapito, Giuseppe [1 ,2 ]
Zucco, Chiara [3 ]
Cannataro, Mario [2 ,3 ]
机构
[1] Magna Graecia Univ Catanzaro, Dept Legal Econ & Social Sci, I-88100 Catanzaro, Italy
[2] Magna Graecia Univ Catanzaro, Data Analyt Res Ctr, I-88100 Catanzaro, Italy
[3] Magna Graecia Univ Catanzaro, Dept Med & Surg Sci, I-88100 Catanzaro, Italy
关键词
Italian COVID-19 data; data analysis; data warehouse; data integration; pollution data; climate data;
D O I
10.3390/ijerph17155596
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The management of the COVID-19 pandemic presents several unprecedented challenges in different fields, from medicine to biology, from public health to social science, that may benefit from computing methods able to integrate the increasing available COVID-19 and related data (e.g., pollution, demographics, climate, etc.). With the aim to face the COVID-19 data collection, harmonization and integration problems, we present the design and development of COVID-WAREHOUSE, a data warehouse that models, integrates and stores the COVID-19 data made available daily by the Italian Protezione Civile Department and several pollution and climate data made available by the Italian Regions. After an automatic ETL (Extraction, Transformation and Loading) step, COVID-19 cases, pollution measures and climate data, are integrated and organized using the Dimensional Fact Model, using two main dimensions: time and geographical location. COVID-WAREHOUSE supports OLAP (On-Line Analytical Processing) analysis, provides a heatmap visualizer, and allows easy extraction of selected data for further analysis. The proposed tool can be used in the context of Public Health to underline how the pandemic is spreading, with respect to time and geographical location, and to correlate the pandemic to pollution and climate data in a specific region. Moreover, public decision-makers could use the tool to discover combinations of pollution and climate conditions correlated to an increase of the pandemic, and thus, they could act in a consequent manner. Case studies based on data cubes built on data from Lombardia and Puglia regions are discussed. Our preliminary findings indicate that COVID-19 pandemic is significantly spread in regions characterized by high concentration of particulate in the air and the absence of rain and wind, as even stated in other works available in literature.
引用
收藏
页码:1 / 22
页数:22
相关论文
共 50 条
  • [1] Use of the Hefesto v2.0 methodology to implement a Data warehouse: Case applied COVID-19
    Hernandez Cruz, Luz Maria
    Barrera Lao, Francisco Javier
    Mex Alvarez, Diana Concepcion
    Castillo Tellez, Margarita
    Canto Canul, Roberto Carlos
    Solis May, Josue Israel
    Flores Guerrero, Mayra Deyanira
    2022 17TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2022,
  • [2] Data Warehouse and Data Virtualization
    Mousa, Ayad Hameed
    Shiratuddin, Norshuhada
    PROCEEDINGS 2015 INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING DESE 2015, 2015, : 369 - 372
  • [3] Hybrid Data Warehouse Model for Climate Big Data Analysis
    Doreswamy
    Gad, Ibrahim
    Manjunatha, B. R.
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT ,POWER AND COMPUTING TECHNOLOGIES (ICCPCT), 2017,
  • [4] Minable Data Warehouse
    Morgan, David
    Kang, Jai W.
    Kang, James M.
    ENTERPRISE INFORMATION SYSTEMS-BK, 2009, 24 : 125 - +
  • [5] Data Warehouse Testing
    Homayouni, Hajar
    Ghosh, Sudipto
    Ray, Indrakshi
    ADVANCES IN COMPUTERS, VOL 112, 2019, 112 : 223 - 273
  • [6] Data Warehouse Testing
    Golfarelli, Matteo
    Rizzi, Stefano
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2011, 7 (02) : 26 - 43
  • [7] An academic data warehouse
    Dell'Aquila, Carlo
    Di Tria, Francesco
    Lefons, Ezio
    Tangorra, Filippo
    PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED INFORMATICS AND COMMUNICATIONS, 2007, : 231 - 237
  • [8] On the Research of Data Warehouse in Big Data
    Qin, Hai-fei
    Qian, Zhi-ming
    Zhao, Yong-chao
    2015 INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC), 2015, : 354 - 357
  • [9] Data Warehouse and Data Quality - An Overview
    Brajkovic, Helena
    Jaksic, Danijela
    Poscic, Patrizia
    CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS (CECIIS 2020), 2020, : 17 - 24
  • [10] Data Quality in Data Warehouse Systems
    Serra, Flavia
    Marotta, Adriana
    PROCEEDINGS OF THE 2016 XLII LATIN AMERICAN COMPUTING CONFERENCE (CLEI), 2016,