COVID-WAREHOUSE: A Data Warehouse of Italian COVID-19, Pollution, and Climate Data

被引:17
|
作者
Agapito, Giuseppe [1 ,2 ]
Zucco, Chiara [3 ]
Cannataro, Mario [2 ,3 ]
机构
[1] Magna Graecia Univ Catanzaro, Dept Legal Econ & Social Sci, I-88100 Catanzaro, Italy
[2] Magna Graecia Univ Catanzaro, Data Analyt Res Ctr, I-88100 Catanzaro, Italy
[3] Magna Graecia Univ Catanzaro, Dept Med & Surg Sci, I-88100 Catanzaro, Italy
关键词
Italian COVID-19 data; data analysis; data warehouse; data integration; pollution data; climate data;
D O I
10.3390/ijerph17155596
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The management of the COVID-19 pandemic presents several unprecedented challenges in different fields, from medicine to biology, from public health to social science, that may benefit from computing methods able to integrate the increasing available COVID-19 and related data (e.g., pollution, demographics, climate, etc.). With the aim to face the COVID-19 data collection, harmonization and integration problems, we present the design and development of COVID-WAREHOUSE, a data warehouse that models, integrates and stores the COVID-19 data made available daily by the Italian Protezione Civile Department and several pollution and climate data made available by the Italian Regions. After an automatic ETL (Extraction, Transformation and Loading) step, COVID-19 cases, pollution measures and climate data, are integrated and organized using the Dimensional Fact Model, using two main dimensions: time and geographical location. COVID-WAREHOUSE supports OLAP (On-Line Analytical Processing) analysis, provides a heatmap visualizer, and allows easy extraction of selected data for further analysis. The proposed tool can be used in the context of Public Health to underline how the pandemic is spreading, with respect to time and geographical location, and to correlate the pandemic to pollution and climate data in a specific region. Moreover, public decision-makers could use the tool to discover combinations of pollution and climate conditions correlated to an increase of the pandemic, and thus, they could act in a consequent manner. Case studies based on data cubes built on data from Lombardia and Puglia regions are discussed. Our preliminary findings indicate that COVID-19 pandemic is significantly spread in regions characterized by high concentration of particulate in the air and the absence of rain and wind, as even stated in other works available in literature.
引用
收藏
页码:1 / 22
页数:22
相关论文
共 50 条
  • [31] Volunteer Design of Data Warehouse
    Bimonte, Sandro
    ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2020, 2020, 12127 : 561 - 562
  • [32] Summarizability in Multiversion Data Warehouse
    Turki, Ines Zouari
    Jedidi, Faiza Ghozzi
    Bouaziz, Rafik
    2014 21ST INTERNATIONAL SYMPOSIUM ON TEMPORAL REPRESENTATION AND REASONING (TIME 2014), 2014, : 111 - 120
  • [33] Extending the data warehouse for service provisioning data
    Kotidis, Yannis
    DATA & KNOWLEDGE ENGINEERING, 2006, 59 (03) : 700 - 724
  • [34] A Comparative Analysis of Data Warehouse Data Models
    Bojicic, Ivan
    Marjanovic, Zoran
    Turajlic, Nina
    Petrovic, Marko
    Vuckovic, Milica
    Jovanovic, Vladan
    2016 6TH INTERNATIONAL CONFERENCE ON COMPUTERS COMMUNICATIONS AND CONTROL (ICCCC), 2016, : 151 - 159
  • [35] Big Data Augmentation with Data Warehouse: A Survey
    Aftab, Umar
    Siddiqui, Ghazanfar Farooq
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2785 - 2794
  • [36] Cacophonic contributions to data quality in the data warehouse
    Rasmussen, Karsten Boye
    WMSCI 2005: 9TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL 7, 2005, : 311 - 316
  • [37] Data Warehouse Design for Big Data in Academia
    Rudniy, Alex
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 979 - 992
  • [38] The Applications of Data Mining in Tax Data Warehouse
    Tao, Wang
    Ning, Guo
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGY AND ENGINEERING, 2009, : 103 - +
  • [39] Data Warehouses Federation as a Single Data Warehouse
    Kern, Rafal
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2016, PT I, 2016, 9875 : 356 - 366
  • [40] A data warehouse architecture for clinical data warehousing
    Faculty of Information Technology, Queensland University of Technology, PO Box 2434, Brisbane 4001, QLD, Australia
    Conf. Res. Pract. Inf. Technol. Ser., 2007, (227-232):