COVID-WAREHOUSE: A Data Warehouse of Italian COVID-19, Pollution, and Climate Data

被引:17
作者
Agapito, Giuseppe [1 ,2 ]
Zucco, Chiara [3 ]
Cannataro, Mario [2 ,3 ]
机构
[1] Magna Graecia Univ Catanzaro, Dept Legal Econ & Social Sci, I-88100 Catanzaro, Italy
[2] Magna Graecia Univ Catanzaro, Data Analyt Res Ctr, I-88100 Catanzaro, Italy
[3] Magna Graecia Univ Catanzaro, Dept Med & Surg Sci, I-88100 Catanzaro, Italy
关键词
Italian COVID-19 data; data analysis; data warehouse; data integration; pollution data; climate data;
D O I
10.3390/ijerph17155596
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The management of the COVID-19 pandemic presents several unprecedented challenges in different fields, from medicine to biology, from public health to social science, that may benefit from computing methods able to integrate the increasing available COVID-19 and related data (e.g., pollution, demographics, climate, etc.). With the aim to face the COVID-19 data collection, harmonization and integration problems, we present the design and development of COVID-WAREHOUSE, a data warehouse that models, integrates and stores the COVID-19 data made available daily by the Italian Protezione Civile Department and several pollution and climate data made available by the Italian Regions. After an automatic ETL (Extraction, Transformation and Loading) step, COVID-19 cases, pollution measures and climate data, are integrated and organized using the Dimensional Fact Model, using two main dimensions: time and geographical location. COVID-WAREHOUSE supports OLAP (On-Line Analytical Processing) analysis, provides a heatmap visualizer, and allows easy extraction of selected data for further analysis. The proposed tool can be used in the context of Public Health to underline how the pandemic is spreading, with respect to time and geographical location, and to correlate the pandemic to pollution and climate data in a specific region. Moreover, public decision-makers could use the tool to discover combinations of pollution and climate conditions correlated to an increase of the pandemic, and thus, they could act in a consequent manner. Case studies based on data cubes built on data from Lombardia and Puglia regions are discussed. Our preliminary findings indicate that COVID-19 pandemic is significantly spread in regions characterized by high concentration of particulate in the air and the absence of rain and wind, as even stated in other works available in literature.
引用
收藏
页码:1 / 22
页数:22
相关论文
共 50 条
  • [41] Datawarehouser: A Data Warehouse artist who have ability to understand data warehouse schema pictures
    Warnars, Harco Leslie Hendric Spits
    Randriatoamanana, Richard
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2205 - 2208
  • [42] Automated Data Validation for Data Warehouse Testing
    Savanur, Sandhya
    Shreedhara, K. S.
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2016, : 223 - 226
  • [43] Data Integration Patterns for Data Warehouse Automation
    Tomingas, Kalle
    Kliimask, Margus
    Tammet, Tanel
    NEW TRENDS IN DATABASE AND INFORMATION SYSTEMS II, 2015, 312 : 41 - 55
  • [44] Big Data Augmentation with Data Warehouse: A Survey
    Aftab, Umar
    Siddiqui, Ghazanfar Farooq
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2775 - 2784
  • [45] Warehouse creation - A potential roadblock to data warehousing
    Srivastava, J
    Chen, PY
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1999, 11 (01) : 118 - 126
  • [46] ThaleMine: A Warehouse for Arabidopsis Data Integration and Discovery
    Krishnakumar, Vivek
    Contrino, Sergio
    Cheng, Chia-Yi
    Belyaeva, Irina
    Ferlanti, Erik S.
    Miller, Jason R.
    Vaughn, Matthew W.
    Micklem, Gos
    Town, Christopher D.
    Chan, Agnes P.
    PLANT AND CELL PHYSIOLOGY, 2017, 58 (01) : e4
  • [47] COVID-19 outbreak data analysis and prediction
    Anandan R.
    Nalini T.
    Chiwhane S.
    Shanmuganathan M.
    Radhakrishnan P.
    Measurement: Sensors, 2023, 25
  • [48] Development of a statewide highway safety data warehouse: Massachusetts data warehouse and web-based access
    Rothenberg, HA
    Riessman, R
    Flatten, D
    JOURNAL OF SAFETY RESEARCH, 2005, 36 (05) : 467 - 469
  • [49] A relational-XML data warehouse for data aggregation with SQL and XQuery
    Fong, Joseph
    Shiu, Herbert
    Cheung, Davy
    SOFTWARE-PRACTICE & EXPERIENCE, 2008, 38 (11) : 1183 - 1213
  • [50] Data Warehouse Performance: Selected Techniques and Data Structures
    Wrembel, Robert
    BUSINESS INTELLIGENCE, 2012, 96 : 27 - 62