Lightweight Open Data Assimilation of Pan-European Urban Air Quality

被引:0
作者
Miasayedava, Lizaveta [1 ,2 ]
Kaugerand, Jaanus [1 ]
Tuhtan, Jeffrey A. [2 ]
机构
[1] Tallinn Univ Technol, Res Lab Proact Technol, Tallinn 12616, Estonia
[2] Tallinn Univ Technol, Dept Comp Syst, Tallinn 12616, Estonia
关键词
Ambient air quality; data assimilation; environmental monitoring; open data; uncertainty quantification; UNCERTAINTY;
D O I
10.1109/ACCESS.2023.3302348
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The number of ambient air quality monitoring stations is growing globally, driven by the need to quantify potential health risks posed by air pollution on urban populations. Reliable, robust and interoperable air quality monitoring requires observations with consistent accuracy and low amounts of missing data. In practice, this is challenging to achieve due to the measurement limitations and complexity of the physical phenomena. Data assimilation methods are widely used to fill missing or faulty observations and improve data quality by combining observations from fixed air quality monitoring ground stations with large-scale numerical models. A further advantage of data assimilation is that it can decrease costs by reusing existing open government data. A key requirement for assimilation is that uncertainty estimates are available for both measurements and model data. However, this poses a major bottleneck for widespread data assimilation with open data because uncertainty estimates are frequently unavailable. Additional challenges addressed in this work include the needs to impute missing data and process observations and model simulation results at different temporal and spatial scales. To address these challenges, we have developed novel, lightweight data assimilation algorithms based on recursive least-squares. The algorithms provide a fully data-driven way to estimate unknown uncertainties by defining the weights of the input data sources using least-squares data assimilation. The lightweight data assimilation algorithms can be executed to update the current state estimate in near real-time scenarios to improve the accuracy, completeness, and precision of the analysis estimate. A sensitivity analysis is conducted using synthetic data based on logistic maps with increasing noise levels. In addition, the proposed assimilation algorithms are applied to large-scale open pan-European air quality monitoring station data. The data were obtained from 86 stations for CO, 593 stations for NO2, 462 stations for O-3, 137 stations for SO2, 254 stations for PM2.5, and 445 stations for PM10 in the period from 2022-01-27 01:00:00 to 2022-02-25 15:00:00 from the European Environmental Agency (EEA) and corresponding simulation results from the System for Integrated modeLling of Atmospheric composition (SILAM, global, version 5.7, FRC forecasts at the surface). The proposed lightweight data assimilation methods were found suitable to improve the completeness (filling in all missing data), accuracy (taken as the RMSE between the assimilation results and ground station observations) and precision for all of the open air quality parameters evaluated in this work. Furthermore, the proposed lightweight assimilation algorithms may also provide new and cost-effective methods to improve the data quality of the growing number of Internet of Things (IoT) urban air quality sensors.
引用
收藏
页码:84670 / 84688
页数:19
相关论文
共 32 条
[1]   Comparative Analysis of Machine Learning Techniques for Predicting Air Quality in Smart Cities [J].
Ameer, Saba ;
Shah, Munam Ali ;
Khan, Abid ;
Song, Houbing ;
Maple, Carsten ;
Ul Islam, Saif ;
Asghar, Muhammad Nabeel .
IEEE ACCESS, 2019, 7 :128325-128338
[2]   Mapping pollution exposure and chemistry during an extreme air quality event (the 2018 K=ilauea eruption) using a low-cost sensor network [J].
Crawford, Ben ;
Hagan, David H. ;
Grossman, Ilene ;
Cole, Elizabeth ;
Holland, Lacey ;
Heald, Colette L. ;
Kroll, Jesse H. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (27)
[3]   Towards uncertainty mapping in air-quality modelling and assessment [J].
Denby, Bruce ;
Dudek, Agnes ;
Walker, Sam Erik ;
Costa, Ana Margarida ;
Monteiro, Alexandra ;
van den Elshout, Sef ;
Fisher, Bernard .
INTERNATIONAL JOURNAL OF ENVIRONMENT AND POLLUTION, 2011, 44 (1-4) :14-23
[4]   Numerical Dust Storm Simulation Using Modified Geographical Domain and Data Assimilation: 3DVAR and 4DVAR (WRF-Chem/WRFDA) [J].
Eltahan, Muhammed ;
Alahmadi, Sabah .
IEEE ACCESS, 2019, 7 :128980-128989
[5]  
European Environment Agency, 2022, DOWNL AIR QUAL DAT
[6]  
Evensen G, 2003, OCEAN DYNAM, V53, P343, DOI [DOI 10.1007/S10236-003-0036-9, 10.1007/s10236-003-0036-9]
[7]   Combining a Fully Connected Neural Network With an Ensemble Kalman Filter to Emulate a Dynamic Model in Data Assimilation [J].
Fan, Manhong ;
Bai, Yulong ;
Wang, Lili ;
Ding, Lin .
IEEE ACCESS, 2021, 9 :144952-144964
[8]  
Fine J. D., 2003, 54222 LAWR BERK NAT
[9]  
Finnish Meteorological Institute, 2022, SILAM V5 7 SYST INT
[10]   Data fusion for air quality mapping using low-cost sensor observations: Feasibility and added-value [J].
Gressent, Alicia ;
Malherbe, Laure ;
Colette, Augustin ;
Rollin, Hugo ;
Scimia, Romain .
ENVIRONMENT INTERNATIONAL, 2020, 143