Interpretable machine learning tools to analyze PM2.5 sensor network data so as to quantify local source impacts and long-range transport

被引:3
作者
de Foy, Benjamin [1 ]
Edwards, Ross [2 ]
Joy, Khaled Shaifullah [3 ,5 ]
Zaman, Shahid Uz [4 ]
Salam, Abdus [3 ]
Schauer, James J. [2 ,6 ]
机构
[1] St Louis Univ, Dept Earth & Atmospher Sci, St Louis, MO 63103 USA
[2] Wisconsin State Lab Hyg, Madison, WI USA
[3] Univ Dhaka, Dept Chem, Dhaka, Bangladesh
[4] Bangladesh Univ Engn & Technol, Dept Chem, Dhaka, Bangladesh
[5] Drexel Univ, Dept Chem, Philadelphia, PA 19104 USA
[6] Univ Wisconsin Madison, Environm Chem & Technol Program, Madison, WI USA
关键词
PM; 2.5; Low cost sensor; Sensor network; Generalized additive model; Interpretable machine learning; eXplainable artificial intelligence; PARTICLE DISPERSION MODEL;
D O I
10.1016/j.atmosres.2024.107656
中图分类号
P4 [大气科学(气象学)];
学科分类号
0706 ; 070601 ;
摘要
Sensor networks provide spatially resolved information about the time variation of PM2.5 concentrations in urban areas around the world. With relatively simple improvements to the control of the temperature and humidity of incoming air, and with proper quality assurance and calibration protocols, a low cost monitor was developed that provides measurements that were highly correlated with a reference PM2.5 monitor. These have sufficient resolution and reliability to quantify small differences within an urban area. Nevertheless, many sites report similar concentrations and it can be difficult to interpret the results or distinguish local from regional effects. Generalized Additive Models are an effective Machine Learning method to distinguish the impact of factors across very different scales. As a type of Interpretable Machine Learning / eXplainable Artificial Intelligence, they provide direct information on the link between specific factors and PM2.5 concentrations. GAM simulations were developed for sensors located around Dhaka, Bangladesh, for both the dry and the wet seasons. The simulations show that the largest contributor to high PM2.5 concentration variations across both urban and peri-urban sites is the boundary layer height which represent the vertical mixing of the urban plume. Using Trajectory Cluster Concentration Impacts, the simulations showed that robust estimates of long-range transport could be obtained from measurements located within a polluted environment, and the model further showed that enhancements of more than 40 mu g/m3 were associated with air transport from the Indo-Gangetic Plain in the dry season. Finally, interaction maps of the effect of horizontal wind speed and direction showed that these could be associated with up to +/-20 % variation in PM2.5 from site to site. Most of the enhancements are related to very calm winds and appear to be more strongly associated with road emissions than with point sources. Overall, the sensor network shows that air is polluted throughout the Dhaka area and into the peripheries, and that a multipronged approach will be needed to improve air quality for its inhabitants.
引用
收藏
页数:19
相关论文
共 45 条
[1]   Will the circle be unbroken: A history of the US national ambient air quality standards [J].
Bachmann, John .
JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION, 2007, 57 (06) :652-697
[2]  
Burkart N, 2021, J ARTIF INTELL RES, V70, P245
[3]   Highly local sources and large spatial variations in PM2.5 across a city: evidence from a city-wide sensor network in Cork, Ireland [J].
Byrne, Rosin ;
Ryan, Kevin ;
Venables, Dean S. ;
Wenger, John C. ;
Hellebust, Stig .
ENVIRONMENTAL SCIENCE-ATMOSPHERES, 2023, 3 (05) :919-930
[4]   Modelling and assessing trends in traffic-related emissions using a generalised additive modelling approach [J].
Carslaw, David C. ;
Beevers, Sean D. ;
Tate, James E. .
ATMOSPHERIC ENVIRONMENT, 2007, 41 (26) :5289-5299
[5]   openair - An R package for air quality data analysis [J].
Carslaw, David C. ;
Ropkins, Karl .
ENVIRONMENTAL MODELLING & SOFTWARE, 2012, 27-28 :52-61
[6]   How Interpretable and Trustworthy are GAMs? [J].
Chang, Chun-Hao ;
Tan, Sarah ;
Lengerich, Ben ;
Goldenberg, Anna ;
Caruana, Rich .
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, :95-105
[7]  
Copernicus Climate Change Service (C3S), 2017, ERA5 5 GEN ECMWF ATM
[8]  
Davison J., 2024, Openairmaps: Create Maps of Air Pollution Data
[9]   Rapid ventilation of the Mexico City basin and regional fate of the urban plume [J].
de Foy, B. ;
Varela, J. R. ;
Molina, L. T. ;
Molina, M. J. .
ATMOSPHERIC CHEMISTRY AND PHYSICS, 2006, 6 :2321-2335
[10]   Interpretable diurnal impacts on extreme urban PM2.5 concentrations of soil temperature, soil water content, humidity and temperature inversion [J].
de Foy, Benjamin ;
Schauer, James J. .
ATMOSPHERIC RESEARCH, 2024, 307