A data-driven approach for PM2.5 estimation in a metropolis: random forest modeling based on ERA5 reanalysis data

被引:2
作者
Gundogdu, Serdar [1 ]
Elbir, Tolga [2 ]
机构
[1] Dokuz Eylul Univ, Bergama Vocat Sch, Dept Comp Technol, Izmir, Turkiye
[2] Dokuz Eylul Univ, Fac Engn, Dept Environm Engn, Buca Izmir, Turkiye
来源
ENVIRONMENTAL RESEARCH COMMUNICATIONS | 2024年 / 6卷 / 03期
关键词
PM2.5; estimation; random forest; ERA5; reanalysis; Ankara; CRITERIA AIR-POLLUTANTS; CHINA; COMBUSTION; POLLUTION; TRENDS; ANKARA; IZMIR; PM10; CITY;
D O I
10.1088/2515-7620/ad352d
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Air pollution in urban environments, particularly from fine particulate matter (PM2.5), poses significant health risks. Addressing this issue, the current study developed a Random Forest (RF) model to estimate hourly PM2.5 concentrations in Ankara, T & uuml;rkiye. Utilizing ERA5 reanalysis data, the model incorporated various meteorological and environmental variables. Over the period 2020-2021, the model's performance was validated against data from eleven air quality monitoring stations, demonstrating a robust coefficient of determination (R-2) of 0.73, signifying its strong predictive capability. Low root mean squared error (RMSE) and mean absolute error (MAE) values further affirmed the model's precision. Seasonal and temporal analysis revealed the model's adaptability, with autumn showing the highest accuracy (R-2 = 0.82) and summer the least (R-2 = 0.51), suggesting seasonal variability in predictive performance. Hourly evaluations indicated the model's highest accuracy at 23:00 (R-2 = 0.93), reflecting a solid alignment with observed data during nocturnal hours. On a monthly scale, November's predictions were the most precise (R-2 = 0.82), while May presented challenges in accuracy (R-2 = 0.49). These seasonal and monthly fluctuations underscore the complex interplay of atmospheric dynamics affecting PM2.5 dispersion. By integrating key determinants such as ambient air temperature, surface pressure, total column water vapor, boundary layer height, forecast albedo, and leaf area index, this study enhances the understanding of air pollution patterns in urban settings. The RF model's comprehensive evaluation across time scales offers valuable insights for policymakers and environmental health practitioners, supporting evidence-based strategies for air quality management.
引用
收藏
页数:18
相关论文
共 67 条
[1]   An ensemble multi-step-ahead forecasting system for fine particulate matter in urban areas [J].
Ahani, Ida Kalate ;
Salari, Majid ;
Shadman, Alireza .
JOURNAL OF CLEANER PRODUCTION, 2020, 263
[2]   Estimating fine particulate concentration using a combined approach of linear regression and artificial neural network [J].
Ahmad, Maqbool ;
Alam, Khan ;
Tariq, Shahina ;
Anwar, Sajid ;
Nasir, Jawad ;
Mansha, Muhammad .
ATMOSPHERIC ENVIRONMENT, 2019, 219
[3]   Future Health Risk Assessment of Exposure to PM2.5 in Different Age Groups of Children in Northern Thailand [J].
Amnuaylojaroen, Teerachai ;
Parasin, Nichapa .
TOXICS, 2023, 11 (03)
[4]   Fifteen-year trends in criteria air pollutants in oil sands communities of Alberta, Canada [J].
Bari, Md. ;
Kindzierski, Warren B. .
ENVIRONMENT INTERNATIONAL, 2015, 74 :200-208
[5]   The Impact of Fine Particulate Matter 2.5 on the Cardiovascular System: A Review of the Invisible Killer [J].
Basith, Shaherin ;
Manavalan, Balachandran ;
Shin, Tae Hwan ;
Park, Chan Bae ;
Lee, Wang-Soo ;
Kim, Jaetaek ;
Lee, Gwang .
NANOMATERIALS, 2022, 12 (15)
[6]   Impact of synoptic patterns and meteorological elements on the wintertime haze in the Beijing-Tianjin-Hebei region, China from 2013 to 2017 [J].
Bei, Naifang ;
Li, Xiaopei ;
Tie, Xuexi ;
Zhao, Linna ;
Wu, Jiarui ;
Li, Xia ;
Liu, Lang ;
Shen, Zhenxing ;
Li, Guohui .
SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 704
[7]  
Bera Biswajit, 2021, Environ Chall (Amst), V4, P100155, DOI [10.1016/j.envc.2021.100155, 10.1016/j.envc.2021.100155]
[8]   Spatial prediction of PM10 concentration using machine learning algorithms in Ankara, Turkey [J].
Bozdag, Asli ;
Dokuz, Yesim ;
Gokcek, Oznur Begum .
ENVIRONMENTAL POLLUTION, 2020, 263
[9]   SEDE-GPS: socio-economic data enrichment based on GPS information [J].
Sperlea, Theodor ;
Fueser, Stefan ;
Boenigk, Jens ;
Heider, Dominik .
BMC BIOINFORMATICS, 2018, 19
[10]   Data-driven interpretable ensemble learning methods for the prediction of wind turbine power incorporating SHAP analysis [J].
Cakiroglu, Celal ;
Demir, Sercan ;
Ozdemir, Mehmet Hakan ;
Aylak, Batin Latif ;
Sariisik, Gencay ;
Abualigah, Laith .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237