Urban ozone variability using automated machine learning: inference from different feature importance schemes

被引:2
作者
Nath, Sankar Jyoti [1 ]
Girach, Imran A. [2 ]
Harithasree, S. [3 ,4 ]
Bhuyan, Kalyan [5 ]
Ojha, Narendra [3 ]
Kumar, Manish [1 ]
机构
[1] Ctr Environm & Energy Dev, Ranchi 834001, India
[2] Indian Space Res Org, Space Applicat Ctr, Ahmadabad 380015, India
[3] Phys Res Lab, Ahmadabad 380009, India
[4] Indian Inst Technol, Gandhinagar 382055, India
[5] Dibrugarh Univ, Ctr Atmospher Studies, Dibrugarh 786004, India
关键词
Air quality; Modelling; Ozone; Precursors; Meteorology; Artificial intelligence; Machine learning; AutoML; XGBoost; Random Forest; Air pollution; Atmospheric chemistry; SURFACE OZONE; POLLUTION; EXPOSURE; WEATHER; CHINA; SITE; O-3;
D O I
10.1007/s10661-024-12549-7
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Tropospheric ozone is an air pollutant at the ground level and a greenhouse gas which significantly contributes to the global warming. Strong anthropogenic emissions in and around urban environments enhance surface ozone pollution impacting the human health and vegetation adversely. However, observations are often scarce and the factors driving ozone variability remain uncertain in the developing regions of the world. In this regard, here, we conducted machine learning (ML) simulations of ozone variability and comprehensively examined the governing factors over a major urban environment (Ahmedabad) in western India. Ozone precursors (NO2, NO, CO, C5H8 and CH2O) from the CAMS (Copernicus Atmosphere Monitoring Service) reanalysis and meteorological parameters from the ERA5 (European Centre for Medium-Range Weather Forecast's (ECMWF) fifth-generation reanalysis) were included as features in the ML models. Automated ML (AutoML) fitted the deep learning model optimally and simulated the daily ozone with root mean square error (RMSE) of similar to 2 ppbv reproducing 84-88% of variability. The model performance achieved here is comparable to widely used ML models (RF-Random Forest and XGBoost-eXtreme Gradient Boosting). Explainability of the models is discussed through different schemes of feature importance, including SAGE (Shapley Additive Global importancE) and permutation importance. The leading features are found to be different from different feature importance schemes. We show that urban ozone could be simulated well (RMSE = 2.5 ppbv and R-2 = 0.78) by considering first four leading features, from different schemes, which are consistent with ozone photochemistry. Our study underscores the need to conduct science-informed analysis of feature importance from multiple schemes to infer the roles of input variables in ozone variability. AutoML-based studies, exploiting potentials of long-term observations, can strongly complement the conventional chemistry-transport modelling and can also help in accurate simulation and forecast of urban ozone.
引用
收藏
页数:15
相关论文
共 60 条
  • [1] Processes governing the surface ozone over a tropical hill station in the Western Ghats
    Ajayakumar, Revathy S.
    Girach, Imran A.
    Soni, Meghna
    Ojha, Narendra
    Babu, S. Suresh
    [J]. ATMOSPHERIC ENVIRONMENT, 2024, 319
  • [2] On the understanding of surface ozone variability, its precursors and their associations with atmospheric conditions over the Delhi region
    Anshika
    Kunchala, Ravi Kumar
    Attada, Raju
    Vellore, Ramesh K.
    Soni, Vijay K.
    Mohan, Manju
    Chilukoti, Nagaraju
    [J]. ATMOSPHERIC RESEARCH, 2021, 258
  • [3] Urban population exposure to tropospheric ozone: A multi-country forecasting of SOMO35 using artificial neural networks
    Antanasijevic, Davor
    Pocajt, Viktor
    Peric-Grujic, Aleksandra
    Ristic, Mirjana
    [J]. ENVIRONMENTAL POLLUTION, 2019, 244 : 288 - 294
  • [4] Importance of ozone precursors information in modelling urban surface ozone variability using machine learning algorithm
    Balamurugan, Vigneshkumar
    Balamurugan, Vinothkumar
    Chen, Jia
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Calvert J.G., 2015, The Mechanisms of Reactions Influencing Atmospheric Ozone, DOI [10.1093/oso/9780190233020.001.0001, DOI 10.1093/OSO/9780190233020.001.0001]
  • [7] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [8] The influence of temperature on ozone production under varying NOx conditions - a modelling study
    Coates, Jane
    Mar, Kathleen A.
    Ojha, Narendra
    Butler, Tim M.
    [J]. ATMOSPHERIC CHEMISTRY AND PHYSICS, 2016, 16 (18) : 11601 - 11615
  • [9] Covert I, 2020, Arxiv, DOI arXiv:2004.00668
  • [10] Challenges and design choices for global weather and climate models based on machine learning
    Dueben, Peter D.
    Bauer, Peter
    [J]. GEOSCIENTIFIC MODEL DEVELOPMENT, 2018, 11 (10) : 3999 - 4009