Evaluation of different machine learning approaches for predicting high concentration episodes of ground-level ozone: A case study in Catalonia, Spain

被引:6
作者
Vicente, D. J. [1 ]
Salazar, F. [1 ,2 ]
Lopez-Chacon, S. R. [1 ]
Soriano, C. [1 ]
Martin-Vide, J. [3 ]
机构
[1] Int Ctr Numer Methods Engn CIMNE, Barcelona 08034, Spain
[2] Univ Politecn Catalunya UPC, Flumen Res Inst, Barcelona 08034, Spain
[3] Univ Barcelona, Dept Geog, IdRA Climatol Grp, Barcelona, Spain
关键词
Ozone; Air pollution; Machine learning; High ozone episodes; Random forest; SUPPORT VECTOR MACHINE; SURFACE-OZONE; SPATIOTEMPORAL PREDICTION; CHINA; MODEL; CLASSIFICATION; POLLUTION;
D O I
10.1016/j.apr.2023.101999
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Ground-level ozone (O-3) is a pollutant with a great impact on human health and the environment. As a secondary air contaminant of photochemical origin, those areas with greater exposure to solar radiation, such as Spain and other Mediterranean countries, are considerably affected. With the aggravation of O-3 pollution, it is important to provide reliable forecasting tools to help stakeholders implement more effective policies to mitigate the negative impact associated with this problem. In this regard, Machine Learning-based models have emerged in recent years, since they are able to identify complex relationships between ozone levels and relevant variables. However, their application to capture the most extreme events remains difficult. In this work, different ML approaches for predicting daily maximum 8-h average ozone (O-3,O-MDA8) were compared, investigating their ability to forecast the highest concentration levels recorded. Two variants of the Random Forest algorithm (regression and classification) were applied to a specific area of Catalonia, Spain, with a special interest due to the high number of episodes of exceedance of O-3 concentration levels. The predictive models were built with a 1 day time horizon, using datasets from 2002 to 2020. The variables used as inputs were other air pollutants concentrations and meteorological processes, monitored the day before to the target day to be predicted, and time information. Although results showed reasonable overall performances, low accuracy was achieved when forecasting the highest episodes of O-3,O-MDA8. To improve the capacity of the models in predicting high-O-3,O-MDA8 concentration levels, a methodology was proposed to fine-tuning the original predictions of the ML models according to a classification metric, G-Mean, which allows adjusting the balance between the correct predictions of different classes. Using the Sensitivity and Specificity metrics, the classical approaches were compared with the original ones proposed in the present study. The results obtained, for all the cases analysed, showed a mean increase in Sensitivity of 0.28, associated with a greater number of True Positives (correct predictions of high O-3-episodes). On the other hand, the average Specificity value decreased, due to the appearance of a greater number of False Positives, although this reduction was only 0.05. The proposed criteria showed promising results, better balancing classification metrics and increasing the ratio of correct predictions linked to the higher ranges of O-3.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Evaluation of different machine learning algorithms for predicting the length of stay in the emergency departments: a single-centre study
    Ricciardi, Carlo
    Marino, Marta Rosaria
    Trunfio, Teresa Angela
    Majolo, Massimo
    Romano, Maria
    Amato, Francesco
    Improta, Giovanni
    FRONTIERS IN DIGITAL HEALTH, 2024, 5
  • [42] A comparative study of different machine learning approaches for predicting cutting force and surface roughness during ultrasonic-assisted milling
    El-Taybany, Yasmine
    Elhendawy, Ghada A.
    INTERNATIONAL JOURNAL OF MANUFACTURING RESEARCH, 2024, 19 (03)
  • [43] Comparative study of different machine learning approaches for predicting the compressive strength of palm fuel ash concrete
    Kellouche, Yasmina
    Tayeh, Bassam A.
    Chetbani, Yazid
    Zeyad, Abdullah M.
    Mostafa, Sahar A.
    JOURNAL OF BUILDING ENGINEERING, 2024, 88
  • [44] Two-Parameter Central Fitting Distribution to Predict the Concentration of Ground Level Ozone: Case Study in Industrial Area
    Hamid, Hazrul Abdul
    Jaffar, Ismail
    Raffee, Ahmad Fauzi
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS, ENGINEERING AND INDUSTRIAL APPLICATIONS 2018 (ICOMEIA 2018), 2018, 2013
  • [45] Applying Fuzzy Inference and Machine Learning Methods for Prediction with a Small Dataset: A Case Study for Predicting the Consequences of Oil Spills on a Ground Environment
    Burmakova, Anastasiya
    Kalibatiene, Diana
    APPLIED SCIENCES-BASEL, 2022, 12 (16):
  • [46] Prediction of atmospheric carbon monoxide concentration utilizing different machine learning algorithms: A case study in Kuala Lumpur, Malaysia
    Latif, Sarmad Dashti
    Almalayih, Mustafa
    Yafouz, Ayman
    Ahmed, Ali Najah
    Zaini, Nuratiah
    Irwan, Dani
    AlDahoul, Nouar
    Sherif, Mohsen
    El-Shafie, Ahmed
    ENVIRONMENTAL TECHNOLOGY & INNOVATION, 2023, 32
  • [47] Evaluating different machine learning models for predicting municipal solid waste generation: a case study of Malaysia
    Sarmad Dashti Latif
    Nur Alyaa Binti Hazrin
    Mohammad K. Younes
    Ali Najah Ahmed
    Ahmed Elshafie
    Environment, Development and Sustainability, 2024, 26 : 12489 - 12512
  • [48] Evaluating different machine learning models for predicting municipal solid waste generation: a case study of Malaysia
    Latif, Sarmad Dashti
    Hazrin, Nur Alyaa Binti
    Younes, Mohammad K.
    Ahmed, Ali Najah
    Elshafie, Ahmed
    ENVIRONMENT DEVELOPMENT AND SUSTAINABILITY, 2024, 26 (05) : 12489 - 12512
  • [49] Evaluating different machine learning models for predicting municipal solid waste generation: a case study of Malaysia
    Latif, Sarmad Dashti
    Hazrin, Nur Alyaa Binti
    Younes, Mohammad K.
    Ahmed, Ali Najah
    Elshafie, Ahmed
    ENVIRONMENT DEVELOPMENT AND SUSTAINABILITY, 2024, 26 (05) : 12489 - 12512
  • [50] High-Resolution Daily Spatiotemporal Distribution and Evaluation of Ground-Level Nitrogen Dioxide Concentration in the Beijing-Tianjin-Hebei Region Based on TROPOMI Data
    Liu, Chunhui
    Wu, Sensen
    Dai, Zhen
    Wang, Yuanyuan
    Du, Zhenhong
    Liu, Xingyu
    Qiu, Chunxia
    REMOTE SENSING, 2023, 15 (15)