Forecasting PM2.5 concentration levels using shallow machine learning models on the Monterrey Metropolitan Area in Mexico

被引:3
|
作者
Pozo-Luyo, Cesar Alejandro [1 ]
Cruz-Duarte, Jorge M. [1 ]
Amaya, Ivan [1 ]
Ortiz-Bayliss, Jose Carlos [1 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Ave Eugenio Garza Sada 2501, Monterrey 64700, Nuevo Leon, Mexico
关键词
Air quality forecasting; PM2.5; forecasting; Machine learning; Regression; METEOROLOGICAL CONDITIONS; AIR-QUALITY; EXPOSURE;
D O I
10.1016/j.apr.2023.101898
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The Monterrey Metropolitan Area is one of the most densely populated and polluted regions in Latin America. Hence, providing early warnings to the population when pollutant concentrations reach high levels is critical. This allows people at higher health risk to make informed decisions about when to go out, mitigating future health complications. Using forecasting models, we can produce timely warnings for future concentration levels. In this work, we implement a set of short-term shallow machine learning models that would serve as a baseline for future forecasting analyses of PM2.5 concentration levels in the Monterrey Metropolitan Area. The proposed approach starts with multiple imputation through chained equations for missing value imputation, the incorporation of time metadata, and target winsorization. Then, we rely on the well-known random search for parameter optimization of the machine learning models and k-fold cross-validation, obtaining favorable results. We devise these models for a single-step and single-station analysis on an hourly multivariate air quality dataset (containing 77203 rows and 16 columns from the first hour of January 1, 2015 00:00:00 to April 17, 2022 23:00:00) and compare them using standard regression metrics. Therefore, we identify the forecasting model with the best performance, which was an Extra Trees Regressor with a Root Mean Squared Error of 0.013, a Mean Absolute Error of 0.006 (equivalent to a Mean Absolute Percentage Error of 0.294% and a Symmetric Mean Absolute Percentage Error of 0.078%), and a Maximum Error of 0.187 mu g/m(3).
引用
收藏
页数:11
相关论文
共 50 条
  • [1] An Improved Weight Optimization of Hybrid Machine Learning Models for Forecasting Daily PM2.5 Concentration
    Ratchagit, Manlika
    CONTEMPORARY MATHEMATICS, 2024, 5 (03): : 3953 - 3970
  • [2] Time series forecasting of ozone levels in the Metropolitan Area of Monterrey, Mexico
    Iglesias-Gonzalez, S.
    Huertas, M.
    Hernandez-Paniagua, I
    Mendoza, A.
    15TH INTERNATIONAL CONFERENCE ON ATMOSPHERIC SCIENCES AND APPLICATIONS TO AIR QUALITY, 2020, 489
  • [3] Platinum concentration in PM2.5 in the Mexico City Metropolitan Area: relationship to meteorological conditions
    Garza-Galindo, Rodrigo
    Morton-Bermea, Ofelia
    Hernandez-Alvarez, Elizabeth
    Ordonez-Godinez, Sara L.
    Amador-Munoz, Omar
    Beramendi-Orosco, Laura
    Retama-Hernandez, Armando
    Miranda, Javier
    Rosas-Perez, Irma
    HUMAN AND ECOLOGICAL RISK ASSESSMENT, 2020, 26 (05): : 1164 - 1174
  • [4] Evaluation of Machine Learning Models for Ozone Concentration Forecasting in the Metropolitan Valley of Mexico
    Dominguez-Garcia, Rodrigo
    Arellano-Vazquez, Magali
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [5] Temporal heterogeneity in the performance of machine learning models for PM2.5 concentration estimation
    Li, Peizheng
    Huang, Shiqi
    Luo, Chenxi
    Li, Xiangying
    Zhang, Qingyu
    Wang, Jing
    Yang, Can
    Yang, Haomin
    Liao, Jianpeng
    Chen, Qihao
    Ma, Lu
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2024, 189 : 977 - 984
  • [6] Evaluation of Time Series Forecasting Models for Estimation of PM2.5 Levels in Air
    Garg, Satvik
    Jindal, Himanshu
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [7] Forecasting PM2.5 levels in Santiago de Chile using deep learning neural networks
    Menares, Camilo
    Perez, Patricio
    Parraguez, Santiago
    Fleming, Zoe L.
    URBAN CLIMATE, 2021, 38
  • [8] Evaluation of Different Machine Learning Approaches to Forecasting PM2.5 Mass Concentrations
    Karimian, Hamed
    Li, Qi
    Wu, Chunlin
    Qi, Yanlin
    Mo, Yuqin
    Chen, Gong
    Zhang, Xianfeng
    Sachdeva, Sonali
    AEROSOL AND AIR QUALITY RESEARCH, 2019, 19 (06) : 1400 - 1410
  • [9] A machine learning-based model to estimate PM2.5 concentration levels in Delhi's atmosphere
    Kumar, Saurabh
    Mishra, Shweta
    Singh, Sunil Kumar
    HELIYON, 2020, 6 (11)
  • [10] A Machine Learning Based PM2.5 Forecasting Framework Using Internet of Environmental Things
    Mahajan, Sachit
    Liu, Hao-Min
    Chen, Ling-Jyh
    Tsai, Tzu-Chieh
    IOT AS A SERVICE, IOTAAS 2017, 2018, 246 : 170 - 176