Forecasting PM2.5 concentration levels using shallow machine learning models on the Monterrey Metropolitan Area in Mexico

被引:3
|
作者
Pozo-Luyo, Cesar Alejandro [1 ]
Cruz-Duarte, Jorge M. [1 ]
Amaya, Ivan [1 ]
Ortiz-Bayliss, Jose Carlos [1 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Ave Eugenio Garza Sada 2501, Monterrey 64700, Nuevo Leon, Mexico
关键词
Air quality forecasting; PM2.5; forecasting; Machine learning; Regression; METEOROLOGICAL CONDITIONS; AIR-QUALITY; EXPOSURE;
D O I
10.1016/j.apr.2023.101898
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The Monterrey Metropolitan Area is one of the most densely populated and polluted regions in Latin America. Hence, providing early warnings to the population when pollutant concentrations reach high levels is critical. This allows people at higher health risk to make informed decisions about when to go out, mitigating future health complications. Using forecasting models, we can produce timely warnings for future concentration levels. In this work, we implement a set of short-term shallow machine learning models that would serve as a baseline for future forecasting analyses of PM2.5 concentration levels in the Monterrey Metropolitan Area. The proposed approach starts with multiple imputation through chained equations for missing value imputation, the incorporation of time metadata, and target winsorization. Then, we rely on the well-known random search for parameter optimization of the machine learning models and k-fold cross-validation, obtaining favorable results. We devise these models for a single-step and single-station analysis on an hourly multivariate air quality dataset (containing 77203 rows and 16 columns from the first hour of January 1, 2015 00:00:00 to April 17, 2022 23:00:00) and compare them using standard regression metrics. Therefore, we identify the forecasting model with the best performance, which was an Extra Trees Regressor with a Root Mean Squared Error of 0.013, a Mean Absolute Error of 0.006 (equivalent to a Mean Absolute Percentage Error of 0.294% and a Symmetric Mean Absolute Percentage Error of 0.078%), and a Maximum Error of 0.187 mu g/m(3).
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Estimation of PM2.5 Concentrations in New York State: Understanding the Influence of Vertical Mixing on Surface PM2.5 Using Machine Learning
    Hung, Wei-Ting
    Lu, Cheng-Hsuan
    Alessandrini, Stefano
    Kumar, Rajesh
    Lin, Chin-An
    ATMOSPHERE, 2020, 11 (12) : 1 - 21
  • [22] A Development of PM2.5 Forecasting System in South Korea Using Chemical Transport Modeling and Machine Learning
    Koo, Youn-Seo
    Kwon, Hee-Yong
    Bae, Hyosik
    Yun, Hui-Young
    Choi, Dae-Ryun
    Yu, SukHyun
    Wang, Kyung-Hui
    Koo, Ji-Seok
    Lee, Jae-Bum
    Choi, Min-Hyeok
    Lee, Jeong-Beom
    ASIA-PACIFIC JOURNAL OF ATMOSPHERIC SCIENCES, 2023, 59 (05) : 577 - 595
  • [23] PM2.5 concentration simulation by hybrid machine learning based on image features
    Ma, Minjin
    Zhao, Zhenzhu
    Ma, Yuzhan
    Cao, Yidan
    Kang, Guoqiang
    FRONTIERS IN EARTH SCIENCE, 2025, 13
  • [24] A Development of PM2.5 Forecasting System in South Korea Using Chemical Transport Modeling and Machine Learning
    Youn-Seo Koo
    Hee-Yong Kwon
    Hyosik Bae
    Hui-Young Yun
    Dae-Ryun Choi
    SukHyun Yu
    Kyung-Hui Wang
    Ji-Seok Koo
    Jae-Bum Lee
    Min-Hyeok Choi
    Jeong-Beom Lee
    Asia-Pacific Journal of Atmospheric Sciences, 2023, 59 : 577 - 595
  • [25] PM2.5 concentration forecasting using ANFIS, EEMD-GRNN, MLP, and MLR models: a case study of Tehran, Iran
    Amanollahi, Jamil
    Ausati, Shadi
    AIR QUALITY ATMOSPHERE AND HEALTH, 2020, 13 (02) : 161 - 171
  • [26] Forecasting PM10 Concentrations in the Caribbean Area Using Machine Learning Models
    Plocoste, Thomas
    Laventure, Sylvio
    ATMOSPHERE, 2023, 14 (01)
  • [27] Predicting PM2.5 Concentration in the Yangtze River Delta Region Using Climate System Monitoring Indices and Machine Learning
    Ma, Jinghui
    Wan, Shiquan
    Xu, Shasha
    Wang, Chanjuan
    Qiu, Danni
    JOURNAL OF METEOROLOGICAL RESEARCH, 2024, 38 (02) : 249 - 261
  • [28] Predicting Fine Particulate Matter (PM2.5) in the Greater London Area: An Ensemble Approach using Machine Learning Methods
    Yazdi, Mahdieh Danesh
    Kuang, Zheng
    Dimakopoulou, Konstantina
    Barratt, Benjamin
    Suel, Esra
    Amini, Heresh
    Lyapustin, Alexei
    Katsouyanni, Klea
    Schwartz, Joel
    REMOTE SENSING, 2020, 12 (06)
  • [29] Machine Learning-Based Estimation of PM2.5 Concentration Using Ground Surface DoFP Polarimeters
    Takruri, Maen
    Abubakar, Abubakar
    Jallad, Abdul-Halim
    Altawil, Basel
    Marpu, Prashanth R.
    Bermak, Amine
    IEEE ACCESS, 2022, 10 : 23489 - 23496
  • [30] PM2.5 CONCENTRATION PREDICTION USING DEEP LEARNING IN AIR MONITORING
    Huang, Yi
    FRESENIUS ENVIRONMENTAL BULLETIN, 2021, 30 (12): : 13200 - 13211