Evaluating traditional versus ensemble machine learning methods for predicting missing data of daily PM10 concentration

被引:4
|
作者
Kalantari, Elham [1 ]
Gholami, Hamid [1 ]
Malakooti, Hossein [2 ]
Eftekhari, Mahdi [3 ]
Saneei, Poorya [3 ]
Esfandiarpour, Donya [3 ]
Moosavi, Vahid [4 ]
Nafarzadegan, Ali Reza [1 ]
机构
[1] Univ Hormozgan, Dept Nat Resources Engn, Bandar Abbas, Hormozgan, Iran
[2] Univ Hormozgan, Fac Marine Sci & Technol, Dept Marine & Atmospher Sci Non Biol, Bandar Abbas, Iran
[3] Shahid Bahonar Univ Kerman, Dept Comp Engn, Kerman, Iran
[4] Tarbiat Modares Univ, Dept Watershed Management Engn, Noor, Mazandaran, Iran
关键词
Machine learning; PM; 10; prediction; XGBoost; Time series; Zabol; ARTIFICIAL NEURAL-NETWORKS; AIR; INTERPOLATION; EMISSIONS;
D O I
10.1016/j.apr.2024.102063
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The aim of this study was to predict the missing data of PM10 for the city of Zabol using various traditional learning methods, Lazy Learning, and Ensemble Learning. In this study, daily minimum, average, and maximum data of weather variables were collected, along with daily PM10 concentration from the Zabol airport weather station during the years 2013-2022. To compare the performance of the predictive models, R2, mean absolute error (MAE), and mean squared error (MSE) criteria were used. The reconstruction results show that collective learning models, especially XGBoost, can be effectively used to predict missing PM10 data in time series. Additionally, among ensemble learning methods, boosting algorithms provide higher accuracy in predicting missing PM10 data than packing algorithms. It was also found that, according to the results, among the traditional learning methods, lazy learning models performed better than eager learning models. In order of efficiency and accuracy for predicting PM10 missing data, the models include XGBoost, random forest (RF), Extra Trees (ET), Light gradient boosting machine (GBM), The Decision Tree regressor with the Bagging method, gradient boosting (GB), Ada Boost, Weighted K-Nearest Neighbor (WKNN), K-Nearest Neighbor (KNN), The Decision Tree Regressor with the Pasting method, artificial neural network (ANN), Decision Tree (DT), and linear regression (LR). In general, given the high processing capability and potential of collective learning methods in the field of predicting missing PM10 data, this technique is considered a useful solution for saving time, energy, and costs of collecting and measuring data. It can also replace missing data in the case of any equipment malfunction or damage. This approach can also be used to predict pollutant concentrations in weather systems.
引用
收藏
页数:11
相关论文
共 33 条
  • [1] Machine Learning Methods to Forecast the Concentration of PM10 in Lublin, Poland
    Kujawska, Justyna
    Kulisz, Monika
    Oleszczuk, Piotr
    Cel, Wojciech
    ENERGIES, 2022, 15 (17)
  • [2] Comparison of four machine learning methods for predicting PM10 concentrations in Helsinki, Finland
    Zickus, M
    Greig, AJ
    Niranjan, M
    URBAN AIR QUALITY - RECENT ADVANCES, PROCEEDINGS, 2002, : 717 - 729
  • [3] Leveraging Satellite Data for Predicting PM10 Concentration with Machine Learning Models: A Study in the Plains of North Bengal, India
    Das, Ayan
    Sahu, Manoranjan
    AEROSOL AND AIR QUALITY RESEARCH, 2024, 24 (12)
  • [4] Accuracy Analysis of Machine Learning Methods for Predicting PM Concentration
    Kim, Yeong-Il
    Lee, Kwon-Ho
    JOURNAL OF KOREAN SOCIETY FOR ATMOSPHERIC ENVIRONMENT, 2023, 39 (02) : 149 - 164
  • [5] Managing air quality: Predicting exceedances of legal limits for PM10 and O3 concentration using machine learning methods
    Krylova, Maryna
    Okhrin, Yarema
    ENVIRONMETRICS, 2022, 33 (02)
  • [6] Predicting PM10 and PM2.5 concentration in container ports: A deep learning approach
    Park, So -Young
    Woo, Su-Han
    Lim, Changwon
    TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2023, 115
  • [7] An integrated feature selection and machine learning framework for PM10 concentration prediction
    Kalantari, Elham
    Gholami, Hamid
    Malakooti, Hossein
    Kaskaoutis, Dimitris G.
    Saneei, Poorya
    ATMOSPHERIC POLLUTION RESEARCH, 2025, 16 (05)
  • [8] Modelling and Forecasting Temporal PM2.5 Concentration Using Ensemble Machine Learning Methods
    Ejohwomu, Obuks Augustine
    Shamsideen Oshodi, Olakekan
    Oladokun, Majeed
    Bukoye, Oyegoke Teslim
    Emekwuru, Nwabueze
    Sotunbo, Adegboyega
    Adenuga, Olumide
    BUILDINGS, 2022, 12 (01)
  • [9] A Case Analysis of Dust Weather and Prediction of PM10 Concentration Based on Machine Learning at the Tibetan Plateau
    Tan, Changrong
    Chen, Qi
    Qi, Donglin
    Xu, Liang
    Wang, Jiayun
    ATMOSPHERE, 2022, 13 (06)
  • [10] Research and application of a novel hybrid decomposition-ensemble learning paradigm with error correction for daily PM10 forecasting
    Luo, Hongyuan
    Wang, Deyun
    Yue, Chenqiang
    Liu, Yanling
    Guo, Haixiang
    ATMOSPHERIC RESEARCH, 2018, 201 : 34 - 45