Novel MIA-LSTM Deep Learning Hybrid Model with Data Preprocessing for Forecasting of PM2.5

被引:6
|
作者
Narkhede, Gaurav [1 ]
Hiwale, Anil [1 ]
Tidke, Bharat [2 ]
Khadse, Chetan [3 ]
机构
[1] MIT World Peace Univ, Sch Elect & Commun Engn, Pune 411038, India
[2] MIT World Peace Univ, Sch Comp Engn & Technol, Pune 411038, India
[3] MIT World Peace Univ, Sch Elect Engn, Pune 411038, India
关键词
MIA-LSTM; data preprocessing; iterative imputation; autoencoder; LSTM; MISSING VALUES; NEURAL-NETWORK; IMPUTATION; AIR; PREDICTION;
D O I
10.3390/a16010052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Day by day pollution in cities is increasing due to urbanization. One of the biggest challenges posed by the rapid migration of inhabitants into cities is increased air pollution. Sustainable Development Goal 11 indicates that 99 percent of the world's urban population breathes polluted air. In such a trend of urbanization, predicting the concentrations of pollutants in advance is very important. Predictions of pollutants would help city administrations to take timely measures for ensuring Sustainable Development Goal 11. In data engineering, imputation and the removal of outliers are very important steps prior to forecasting the concentration of air pollutants. For pollution and meteorological data, missing values and outliers are critical problems that need to be addressed. This paper proposes a novel method called multiple iterative imputation using autoencoder-based long short-term memory (MIA-LSTM) which uses iterative imputation using an extra tree regressor as an estimator for the missing values in multivariate data followed by an LSTM autoencoder for the detection and removal of outliers present in the dataset. The preprocessed data were given to a multivariate LSTM for forecasting PM2.5 concentration. This paper also presents the effect of removing outliers and missing values from the dataset as well as the effect of imputing missing values in the process of forecasting the concentrations of air pollutants. The proposed method provides better results for forecasting with a root mean square error (RMSE) value of 9.8883. The obtained results were compared with the traditional gated recurrent unit (GRU), 1D convolutional neural network (CNN), and long short-term memory (LSTM) approaches for a dataset of the Aotizhonhxin area of Beijing in China. Similar results were observed for another two locations in China and one location in India. The results obtained show that imputation and outlier/anomaly removal improve the accuracy of air pollution forecasting.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Short-Term Prediction of PM2.5 Using LSTM Deep Learning Methods
    Kristiani, Endah
    Lin, Hao
    Jwu-Rong Lin
    Yen-Hsun Chuang
    Chin-Yin Huang
    Chao-Tung Yang
    SUSTAINABILITY, 2022, 14 (04)
  • [32] A hybrid-wavelet model applied for forecasting PM2.5 concentrations in Taiyuan city, China
    Wang, Ping
    Zhang, Guisheng
    Chen, Feng
    He, Yue
    ATMOSPHERIC POLLUTION RESEARCH, 2019, 10 (06) : 1884 - 1894
  • [33] Research on a Novel Hybrid Decomposition-Ensemble Learning Paradigm Based on VMD and IWOA for PM2.5 Forecasting
    Guo, Hengliang
    Guo, Yanling
    Zhang, Wenyu
    He, Xiaohui
    Qu, Zongxi
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (03) : 1 - 20
  • [34] FedDeep: A Federated Deep Learning Network for Edge Assisted Multi-Urban PM2.5 Forecasting
    Hu, Yue
    Cao, Ning
    Guo, Wangyong
    Chen, Meng
    Rong, Yi
    Lu, Hao
    APPLIED SCIENCES-BASEL, 2024, 14 (05):
  • [35] An improved deep learning model for predicting daily PM2.5 concentration
    Xiao, Fei
    Yang, Mei
    Fan, Hong
    Fan, Guanghui
    Al-qaness, Mohammed A. A.
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [36] A PM2.5 prediction model based on deep learning and random forest
    Peng H.
    Zhou Y.
    Hu X.
    Zhang L.
    Peng Y.
    Cai X.
    National Remote Sensing Bulletin, 2023, 27 (02) : 430 - 440
  • [37] A Multi Parameter Forecasting for Stock Time Series Data Using LSTM and Deep Learning Model
    Zaheer, Shahzad
    Anjum, Nadeem
    Hussain, Saddam
    Algarni, Abeer D. D.
    Iqbal, Jawaid
    Bourouis, Sami
    Ullah, Syed Sajid
    MATHEMATICS, 2023, 11 (03)
  • [38] Apply a deep learning hybrid model optimized by an Improved Chimp Optimization Algorithm in PM2.5 prediction
    Wei, Ming
    Du, Xiaopeng
    MACHINE LEARNING WITH APPLICATIONS, 2025, 19
  • [39] Spatiotemporal integration of GCN and E-LSTM networks for PM2.5 forecasting
    Mohammadzadeh, Ali Kamali
    Salah, Halima
    Jahanmahin, Roohollah
    Hussain, Abd E. Ali
    Masoud, Sara
    Huang, Yaoxian
    MACHINE LEARNING WITH APPLICATIONS, 2024, 15
  • [40] A novel hybrid strategy for PM2.5 concentration analysis and prediction
    Jiang, Ping
    Dong, Qingli
    Li, Peizhi
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2017, 196 : 443 - 457