Novel MIA-LSTM Deep Learning Hybrid Model with Data Preprocessing for Forecasting of PM2.5

Cited by: 6
Authors
Narkhede, Gaurav [1]
Hiwale, Anil [1]
Tidke, Bharat [2]
Khadse, Chetan [3]
Affiliations
[1] MIT World Peace Univ, Sch Elect & Commun Engn, Pune 411038, India
[2] MIT World Peace Univ, Sch Comp Engn & Technol, Pune 411038, India
[3] MIT World Peace Univ, Sch Elect Engn, Pune 411038, India
Keywords
MIA-LSTM; data preprocessing; iterative imputation; autoencoder; LSTM; MISSING VALUES; NEURAL-NETWORK; IMPUTATION; AIR; PREDICTION
DOI
10.3390/a16010052
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Pollution in cities is rising steadily with urbanization. One of the biggest challenges posed by the rapid migration of inhabitants into cities is increased air pollution. Sustainable Development Goal 11 indicates that 99 percent of the world's urban population breathes polluted air. Under this trend, predicting pollutant concentrations in advance is very important, because such predictions would help city administrations take timely measures toward Sustainable Development Goal 11. In data engineering, imputation and outlier removal are important steps prior to forecasting air pollutant concentrations, since missing values and outliers are critical problems in pollution and meteorological data. This paper proposes a novel method called multiple iterative imputation using autoencoder-based long short-term memory (MIA-LSTM). The method first applies iterative imputation, with an extra tree regressor as the estimator, to fill the missing values in multivariate data, and then uses an LSTM autoencoder to detect and remove outliers present in the dataset. The preprocessed data are then given to a multivariate LSTM for forecasting PM2.5 concentration. This paper also presents the effect of removing outliers and missing values from the dataset, as well as the effect of imputing missing values, on forecasting the concentrations of air pollutants. The proposed method provides better forecasting results, with a root mean square error (RMSE) of 9.8883. The results were compared with the traditional gated recurrent unit (GRU), 1D convolutional neural network (CNN), and long short-term memory (LSTM) approaches on a dataset from the Aotizhongxin area of Beijing, China. Similar results were observed for two other locations in China and one location in India. The results show that imputation and outlier/anomaly removal improve the accuracy of air pollution forecasting.
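The imputation step named in the abstract (iterative imputation with an extra tree regressor as the estimator) can be illustrated with a minimal sketch. It assumes scikit-learn's IterativeImputer and ExtraTreesRegressor as stand-ins; the synthetic three-column data, the 10 percent missing rate, and all hyperparameters are hypothetical choices for illustration and are not the authors' configuration or dataset.

```python
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401 (activates IterativeImputer)
from sklearn.impute import IterativeImputer
from sklearn.ensemble import ExtraTreesRegressor

rng = np.random.default_rng(0)

# Synthetic stand-in for multivariate hourly readings (hypothetical columns):
# three correlated variables driven by one shared latent signal.
n = 200
base = rng.normal(size=n)
X_true = np.column_stack([
    60 + 20 * base + rng.normal(scale=2, size=n),
    10 - 3 * base + rng.normal(scale=1, size=n),
    1010 + 5 * base + rng.normal(scale=1, size=n),
])

# Remove roughly 10% of the entries at random to simulate missing values.
mask = rng.random(X_true.shape) < 0.10
X_missing = X_true.copy()
X_missing[mask] = np.nan

# Iterative imputation: each column with gaps is regressed on the others,
# round-robin, using an extra-trees regressor as the estimator.
imputer = IterativeImputer(
    estimator=ExtraTreesRegressor(n_estimators=50, random_state=0),
    max_iter=10,
    random_state=0,
)
X_imputed = imputer.fit_transform(X_missing)


def rmse(y_true, y_pred):
    """Root mean square error between two equally shaped arrays."""
    return float(np.sqrt(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2)))


# Error measured only on the entries that were hidden from the imputer.
imputation_rmse = rmse(X_true[mask], X_imputed[mask])
print(f"RMSE on imputed entries: {imputation_rmse:.3f}")
```

In this sketch the observed entries pass through unchanged and only the gaps are filled. The outlier-removal and forecasting stages described in the abstract (LSTM autoencoder, multivariate LSTM) are not shown here.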
Pages: 20