Air quality predictions with a semi-supervised bidirectional LSTM neural network

被引:112
作者
Zhang, Luo [1 ,2 ]
Liu, Peng [1 ]
Zhao, Lei [3 ]
Wang, Guizhou [1 ]
Zhang, Wangfeng [4 ]
Liu, Jianbo [1 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing, Peoples R China
[3] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 100083, Peoples R China
[4] Chinese Acad Sci, Technol & Engn Ctr Space Utilizat, Key Lab Space Utilizat, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Air quality; PM2.5; predictions; Bidirectional long short-term memory; Empirical mode decomposition; Semi-supervised; SHORT-TERM-MEMORY; YANGTZE-RIVER DELTA; REMOTE-SENSING DATA; PM2.5; CONCENTRATIONS; PARTICULATE MATTER; RANDOM FOREST; POLLUTION; CHINA; MODEL; CLASSIFICATION;
D O I
10.1016/j.apr.2020.09.003
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Efficient and accurate air quality predictions can contribute to public health protection and policy decision making. Fine particulate matter (PM2.5) is an important index for measuring and controlling the degree of air pollution. Recent studies have obtained satisfactory PM2.5 predictions by designing complex models or adding numerous auxiliary data sets to models, and few studies have effectively extracted the spatiotemporal features of PM2.5 time-series data. In this study, a semi-supervised model was proposed for predicting PM2.5 concentrations. The approach includes empirical mode decomposition (EMD) and bidirectional long short-term memory (BiLSTM) neural networks. This model only requires PM2.5 time-series data as inputs, which are regarded as signal data. EMD is applied as an unsupervised feature learning method to decompose the data and extract the frequency and amplitude features. This approach improved short-term trend predictions, especially for sudden changes. BiLSTM was used in the supervised learning stage. Beijing hourly and daily PM2.5 datasets collected from the China National Environmental Monitoring Centre were used to validate the prediction performance of the proposed model. The results demonstrated that this model was more accurate than the other standard LSTMbased model, with four better indicator values at the hourly (RMSE: 6.86 mu g.m(-3), MAE: 4.92 mu g.m(-3), MAPE: 10.66%, R-2: 0.989) and daily (RMSE: 22.58 mu g.m(-3), MAE: 16.67 mu g.m(-3), MAPE: 60.87%, R-2: 0.742) scales. Furthermore, this study proposed a new method of multiscale PM2.5 predictions by reconstructing hourly PM2.5 datasets to form multi-hour datasets. This method could reduce error accumulation in PM2.5 multi-step predictions using LSTM-based models and captured at least 70% of the explained variance in this study, demonstrating the feasibility of the model.
引用
收藏
页码:328 / 339
页数:12
相关论文
共 56 条
[1]   An ensemble long short-term memory neural network for hourly PM2.5 concentration forecasting [J].
Bai, Yun ;
Zeng, Bo ;
Li, Chuan ;
Zhang, Jin .
CHEMOSPHERE, 2019, 222 :286-294
[2]   Particulate Matter Air Pollution and Cardiovascular Disease An Update to the Scientific Statement From the American Heart Association [J].
Brook, Robert D. ;
Rajagopalan, Sanjay ;
Pope, C. Arden, III ;
Brook, Jeffrey R. ;
Bhatnagar, Aruni ;
Diez-Roux, Ana V. ;
Holguin, Fernando ;
Hong, Yuling ;
Luepker, Russell V. ;
Mittleman, Murray A. ;
Peters, Annette ;
Siscovick, David ;
Smith, Sidney C., Jr. ;
Whitsel, Laurie ;
Kaufman, Joel D. .
CIRCULATION, 2010, 121 (21) :2331-2378
[3]   Air pollution and health [J].
Brunekreef, B ;
Holgate, ST .
LANCET, 2002, 360 (9341) :1233-1242
[4]   Review of the governing equations, computational algorithms, and other components of the models-3 Community Multiscale Air Quality (CMAQ) modeling system [J].
Byun, Daewon ;
Schere, Kenneth L. .
APPLIED MECHANICS REVIEWS, 2006, 59 (1-6) :51-77
[5]   A practical trial of landslide detection from single-temporal Landsat8 images using contour-based proposals and random forest: a case study of national Nepal [J].
Chen, Fang ;
Yu, Bo ;
Li, Bin .
LANDSLIDES, 2018, 15 (03) :453-464
[6]   Extraction of Glacial Lake Outlines in Tibet Plateau Using Landsat 8 Imagery and Google Earth Engine [J].
Chen, Fang ;
Zhang, Meimei ;
Tian, Bangsen ;
Li, Zhen .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (09) :4002-4009
[7]   A machine learning method to estimate PM2.5 concentrations across China with remote sensing, meteorological and land use information [J].
Chen, Gongbo ;
Li, Shanshan ;
Knibbs, Luke D. ;
Hamm, N. A. S. ;
Cao, Wei ;
Li, Tiantian ;
Guo, Jianping ;
Ren, Hongyan ;
Abramson, Michael J. ;
Guo, Yuming .
SCIENCE OF THE TOTAL ENVIRONMENT, 2018, 636 :52-60
[8]  
Chen Z., 2015, ADV NEUR IN, P28
[9]   Long-term trend of haze pollution and impact of particulate matter in the Yangtze River Delta, China [J].
Cheng, Zhen ;
Wang, Shuxiao ;
Jiang, Jingkun ;
Fu, Qingyan ;
Chen, Changhong ;
Xu, Bingye ;
Yu, Jianqiao ;
Fu, Xiao ;
Hao, Jiming .
ENVIRONMENTAL POLLUTION, 2013, 182 :101-110
[10]   Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases [J].
Dominici, F ;
Peng, RD ;
Bell, ML ;
Pham, L ;
McDermott, A ;
Zeger, SL ;
Samet, JM .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2006, 295 (10) :1127-1134