Double decomposition and optimal combination ensemble learning approach for interval-valued AQI forecasting using streaming data

被引:32
作者
Wang, Zicheng [1 ]
Chen, Liren [2 ]
Zhu, Jiaming [3 ]
Chen, Huayou [1 ]
Yuan, Hongjun [4 ]
机构
[1] Anhui Univ, Sch Math Sci, Hefei 230601, Peoples R China
[2] Tianjin Univ, Sch Environm Sci & Engn, Tianjin 300350, Peoples R China
[3] Anhui Univ, Sch Internet, Hefei 230039, Peoples R China
[4] Anhui Univ Finance & Econ, Sch Stat & Appl Math, Bengbu 233030, Peoples R China
基金
中国国家自然科学基金;
关键词
Air quality index; Interval forecasting; Bivariate empirical mode decomposition; Optimal combination ensemble; Seasonality; EMPIRICAL MODE DECOMPOSITION; AIR-POLLUTION SOURCES; HYBRID MODEL; MULTIOBJECTIVE OPTIMIZATION; PERFORMANCE EVALUATION; NEURAL-NETWORK; PM2.5; ALGORITHM; MORTALITY; COMMUNITY;
D O I
10.1007/s11356-020-09891-x
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
To forecast possible future environmental risks, numerous models are developed to predict the hourly values or daily averages of air pollutant concentrations using streaming data (a kind of big data collected from the Internet). On the one hand, real-time hourly data is massive and redundant, making it difficult to process. On the other hand, daily averages cannot reflect the fluctuations of air pollutant concentrations throughout the day. Therefore, a double decomposition and optimal combination ensemble learning approach is proposed for interval-valued AQI (air quality index) forecasting in this paper. In the first decomposition, considering the strong seasonal representation of AQI, the original data of each year is decomposed into four seasonal subseries on the basis of the Chinese calendar. Subsequently, we reconstruct the data of the same season in different years to get a new seasonal series to reduce the interference of seasonal changes on AQI forecasting. In the second decomposition, due to the nonlinearity and irregularity of interval-valued AQI time series, BEMD (bivariate empirical mode decomposition) is employed to decompose the interval-valued signals into a finite number of complex-valued IMF (intrinsic mode function) components and one complex-valued residue component with different frequencies to reduce the complexity of interval times series. Interval multilayer perceptron (iMLP) is utilized to model the lower bound and the upper bound simultaneously of the total components to obtain the corresponding forecasting results, which are merged to produce the final interval-valued output by an optimal combination ensemble method. Empirical study results show that the proposed model with different datasets and different forecasting horizons is significantly better than other considered models for its superior forecasting performances.
引用
收藏
页码:37802 / 37817
页数:16
相关论文
共 54 条
  • [1] Forecasting PM10 in Algiers: efficacy of multilayer perceptron networks
    Abderrahim, Hamza
    Chellali, Mohammed Reda
    Hamou, Ahmed
    [J]. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2016, 23 (02) : 1634 - 1641
  • [2] [Anonymous], 2000, Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data
  • [3] Addressing Global Mortality from Ambient PM2.5
    Apte, Joshua S.
    Marshall, Julian D.
    Cohen, Aaron J.
    Brauer, Michael
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2015, 49 (13) : 8057 - 8066
  • [4] Hourly PM2.5 concentration forecast using stacked autoencoder model with emphasis on seasonality
    Bai, Yun
    Li, Yong
    Zeng, Bo
    Li, Chuan
    Zhang, Jin
    [J]. JOURNAL OF CLEANER PRODUCTION, 2019, 224 : 739 - 750
  • [5] Air pollutants concentrations forecasting using back propagation neural network based on wavelet decomposition with meteorological conditions
    Bai, Yun
    Li, Yong
    Wang, Xiaoxue
    Xie, Jingjing
    Li, Chuan
    [J]. ATMOSPHERIC POLLUTION RESEARCH, 2016, 7 (03) : 557 - 566
  • [6] Multi-step-ahead time series prediction using multiple-output support vector regression
    Bao, Yukun
    Xiong, Tao
    Hu, Zhongyi
    [J]. NEUROCOMPUTING, 2014, 129 : 482 - 493
  • [7] Bishop C., 1995, Neural networks for pattern recognition
  • [8] Designing fuzzy time series forecasting models: A survey
    Bose, Mahua
    Mali, Kalyani
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2019, 111 : 78 - 99
  • [9] A review of artificial neural network models for ambient air pollution prediction
    Cabaneros, Sheen Mclean
    Calautit, John Kaiser
    Hughes, Ben Richard
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2019, 119 : 285 - 304
  • [10] Forecasting PM10 in the Bay of Algeciras Based on Regression Models
    Carlos Palomares-Salas, Jose
    Jose Gonzalez-de-la-Rosa, Juan
    Aguera-Perez, Agustin
    Maria Sierra-Fernandez, Jose
    Florencias-Oliveros, Olivia
    [J]. SUSTAINABILITY, 2019, 11 (04)