A cost-sensitive active learning algorithm: toward imbalanced time series forecasting

被引:0
作者
Jing Zhang
Qun Dai
机构
[1] Nanjing University of Aeronautics and Astronautics,College of Computer Science and Technology
来源
Neural Computing and Applications | 2022年 / 34卷
关键词
Time series forecasting (TSF); Data imbalance; Cost-sensitive learning (CSL); Active learning (AL);
D O I
暂无
中图分类号
学科分类号
摘要
Recently, many outstanding techniques for Time series forecasting (TSF) have been proposed. These techniques depend on necessary and sufficient data samples, which is the key to train a good predictor. Thus, an Active learning (AL) algorithmic framework based on Support vector regression (SVR) is designed for TSF, with the goal to choose the most valuable samples and reduce the complexity of the training set. To evaluate the quality of samples comprehensively, multiple essential criteria, such as informativeness, representativeness and diversity, are considered in a two clustering-based consecutive stages procedure. In addition, considering the imbalance of time series data, a range of values might be seriously under-represented but extremely important to the user. Thus, it is unreasonable to assign the same prediction cost to each sample. To address this imbalance problem, a multiple criteria cost-sensitive active learning algorithm in the virtue of weight SVR architecture, abbreviated as MAW-SVR, ad hoc for imbalanced TSF, is proposed. By introducing the cost-sensitive scheme, each sample is endowed with a penalty weight, which can be dynamically updated in the AL procedure. The experimental comparisons between MAW-SVR and the other six AL algorithms on a total of thirty time series datasets verify the effectiveness of the proposed algorithm.
引用
收藏
页码:6953 / 6972
页数:19
相关论文
共 99 条
  • [1] Contreras-Reyes JE(2020)Backcasting and forecasting time series using detrended cross-correlation analysis Physica A-Stat Mechan Appl 560 125109-291
  • [2] Idrovo-Aguirre BJ(2019)Nonstationary time series transformation methods: An experimental review Knowl-Based Syst 164 274-473
  • [3] Salles R(2006)25 years of time series forecasting Int J Forecast 22 443-86
  • [4] Belloze K(2019)An intelligent hybridization of ARIMA with machine learning models for time series forecasting Knowl-Based Syst 175 72-6077
  • [5] Porto F(2019)A novel double incremental learning algorithm for time series prediction Neural Comput Appl 31 6055-593
  • [6] Gonzalez PH(2012)Application of seasonal SVR with chaotic immune algorithm in traffic flow forecasting Neural Comput Appl 21 583-1491
  • [7] Ogasawara E(2018)Non-tuned machine learning approach for hydrological time series forecasting Neural Comput Appl 30 1479-20
  • [8] Hyndman RJ(2013)Time series forecasting by evolving artificial neural networks with genetic algorithms, differential evolution and estimation of distribution algorithm Neural Comput Appl 22 11-105
  • [9] De Gooijer JG(2002)Weighted least squares support vector machines: robustness and sparse approximation Neurocomputing 48 85-945
  • [10] Junior DSDOS(2020)Active Learning Query Strategies for Classification, Regression, and Clustering: A Survey J Comput Sci Technol 35 913-2894