Several novel evaluation measures for rank-based ensemble pruning with applications to time series prediction

被引:25
作者
Ma, Zhongchen [1 ]
Dai, Qun [1 ]
Liu, Ningzhong [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Ensemble pruning; Time series prediction; Rank-based ensemble pruning; Complementarity measure for time series prediction (ComTSP); Concurrency thinning for time series prediction (ConTSP); Reduce Error pruning for time series prediction (ReTSP-Trend); Time window size;
D O I
10.1016/j.eswa.2014.07.049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble pruning is a desirable and popular method to overcome the deficiency of high computational costs of traditional ensemble learning techniques. Among various of ensemble pruning methods, rank-based pruning is conceptually the simplest and possesses performance advantage. While four evaluation measures for rank-based ensemble pruning specifically for time series prediction are proposed by us in this paper. The first one, i.e. Complementarity measure for time series prediction (ComTSP), is properly modified from Complementarity measure (COM) for classification. The design idea of ComTSP is, if the error made by the subensemble for a pruning sample is larger than that by the candidate predictor to a certain extent, it is assumed that the predictor is complementary to the subensemble. And the predictor which minimizes the error rate of subensemble on the pruning set will be selected at each selection step. The second one, i.e. Concurrency thinning for time series prediction (ConTSP), is correctly transformed from Concurrency measure (CON) for classification. With ConTSP, a predictor is rewarded for obtaining a good performance, and rewarded more for obtaining a good performance when the subensemble performs badly. A predictor is penalized when both the subensemble and itself perform poorly. The measure ReTSP-Value is specifically designed for Reduce Error (RE) pruning for time series prediction. However, ReTSP-Value and ComTSP have the same flaw that, they could not guarantee the remaining predictor which supplements the subensemble the most will be selected. The cause of this flaw is that the predictive error in time series prediction is directional. It is not reasonable for these measures to take reducing error as the only goal while ignore the error direction. While our finally proposed measure ReTSP-Trend overcomes this defect, taking into consideration the trend of time series and the direction of forecasting error. It could indeed guarantee that the remaining predictor which supplements the subensemble the most will be selected. The comparison experiments on four benchmark financial time series datasets show that the measure ReTSP-Trend outperforms the other measures, which can remarkably improve the predictive ability and promote the generalization capability of the pruned ensembles for time series forecasting. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:280 / 292
页数:13
相关论文
共 27 条
  • [1] Introduction to financial forecasting
    AbuMostafa, YS
    Atiya, AF
    [J]. APPLIED INTELLIGENCE, 1996, 6 (03) : 205 - 213
  • [2] Amorim Neto Manoel C., 2009, Proceedings 2009 International Joint Conference on Neural Networks (IJCNN 2009 - Atlanta), P149, DOI 10.1109/IJCNN.2009.5178949
  • [3] [Anonymous], J BUSINESS EC STAT
  • [4] [Anonymous], ROOT MEAN SQUARE DEV
  • [5] A new boosting algorithm for improved time-series forecasting with recurrent neural networks
    Assaad, Mohammad
    Bone, Romuald
    Cardot, Hubert
    [J]. INFORMATION FUSION, 2008, 9 (01) : 41 - 55
  • [6] Banfield R. E., 2005, Information Fusion, V6, P49, DOI 10.1016/j.inffus.2004.04.005
  • [7] Basak D., 2007, Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers, V11, P203, DOI DOI 10.1007/978-1-4302-5990-9_4
  • [8] Caruana R., 2004, P 21 INT C MACH LEAR, DOI [10.1145/1015330.1015432, DOI 10.1145/1015330.1015432]
  • [9] Empirical analysis and evaluation of approximate techniques for pruning regression bagging ensembles
    Hernandez-Lobato, Daniel
    Martinez-Munoz, Gonzalo
    Suarez, Alberto
    [J]. NEUROCOMPUTING, 2011, 74 (12-13) : 2250 - 2264
  • [10] A new class of hybrid models for time series forecasting
    Khashei, Mehdi
    Bijari, Mehdi
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (04) : 4344 - 4357