An on-line weighted ensemble of regressor models to handle concept drifts

被引:60
作者
Soares, Symone Gomes [1 ]
Araujo, Rui
机构
[1] Univ Coimbra, Inst Syst & Robot, PT-3030290 Coimbra, Portugal
关键词
Concept drift; Ensemble learning; Learning in changing environments; Regression; Ensemble pruning strategies; NEURAL-NETWORK; PREDICTION; MIXTURE; SYSTEM; SENSOR;
D O I
10.1016/j.engappai.2014.10.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many estimation, prediction, and learning applications have a dynamic nature. One of the most important challenges in machine learning is dealing with concept changes. Underlying changes may make the model designed on old data, inconsistent with new data. Also, algorithms usually specialize in one type of change. Other challenge is reusing previously acquired information in scenarios where changes may recur. This strategy improves the learning accuracy and reduces the processing time. Unfortunately, most existing learning algorithms to deal with changes are adapted on a batch basis. This process usually requires a long time, and such data may not reflect the current state of the system. However, even the system is adapted on a sample basis, existing algorithms may adapt slowly to changes and cannot conciliate old and new information. This paper proposes an On-line Weighted Ensemble (OWE) of regressor models which is able to learn incrementally sample by sample in the presence of several types of changes and simultaneously retain old information in recurring scenarios. The key idea is to keep a moving window that slides when a new sample is available. The error of each model on the current window is determined using a boosting strategy that assigns small errors to the models that predict accurately the samples predicted poorly by the ensemble. To handle recurring and non-recurring changes, OWE uses a new assignment of models' weights that takes into account the models' errors on the past and current windows using a discounting factor that decreases or increases the contribution of old windows. In addition, OWE launches new models if the system's accuracy is decreasing, and it can exclude inaccurate models over time. Experiments with artificial and industrial data reveal that in most cases OWE outperforms other state-of-the-art concept drift approaches. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:392 / 406
页数:15
相关论文
共 49 条
[1]   A recursive PLS-based soft sensor for prediction of the melt index during grade change operations in HDPE plant [J].
Ahmed, Faisal ;
Nazir, Salman ;
Yeo, Yeong Koo .
KOREAN JOURNAL OF CHEMICAL ENGINEERING, 2009, 26 (01) :14-20
[2]  
[Anonymous], 2005, ACM International Conference Proceeding Series, DOI [DOI 10.1145/1102351.1102408, 10.1145/1102351.1102408]
[3]  
[Anonymous], THESIS TRINITY COLL
[4]  
[Anonymous], 2006, 4 INT WORKSHOP KNOWL
[5]   Variable window adaptive Kernel Principal Component Analysis for nonlinear nonstationary process monitoring [J].
Ben Khediri, Issam ;
Limam, Mohamed ;
Weihs, Claus .
COMPUTERS & INDUSTRIAL ENGINEERING, 2011, 61 (03) :437-446
[6]   Enhancing data stream predictions with reliability estimators and explanation [J].
Bosnic, Zoran ;
Demsar, Jaka ;
Kespret, Grega ;
Rodrigues, Pedro Pereira ;
Gama, Joao ;
Kononenko, Igor .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 34 :178-192
[7]   Reacting to Different Types of Concept Drift: The Accuracy Updated Ensemble Algorithm [J].
Brzezinski, Dariusz ;
Stefanowski, Jerzy .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (01) :81-94
[8]   Neural network ensembles based on copula methods and Distributed Multiobjective Central Force Optimization algorithm [J].
Chao, Meng ;
Xin, Sun Zhi ;
Min, Liu San .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 32 :203-212
[9]   Application of a PSO-based neural network in analysis of outcomes of construction claims [J].
Chau, K. W. .
AUTOMATION IN CONSTRUCTION, 2007, 16 (05) :642-646
[10]  
Cheng CT, 2005, LECT NOTES COMPUT SC, V3498, P1040