IWDA: Importance Weighting for Drift Adaptation in Streaming Supervised Learning Problems

被引:9
作者
Fedeli, Filippo [1 ,2 ]
Metelli, Alberto Maria [1 ,3 ]
Trovo, Francesco [1 ,3 ]
Restelli, Marcello [1 ,3 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, I-20133 Milan, Italy
[2] Amazon, Barcelona 08018, Spain
[3] ML Cube, I-20134 Milan, Italy
关键词
Adaptation models; Random forests; Estimation; Detectors; Training; Monitoring; Ensemble learning; Concept drift; data drift; drift adaptation; importance weighting (IW); nonstationarity; stream learning; ONLINE; TIME;
D O I
10.1109/TNNLS.2023.3265524
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distribution drift is an important issue for practical applications of machine learning (ML). In particular, in streaming ML, the data distribution may change over time, yielding the problem of concept drift, which affects the performance of learners trained with outdated data. In this article, we focus on supervised problems in an online nonstationary setting, introducing a novel learner-agnostic algorithm for drift adaptation, namely (), with the goal of performing efficient retraining of the learner when drift is detected. incrementally estimates the joint probability density of input and target for the incoming data and, as soon as drift is detected, retrains the learner using importance-weighted empirical risk minimization. The importance weights are computed for all the samples observed so far, employing the estimated densities, thus, using all available information efficiently. After presenting our approach, we provide a theoretical analysis in the abrupt drift setting. Finally, we present numerical simulations that illustrate how competes and often outperforms state-of-the-art stream learning techniques, including adaptive ensemble methods, on both synthetic and real-world data benchmarks.
引用
收藏
页码:6813 / 6823
页数:11
相关论文
共 58 条
[1]   Just-In-Time Classifiers for Recurrent Concepts [J].
Alippi, Cesare ;
Boracchi, Giacomo ;
Roveri, Manuel .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (04) :620-634
[2]  
Basseville M., 1993, DETECTION ABRUPT CHA, V15
[3]  
Bifet A, 2010, LECT NOTES ARTIF INT, V6321, P135, DOI 10.1007/978-3-642-15880-3_15
[4]  
Bifet A, 2009, LECT NOTES COMPUT SC, V5772, P249, DOI 10.1007/978-3-642-03915-7_22
[5]  
Bifet A, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P139
[6]  
Bifet A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P443
[7]  
Bottou Leon., 2007, NIPS
[8]   Reacting to Different Types of Concept Drift: The Accuracy Updated Ensemble Algorithm [J].
Brzezinski, Dariusz ;
Stefanowski, Jerzy .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (01) :81-94
[9]   A pdf-Free Change Detection Test Based on Density Difference Estimation [J].
Bu, Li ;
Alippi, Cesare ;
Zhao, Dongbin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (02) :324-334
[10]  
Chang-Hung Lee, 2001, Proceedings of the 2001 ACM CIKM. Tenth International Conference on Information and Knowledge Management, P263