Lift-Per-Drift: An Evaluation Metric for Classification Frameworks with Concept Drift Detection

被引:1
作者
Anderson, Robert [1 ]
Koh, Yun Sing [1 ]
Dobbie, Gillian [1 ]
机构
[1] Univ Auckland, Dept Comp Sci, Auckland, New Zealand
来源
AI 2018: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2018年 / 11320卷
关键词
Data streams; Concept drift; Evaluation; Classification; STREAMING DATA;
D O I
10.1007/978-3-030-03991-2_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data streams with concept drift change over time. Detecting drift allows remedial action, but this can come at a cost e.g. training a new classifier. Prequential accuracy is commonly used to evaluate the impact of drift detection frameworks on data stream classification, but recent work shows frequent periodic drift detection can provide better accuracy than state-of-the-art drift detection techniques. We discuss how sequentiality, the degree of consecutive matching class labels across instances, allows high accuracy without a classifier learning to differentiate classes. We propose a novel metric: lift-per-drift (lpd). This measures drift detection performance through its impact on classification accuracy, penalised by drifts detected in a dataset. This metric solves three problems: lpd cannot be increased by periodic, frequent drifts; lpd clearly shows when using drift detection increases classifier error; and lpd does not require knowledge of where real drifts occurred. We show how lpd can be set to be sensitive to the cost of each drift. Our experiments show lpd is not artificially increased through sequentiality; that lpd highlights when drift detection has caused a loss in accuracy; and that it is sensitive to change in true-positive drift and false-positive drift detection rates.
引用
收藏
页码:630 / 642
页数:13
相关论文
共 11 条
[1]  
[Anonymous], 2014, ACM SIGKDD explorations newsletter
[2]   Classifier Concept Drift Detection and the Illusion of Progress [J].
Bifet, Albert .
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2017, PT II, 2017, 10246 :715-725
[3]   Efficient Online Evaluation of Big Data Stream Classifiers [J].
Bifet, Albert ;
Morales, Gianmarco De Francisci ;
Read, Jesse ;
Holmes, Geoff ;
Pfahringer, Bernhard .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :59-68
[4]  
Bifet A, 2010, LECT NOTES ARTIF INT, V6332, P1, DOI 10.1007/978-3-642-16184-1_1
[5]   Online and Non-Parametric Drift Detection Methods Based on Hoeffding's Bounds [J].
Frias-Blanco, Isvani ;
del Campo-Avila, Jose ;
Ramos-Jimenez, Gonzalo ;
Morales-Bueno, Rafael ;
Ortiz-Diaz, Agustin ;
Caballero-Mota, Yaile .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (03) :810-823
[6]  
Gama J, 2004, LECT NOTES ARTIF INT, V3171, P286
[7]  
Gama J, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P329
[8]  
Hoens TR, 2012, PROG ARTIF INTELL, V1, P89, DOI 10.1007/s13748-011-0008-0
[9]  
Tsymbal A, 2004, PROBLEM CONCEPT DRIF, V106
[10]   Evaluation methods and decision theory for classification of streaming data with temporal dependence [J].
Zliobaite, Indre ;
Bifet, Albert ;
Read, Jesse ;
Pfahringer, Bernhard ;
Holmes, Geoff .
MACHINE LEARNING, 2015, 98 (03) :455-482