Efficient Online Evaluation of Big Data Stream Classifiers

被引：115

作者：

Bifet, Albert ^{[1
]}

Morales, Gianmarco De Francisci ^{[2
]}

Read, Jesse ^{[3
]}

Holmes, Geoff ^{[4
]}

Pfahringer, Bernhard ^{[4
]}

机构：

[1] HUAWEI, Noahs Ark Lab, Hong Kong, Peoples R China

[2] Aalto Univ, Helsinki, Finland

[3] Aalto Univ, HIIT, Helsinki, Finland

[4] Univ Waikato, Hamilton, New Zealand

来源：

KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2015年

关键词：

Data Streams; Evaluation; Online Learning; Classification; CLASSIFICATION; AGREEMENT;

D O I：

10.1145/2783258.2783372

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The evaluation of classifiers in data streams is fundamental so that poorly-performing models can be identified, and either improved or replaced by better-performing models. This is an increasingly relevant and important task as stream data is generated from more sources, in real-time, in large quantities, and is now considered the largest source of big data. Both researchers and practitioners need to be able to effectively evaluate the performance of the methods they employ. However, there are major challenges for evaluation in a stream. Instances arriving in a data stream are usually time-dependent, and the underlying concept that they represent may evolve over time. Furthermore, the massive quantity of data also tends to exacerbate issues such as class imbalance. Current frameworks for evaluating streaming and online algorithms are able to give predictions in real-time, but as they use a prequential setting, they build only one model, and are thus not able to compute the statistical significance of results in real-time. In this paper we propose a new evaluation methodology for big data streams. This methodology addresses unbalanced data streams, data where change occurs on different time scales, and the question of how to split the data between training and testing, over multiple models.

引用

页码：59 / 68

页数：10

共 26 条

[1]

[Anonymous], 2011, BIG DATA NEXT FRONTI

[2]

[Anonymous], 2007, SDM

[3]

[Anonymous], 2014, Evaluating Learning Algorithms A Classification Perspective, DOI DOI 10.1017/CBO9780511921803

[4]

[Anonymous], 2003, P 20 INT C INT C MAC, DOI DOI 10.5555/3041838.3041845

[5]

Bifet A., 2010, Journal of Machine Learning Research (JMLR)

[6]

Bifet A, 2010, LECT NOTES ARTIF INT, V6321, P135, DOI 10.1007/978-3-642-15880-3_15

[7]

Blum A., 1999, Proceedings of the Twelfth Annual Conference on Computational Learning Theory, P203, DOI 10.1145/307400.307439

[8]

Breiman L, 1984, OLSHEN STONE CLASSIF, DOI 10.1201/9781315139470

[9] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].

COHEN, J .

EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46

[10]

Morales GD, 2015, J MACH LEARN RES, V16, P149

← 1 2 3 →