Novelty detection in data streams

被引:0
作者
Elaine R. Faria
Isabel J. C. R. Gonçalves
André C. P. L. F. de Carvalho
João Gama
机构
[1] Faculty of Computing,Institute of Mathematics and Computer Science (ICMC)
[2] Federal University of Uberlândia,Laboratory of Artificial Intelligence and Decision Support (LIAAD
[3] Instituto Politécnico de Viana do Castelo,INESC TEC)
[4] University of São Paulo,undefined
[5] University of Porto,undefined
来源
Artificial Intelligence Review | 2016年 / 45卷
关键词
Novelty detection; Data streams; Survey; Classification;
D O I
暂无
中图分类号
学科分类号
摘要
In massive data analysis, data usually come in streams. In the last years, several studies have investigated novelty detection in these data streams. Different approaches have been proposed and validated in many application domains. A review of the main aspects of these studies can provide useful information to improve the performance of existing approaches, allow their adaptation to new applications and help to identify new important issues to be addresses in future studies. This article presents and analyses different aspects of novelty detection in data streams, like the offline and online phases, the number of classes considered at each phase, the use of ensemble versus a single classifier, supervised and unsupervised approaches for the learning task, information used for decision model update, forgetting mechanisms for outdated concepts, concept drift treatment, how to distinguish noise and outliers from novelty concepts, classification strategies for data with unknown label, and how to deal with recurring classes. This article also describes several applications of novelty detection in data streams investigated in the literature and discuss important challenges and future research directions.
引用
收藏
页码:235 / 269
页数:34
相关论文
共 98 条
[1]  
Bicego M(2009)Soft clustering using weighted one-class support vector machines Pattern Recognit 42 27-32
[2]  
Figueiredo MAT(2009)Anomaly detection: a survey ACM Comput Surv 41 58-292
[3]  
Chandola V(1984)Statistical theory: the prequential approach (with discussion) J R Stat Soc A 147 278-83
[4]  
Banerjee A(2005)Learning from positive and unlabeled examples Theor Comput Sci 348 70-327
[5]  
Kumar V(2009)Adaptive concept drift detection Stat Anal Data Min 2 311-1531
[6]  
Dawid AP(2011)Incremental learning of concept drift in nonstationary environments IEEE Trans Neural Netw 22 1517-5906
[7]  
Denis F(2013)An adaptive ensemble classifier for mining concept drifting data streams Expert Syst Appl 40 5895-26
[8]  
Gilleron R(2005)Mining data streams: a review SIGMOD Rec 34 18-346
[9]  
Letouzey F(2013)On evaluating stream learning algorithms Mach Learn 90 317-588
[10]  
Dries A(2011)A survey of outlier detection methods in network anomaly identification Comput J 54 570-126