AnyNovel: detection of novel concepts in evolving data streams

被引:42
作者
Abdallah, Zahraa S. [1 ]
Gaber, Mohamed Medhat [2 ]
Srinivasan, Bala [1 ]
Krishnaswamy, Shonali [3 ]
机构
[1] Monash Univ, Fac Informat Technol, Melbourne, Vic 3004, Australia
[2] Robert Gordon Univ, Sch Comp Sci & Digital Media, Aberdeen AB9 1FR, Scotland
[3] Inst Infocomm Res I2R, Singapore, Singapore
关键词
Stream mining; Concept evolution; Activity recognition; Continuous learning; Active learning; Novelty detection;
D O I
10.1007/s12530-016-9147-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A data stream is a flow of unbounded data that arrives continuously at high speed. In a dynamic streaming environment, the data changes over the time while stream evolves. The evolving nature of data causes essentially the appearance of new concepts. This novel concept could be abnormal such as fraud, network intrusion, or a sudden fall. It could also be a new normal concept that the system has not seen/trained on before. In this paper we propose, develop, and evaluate a technique for concept evolution in evolving data streams. The novel approach continuously monitors the movement of the streaming data to detect any emerging changes. The technique is capable of detecting the emergence of any novel concepts whether they are normal or abnormal. It also applies a continuous and active learning for assimilating the detected concepts in real time. We evaluate our approach on activity recognition domain as an application of evolving data streams. The study of the novel technique on benchmarked datasets showed its efficiency in detecting new concepts and continuous adaptation with low computational cost.
引用
收藏
页码:73 / 93
页数:21
相关论文
共 41 条
[31]   A review of novelty detection [J].
Pimentel, Marco A. F. ;
Clifton, David A. ;
Clifton, Lei ;
Tarassenko, Lionel .
SIGNAL PROCESSING, 2014, 99 :215-249
[32]   Incremental local outlier detection for data streams [J].
Pokrajac, Dragojub ;
Lazarevic, Aleksandar ;
Latecki, Longin Jan .
2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, :504-515
[33]   Activity identification using body-mounted sensors-a review of classification techniques [J].
Preece, Stephen J. ;
Goulermas, John Y. ;
Kenney, Laurence P. J. ;
Howard, Dave ;
Meijer, Kenneth ;
Crompton, Robin .
PHYSIOLOGICAL MEASUREMENT, 2009, 30 (04) :R1-R33
[34]  
Rashidi P., 2010, Proceedings 2010 10th IEEE International Conference on Data Mining (ICDM 2010), P431, DOI 10.1109/ICDM.2010.40
[35]  
Roggen D, 2009, WORLD WIR MOB MULT N, P1, DOI DOI 10.1109/WOWMOM.2009.5282442
[36]  
Schlimmer J. C., 1986, Machine Learning, V1, P317, DOI 10.1007/BF00116895
[37]   Fully unsupervised fault detection and identification based on recursive density estimation and self-evolving cloud-based classifier [J].
Sielly Jales Costa, Bruno ;
Angelov, Plamen Parvanov ;
Guedes, Luiz Affonso .
NEUROCOMPUTING, 2015, 150 :289-303
[38]  
Spinosa EJ, 2007, APPLIED COMPUTING 2007, VOL 1 AND 2, P448, DOI 10.1145/1244002.1244107
[39]  
Witten Ian H, 2005, DATA MINING PRACTICA
[40]  
Yang Y., 2002, P 8 ACM SIGKDD INT C, P688, DOI DOI 10.1145/775047.775150