Change detection in learning histograms from data streams

被引:0
作者
Sebastiao, Raquel [1 ]
Gama, Joao [1 ]
机构
[1] Univ Porto, LIAAD INESC Porto LA, P-4050190 Oporto, Portugal
来源
PROGRESS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS | 2007年 / 4874卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we study the problem of constructing histograms from high-speed time-changing data streams. Learning in this context requires the ability to process examples once at the rate they arrive, maintaining a histogram consistent with the most recent data, and forgetting out-date data whenever a change in the distribution is detected. To construct histogram from high-speed data streams we use the two layer structure used in the Partition Incremental Discretization (PiD) algorithm. Our contribution is a new method to detect whenever a change in the distribution generating examples occurs. The base idea consists of monitoring distributions from two different time windows: the reference time window, that reflects the distribution observed in the past; and the current time window reflecting the distribution observed in the most recent data. We compare both distributions and signal a change whenever they are greater than a threshold value, using three different methods: the Entropy Absolute Difference, the Kullback-Leibler divergence and the Cosine Distance. The experimental results suggest that Kullback-Leibler divergence exhibit high probability in change detection, faster detection rates, with few false positives alarms.
引用
收藏
页码:112 / 123
页数:12
相关论文
共 13 条
[1]  
[Anonymous], INTELLIGENT DATA ANA
[2]  
CORREA M, 2007, 7 C COL ASS AUT CAL
[3]  
DASU T, 2006, INT 2006
[4]  
DOMINGOS P, 2002, ADV NEURAL INFORM PR, V14
[5]  
Gama J., 2006, Applied Computing 2006. 21st Annual ACM Symposium on Applied Computing, P662, DOI 10.1145/1141277.1141429
[6]  
Johnson D., 2004, P 2004 VLDB C, P13
[7]  
Klinkenberg R., 2004, Intelligent Data Analysis, V8, P281
[8]  
Klinkenberg R, 2000, P 17 INT C MACH LEAR, P487, DOI DOI 10.1007/978-3-540-44871-6_130
[9]  
KLINKENBERG R, 1998, LEARNING TEXT CATEGO, P33
[10]   Selecting examples for partial memory learning [J].
Maloof, MA ;
Michalski, RS .
MACHINE LEARNING, 2000, 41 (01) :27-52