An Incremental Algorithm Based on Irregular Grid for Clustering Data Stream

被引:0
作者
Yin, Guisheng [1 ]
Yu, Xiang [1 ]
Yang, Guang [1 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
来源
2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31 | 2008年
关键词
data stream; clustering; grid; data mining;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The CluStream algorithm can not process arbitrary shapes of clusters well. Having analyzed the nature of data stream, proposed the incremental clustering algorithm based on irregular grid in data stream, namely IIGStream. IIGStream has the advantage of clustering fast which traditional grid clustering algorithms has, and it adjust the structure of grid dynamically. To new coming data points, it promises the effectiveness of clustering of different cluster shapes by judging whether the grids are connected. During the process of clustering, it need not specify the number of clusters in advance, and it is not sensitive to outliers. The experiments on real datasets and synthetic datasets show the applicability and validity of IIGStream.
引用
收藏
页码:5680 / 5684
页数:5
相关论文
共 12 条
[1]  
Aggarwal C. C., 2004, P 30 INT C VER LARG, V30, P852, DOI DOI 10.1016/B978-012088469-8.50075-9
[2]  
Agrawal R., 1998, ACM SIGMOD C
[3]  
[Anonymous], 2003, VLDB C
[4]  
[Anonymous], 2002, VERY LARGE DATA BASE
[5]  
Babcock B., 2002, Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS), P1, DOI DOI 10.1145/543613.543615
[6]  
Datar M, 2002, SIAM PROC S, P635
[7]  
Golab L, 2003, SIGMOD REC, V32, P5, DOI 10.1145/776985.776986
[8]   Clustering data streams [J].
Guha, S ;
Mishra, N ;
Motwani, R ;
O'Callaghan, L .
41ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2000, :359-366
[9]   Streaming-data algorithms for high-quality clustering [J].
O'Callaghan, L ;
Mishra, N ;
Meyerson, A ;
Guha, S ;
Motwani, R .
18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, :685-694
[10]  
Zhang T., 1996, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, V25, P103, DOI [DOI 10.1145/235968.233324, 10.1145/235968.233324]