Data stream clustering algorithm based on spatial directed graph

被引:0
作者
Ren, Jiadong [1 ,2 ]
Yuan, Xuemei [1 ,2 ]
Dong, Jun [1 ,2 ]
机构
[1] College of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei 066004, China
[2] The Key Laboratory for Computer Virtual Technology and, System Integration of Hebei Province, Qinhuangdao City, 066004, China
来源
Advances in Information Sciences and Service Sciences | 2012年 / 4卷 / 20期
关键词
Clustering algorithms;
D O I
10.4156/AISS.vol4.issue20.29
中图分类号
学科分类号
摘要
Recently, the grid-density based clustering has become one of the major issues among all of the clustering approaches, it has special advantages over other clustering algorithms, such as less computation and the ability of clustering with arbitrarily shape, which are particularly useful for the data stream clustering. This paper defines a spatial directed graph named Grid-Based Graph (GBG) to store the non-empty grids in data space, and proposes a data stream clustering algorithm based on spatial directed graph GBGSClu (Grid-Based Graph Stream Clustering). GBG graph composes of vertices and directed edges, if a vertex A has a neighboring dense vertex B, and then there is a directed edge from vertex B to A in GBG. The algorithm maps the data stream into the non-empty vertices online, updates the vertices' feature vectors with the arriving of data stream, deletes the sparse vertices every gap time, generates GBG graph when the clustering quest coming and finally clusters on the current structure. The eventual clustering results can be obtained by only checking the vertices' in-degree which can reduce the computation needed in clustering. The validity and efficiency of GBGSClu algorithm have been tested and verified by clustering on real and synthetic datasets.
引用
收藏
页码:241 / 249
相关论文
empty
未找到相关数据