Evolving data stream clustering algorithm based on the shared nearest neighbor density

被引:0
作者
Gao, Bing [1 ,2 ]
Zhang, Jian-Pei [1 ]
Zou, Qi-Jie [1 ,2 ]
机构
[1] College of Computer Science and Technology, Harbin Engineering University, Harbin
[2] Department of Computer, Dalian Neusoft Information College, Dalian
来源
Beijing Keji Daxue Xuebao/Journal of University of Science and Technology Beijing | 2014年 / 36卷 / 12期
关键词
Clustering algorithms; Data mining; Data streams; Nearest neighbors; Outliers;
D O I
10.13374/j.issn1001-053x.2014.12.018
中图分类号
学科分类号
摘要
Existing density-based data stream clustering algorithms are difficult to discover clusters with different densities and to distinguish clusters with bridges and the outliers. A novel stream clustering algorithm was proposed based on the shared nearest neighbor density. In this algorithm, the shared nearest neighbor density was defined based on the shared nearest neighbor graph, which considered the degree of data object surrounded by the nearest neighbors and the degree of data object demanded by around data objects. So the clustering result was not influenced by the density variation. The average distance of data object and the cluster density were defined to identify outliers and clusters with bridges. The updating algorithm over the sliding window was designed to maintain the renewal of clusters on the shared nearest neighbor graph. Theoretical analysis and experimental results demonstrate the performance of clustering effect and a better clustering quality. ©, 2014, University of Science and Technology Beijing. All right reserved.
引用
收藏
页码:1703 / 1711
页数:8
相关论文
共 50 条
[21]   Data stream clustering algorithm based on spatial directed graph [J].
Ren, Jiadong ;
Yuan, Xuemei ;
Dong, Jun .
Advances in Information Sciences and Service Sciences, 2012, 4 (20) :241-249
[22]   Incremental clustering algorithm based on rough reduction for data stream [J].
College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China .
Xinan Jiaotong Daxue Xuebao, 2009, 5 (637-643+653) :637-643+653
[23]   A fast density-based data stream clustering algorithm with cluster centers self-determined for mixed data [J].
Chen, Jin-Yin ;
He, Hui-Hao .
INFORMATION SCIENCES, 2016, 345 :271-293
[24]   Quick clustering algorithm for wargaming data based on density [J].
Shi C.-L. ;
Zhang M.-J. ;
Wu L. ;
Tang Y.-B. ;
Jing M. .
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2011, 33 (11) :2428-2433
[25]   A Critical Review of Density-based Data Stream Clustering Techniques [J].
Toor, Affan Ahmad ;
Usman, Muhammad ;
Ahmed, Waseem .
2016 ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM 2016), 2016, :51-61
[26]   A NEW K-NEAREST NEIGHBOR DENSITY-BASED CLUSTERING METHOD AND ITS APPLICATION TO HYPERSPECTRAL IMAGES [J].
Cariou, Claude ;
Chehdi, Kacem .
2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, :6161-6164
[27]   Hyper-ellipsoidal clustering technique for evolving data stream [J].
Rehman, Muhammad Zia-ur ;
Li, Tianrui ;
Yang, Yan ;
Wang, Hongjun .
KNOWLEDGE-BASED SYSTEMS, 2014, 70 :3-14
[28]   Incremental clustering algorithm of data stream based on artificial immune network [J].
Yue, Xun ;
Chi, Zhongxian ;
Hao, Yanyou ;
Mo, Hongwei ;
Yue, Xun .
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, :4021-+
[29]   Incremental density-based ensemble clustering over evolving data streams [J].
Khan, Imran ;
Huang, Joshua Z. ;
Ivanov, Kamen .
NEUROCOMPUTING, 2016, 191 :34-43
[30]   Clustering Data Streams Based on Shared Density between Micro-Clusters [J].
Hahsler, Michael ;
Bolanos, Matthew .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (06) :1449-1461