A Critical Review of Density-based Data Stream Clustering Techniques

被引:0
作者
Toor, Affan Ahmad [1 ]
Usman, Muhammad [1 ]
Ahmed, Waseem [2 ]
机构
[1] Shaheed Zulfikar Ali Bhutto Inst Sci & Technol, Dept Comp Sci, Karachi, Pakistan
[2] Waiariki Inst Technol, Sch Comp, Rotorua, New Zealand
来源
2016 ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM 2016) | 2016年
关键词
Data Streams; Data Stream Mining; Density-based Clustering; Density-grid clustering;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data stream is relatively new and emerging domain in the current era of Internet advancement. Clustering data streams is equally important and difficult because of the numerous hurdles attached to it. A number of algorithms have been proposed to offer solutions for efficient clustering. Grid based clustering approach was adopted few years ago to overcome the limitations of conventional partition-based algorithms for data stream clustering. Data points are mapped to the grid-cells to form micro-clusters which later are used for clustering. Using density in the clustering process is proved to be a remarkable success and in recent years many researchers have used density to find arbitrary shaped & density clusters and identify outliers. Concept of density-based clustering is to use grid-based clustering at core and create a distinction between dense and sparse grids using density threshold values and use dense grids to yield clustering results; which provide more cluster purity and accuracy. In this paper, we reviewed grid based data stream clustering algorithms which utilize density. We evaluated their functionalities and identified their limitations. In the end, we critically evaluated different aspects of algorithms and suggested one of these algorithms which is better in terms of performance and accuracy.
引用
收藏
页码:51 / 61
页数:11
相关论文
共 18 条
[1]  
Amini A., 2011, 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2011), P1652, DOI 10.1109/FSKD.2011.6019867
[2]   A Multi Density-based Clustering Algorithm for Data Stream with Noise [J].
Amini, Amineh ;
Saboohi, Hadi ;
Teh, Ying Wah .
2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, :1105-1112
[3]  
[Anonymous], 2003, P 29 INT C VER LARG
[4]  
Ching-Ming Chao, 2012, 2012 IEEE Workshops of International Conference on Advanced Information Networking and Applications (WAINA), P1203, DOI 10.1109/WAINA.2012.19
[5]  
Gao H., 2012, EVID-BASED COMPL ALT, V2012, P1, DOI DOI 10.1007/S10644-012-9137-0
[6]   Clustering Data Streams Based on Shared Density between Micro-Clusters [J].
Hahsler, Michael ;
Bolanos, Matthew .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (06) :1449-1461
[7]   Research on Parallel Data Stream Clustering Algorithm based on Grid and Density [J].
Hu, Weihua ;
Cheng, Mingzhong ;
Wu, Guoping ;
Wu, Liang .
2015 International Conference on Computer Science and Mechanical Automation (CSMA), 2015, :70-75
[8]  
Jiadong Ren, 2010, Proceedings 2010 1st ACIS Intl. Symp. on Cryptography & Network Security, Data Mining & Knowledge Discovery, E-Commerce & Its Applications and Embedded Systems (CDEE 2010), P212, DOI 10.1109/CDEE.2010.48
[9]  
Magdy A., 2011, Proceedings of the 2011 Tenth International Conference on Machine Learning and Applications (ICMLA 2011), P279, DOI 10.1109/ICMLA.2011.56
[10]  
Ntoutsi I., 2012, SDM, P987