New fuzzy-clustering algorithm for data stream

被引:0
作者
Sun, Li-Juan [1 ,2 ]
Chen, Xiao-Dong [1 ]
Han, Chong [1 ]
Guo, Jian [1 ,2 ]
机构
[1] College of Computer, Nanjing University of Posts and Telecommunications, Nanjing
[2] Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks, Nanjing University of Posts and Telecommunications, Nanjing
来源
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology | 2015年 / 37卷 / 07期
关键词
Data mining; Data stream; Fuzzy C-Means (FCM); Micro-clustering; Weight decay;
D O I
10.11999/JEIT141415
中图分类号
学科分类号
摘要
There is a great challenge in the data stream clustering due to a limitation of time and space. In order to solve this problem, a new fuzzy-clustering algorithm, called Weight Decay Streaming Micro Clustering (WDSMC), is presented in this paper. The algorithm uses a reformed weighted Fuzzy C-Means (FCM) algorithm, and improves the quality of clustering by the structures of micro-clusters and weight-decay. Experimental results show that this algorithm has better accuracy than Stream Weight Fuzzy C-Means (SWFCM) and StreamKM++ algorithm. ©, 2015, Science Press. All right reserved.
引用
收藏
页码:1620 / 1625
页数:5
相关论文
共 15 条
[1]  
Jonathan A.S., Elaine R.F., Rodrigo C.B., Et al., Data stream clustering: a survey, ACM Computing Surveys, 46, 1, pp. 13:1-13:31, (2013)
[2]  
Shifei D., Fulin W., Jun Q., Et al., Research on data stream clustering algorithms, Artificial Intelligence Review, 43, 4, pp. 593-600, (2013)
[3]  
Tian Z., Raghu R., Miron L., BIRCH: an efficient data clustering method for very large databases, Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 103-114, (1996)
[4]  
Aggarwal C.C., Han J., Yu P.S., A framework for clustering evolving data streams, Proceedings of the 29th Conference on Very Large Data Bases, pp. 81-92, (2003)
[5]  
Chen Y., Tu L., Density-based clustering for real-time stream data, Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 133-142, (2007)
[6]  
Cao F., Ester M., Qian W., Et al., Density-based clustering over an evolving data stream with noise, Proceedings of the 16th SIAM International Conference on Data Mining, pp. 328-339, (2006)
[7]  
Ackermann M.R., Martens M., Raupach C., Et al., StreamKM++: a clustering algorithm for data streams, Journal of Experimental Algorithmics, 17, 1, pp. 2-4, (2012)
[8]  
Arthur D., Vassilvitskii S., K-means++: the advantages of careful seeding, Proceedings of the 2007 ACM-SIAM Symposium on Discrete Algorithm, pp. 1027-1035, (2007)
[9]  
Baraldi A., Blonda P., A survey of fuzzy clustering algorithms for pattern recognition, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 29, 6, pp. 778-785, (1999)
[10]  
Renxia W., Xiaoya Y., Xiaoke S., A weighted fuzzy clustering algorithm for data stream, Proceedings of the 2008 ISECS International Colloquium on Computing, Communication, Control, and Management, pp. 360-364, (2008)