Clustering Data Streams over Sliding Windows by DCA

Cited by: 3
Authors
Ta Minh Thuy [1]
Le Thi Hoai An [1, 2]
Boudjeloud-Assala, Lydia [1 ]
Affiliations
[1] Univ Lorraine, Lab Theoret & Appl Comp Sci, LITA EA 3097, F-57045 Metz, France
[2] Univ Lorraine, Lorraine Res Lab Comp Sci & Its Applicat, LORIA CNRS UMR 7503, F-54506 Nancy, France
Source
ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING | 2013 / Vol. 479
Keywords
Clustering; Data streams; Sliding windows; DCA;
DOI
10.1007/978-3-319-00293-4_6
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Mining data streams is a challenging research area in data mining with many applications. In stream models, the data are massive and evolve continuously, and they can be read only once or a small number of times. Because memory is limited, it is impossible to load the entire data set into memory. Traditional data mining techniques are not suitable for this kind of model and its applications, so new approaches that fit this setting are required. In this paper, we are interested in clustering data streams over sliding windows. We investigate an efficient clustering algorithm based on DCA (Difference of Convex functions Algorithm). Comparative experiments with clustering using the standard K-means algorithm on several real data sets are presented.
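Illustrative sketch (not part of the original record): the abstract describes clustering only the most recent points of a stream, with standard K-means as the comparison baseline. The Python below is a minimal sketch of that baseline setting only; it does not implement the paper's DCA-based algorithm. The names `kmeans` and `cluster_stream` and the parameters `window_size`, `step`, and `iters` are hypothetical and chosen for illustration.

```python
from collections import deque
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd-style K-means on a list of equal-length numeric tuples."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # initial centers drawn from the data
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            # Assign each point to its nearest center (squared Euclidean distance).
            nearest = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])),
            )
            clusters[nearest].append(p)
        # Recompute each center as the mean of its members;
        # an empty cluster keeps its previous center.
        centers = [
            tuple(sum(col) / len(members) for col in zip(*members))
            if members else centers[j]
            for j, members in enumerate(clusters)
        ]
    return centers


def cluster_stream(stream, window_size, k, step):
    """Yield (t, centers) every `step` arrivals once the window is full.

    Only the last `window_size` points are kept in memory, so older
    points are forgotten as the stream evolves.
    """
    window = deque(maxlen=window_size)  # oldest point is dropped automatically
    for t, point in enumerate(stream, start=1):
        window.append(tuple(point))
        if len(window) == window_size and t % step == 0:
            yield t, kmeans(list(window), k)


if __name__ == "__main__":
    # Toy stream whose two clusters move partway through (assumed data).
    old = [(0.0, 0.0), (10.0, 10.0)] * 10
    new = [(100.0, 100.0), (110.0, 110.0)] * 10
    for t, centers in cluster_stream(old + new, window_size=10, k=2, step=10):
        print(t, sorted(centers))
```

Because the window is a bounded `deque`, each point is read once and memory use stays fixed at `window_size` points, which matches the one-pass, limited-memory constraint the abstract states. Re-running K-means from scratch on every window is a simplification made for clarity.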
Pages: 65 - 75
Page count: 11
Related papers
50 in total
  • [31] Clustering text data streams
    Liu, Yu-Bao
    Cai, Jia-Rong
    Yin, Jian
    Fu, Ada Wai-Chee
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2008, 23 (01) : 112 - 128
  • [32] Clustering Text Data Streams
    Yu-Bao Liu
    Jia-Rong Cai
    Jian Yin
    Ada Wai-Chee Fu
    Journal of Computer Science and Technology, 2008, 23 : 112 - 128
  • [33] Clustering categorical data streams
    He, Zengyou
    Xu, Xiaofei
    Deng, Shengchun
    Huang, Joshua Zhexue
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2011, 11 (04) : 185 - 192
  • [34] Clustering Text Data Streams
    刘玉葆
    蔡嘉荣
    印鉴
    傅蔚慈
    Journal of Computer Science & Technology, 2008, 23 (01) : 112 - 128
  • [35] A Hybrid Clustering Algorithm for Outlier Detection in Data Streams
    Vijayarani, S.
    Jothi, P.
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (11) : 285 - 295
  • [36] A clustering approach for sampling data streams in sensor networks
    da Silva, Alzennyr
    Chiky, Raja
    Hebrail, Georges
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 32 (01) : 1 - 23
  • [37] Monitoring Distributed Data Streams through Node Clustering
    Barouti, Maria
    Keren, Daniel
    Kogan, Jacob
    Malinovsky, Yaakov
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014, 2014, 8556 : 149 - 162
  • [38] A clustering approach for sampling data streams in sensor networks
    Alzennyr da Silva
    Raja Chiky
    Georges Hébrail
    Knowledge and Information Systems, 2012, 32 : 1 - 23
  • [39] A clustering algorithm for multivariate data streams with correlated components
    Aletti G.
    Micheletti A.
    Journal of Big Data, 4 (1)
  • [40] Maintaining stream statistics over sliding windows
    Datar, M
    Gionis, A
    Indyk, P
    Motwani, R
    SIAM JOURNAL ON COMPUTING, 2002, 31 (06) : 1794 - 1813