Clustering Data Streams over Sliding Windows by DCA

Cited by: 3
Authors
Ta Minh Thuy [1]
Le Thi Hoai An [1, 2]
Boudjeloud-Assala, Lydia [1]
Affiliations
[1] Univ Lorraine, Lab Theoret & Appl Comp Sci, LITA EA 3097, F-57045 Metz, France
[2] Univ Lorraine, Lorraine Res Lab Comp Sci & Its Applicat, LORIA CNRS UMR 7503, F-54506 Nancy, France
Source
ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING | 2013 / Vol. 479
Keywords
Clustering; Data streams; Sliding windows; DCA;
DOI
10.1007/978-3-319-00293-4_6
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Mining data streams is a challenging research area in data mining and is relevant to many applications. In stream models, the data are massive and evolve continuously; they can be read only once or a small number of times. Because memory is limited, it is impossible to load the entire data set into memory. Traditional data mining techniques are not suited to this kind of model and application, so new approaches that meet these constraints are required. In this paper, we are interested in clustering data streams over sliding windows. We investigate an efficient clustering algorithm based on DCA (Difference of Convex functions Algorithm). Comparative experiments against the standard K-means algorithm on several real data sets are presented.
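To make the sliding-window setting concrete, the following is a minimal sketch of the K-means baseline mentioned in the abstract, not the DCA-based algorithm of the paper. It assumes a count-based window that keeps only the most recent points and re-clusters after a fixed number of new arrivals, warm-starting from the previous centroids. The names `kmeans`, `cluster_stream`, `window_size`, and `step` are hypothetical and chosen only for illustration.

```python
from collections import deque
import random


def kmeans(points, centroids, iters=20):
    """Plain Lloyd's K-means on a list of equal-length tuples,
    starting from the given centroids (illustrative baseline only)."""
    centroids = [tuple(c) for c in centroids]
    for _ in range(iters):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(
                range(len(centroids)),
                key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centroids[i])),
            )
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        updated = [
            tuple(sum(col) / len(c) for col in zip(*c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if updated == centroids:
            break
        centroids = updated
    return centroids


def cluster_stream(stream, k, window_size, step):
    """Yield (points_seen, centroids) each time the window is re-clustered.

    Assumption: a count-based sliding window; only the last `window_size`
    points are held in memory, and older points are discarded for good.
    """
    window = deque(maxlen=window_size)
    centroids = None
    for n, point in enumerate(stream, start=1):
        window.append(tuple(point))
        if len(window) == window_size and (n - window_size) % step == 0:
            if centroids is None:
                # First full window: pick k points as initial centroids.
                centroids = random.Random(0).sample(list(window), k)
            # Later windows: warm start from the previous centroids.
            centroids = kmeans(list(window), centroids)
            yield n, centroids
```

Each point is read once and dropped when it leaves the window, which matches the one-pass, limited-memory constraint described above. The warm start is a design choice of this sketch, on the assumption that consecutive windows overlap heavily and their cluster structure changes slowly.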
Pages: 65-75
Number of pages: 11
Related papers
50 in total
  • [21] Clustering data streams: Theory and practice
    Guha, S
    Meyerson, A
    Mishra, N
    Motwani, R
    O'Callaghan, L
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (03) : 515 - 528
  • [22] RLC: ranking lag correlations with flexible sliding windows in data streams
    Shanshan Wu
    Huaizhong Lin
    Wenxiang Wang
    Dongming Lu
    Leong Hou U
    Yunjun Gao
    Pattern Analysis and Applications, 2017, 20 : 601 - 611
  • [23] RLC: ranking lag correlations with flexible sliding windows in data streams
    Wu, Shanshan
    Lin, Huaizhong
    Wang, Wenxiang
    Lu, Dongming
    U, Leong Hou
    Gao, Yunjun
    PATTERN ANALYSIS AND APPLICATIONS, 2017, 20 (02) : 601 - 611
  • [24] Mining Frequent Item Sets in Asynchronous Transactional Data Streams over Time Sensitive Sliding Windows Model
    Javaid, Qaisar
    Memon, Farida
    Talpur, Shahnawaz
    Arif, Muhammad
    Awan, Muhammad Daud
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2016, 35 (04) : 625 - 644
  • [25] Regular Expression Pattern Matching with Sliding Windows over Probabilistic Event Streams
    Sugiura, Kento
    Ishikawa, Yoshiharu
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 103 - 110
  • [26] SPECTRA: Continuous Query Processing for RDF Graph Streams Over Sliding Windows
    Gillani, Syed
    Picard, Gauthier
    Laforest, Frederique
    28TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2016), 2016,
  • [27] Ensemble Clustering for Novelty Detection in Data Streams
    Garcia, Kemilly Dearo
    de Faria, Elaine Ribeiro
    de Sa, Claudio Rebelo
    Mendes-Moreira, Joao
    Aggarwal, Charu C.
    de Carvalho, Andre C. P. L. F.
    Kok, Joost N.
    DISCOVERY SCIENCE (DS 2019), 2019, 11828 : 460 - 470
  • [28] Suggested Techniques for Clustering and Mining of Data Streams
    Anuradha, G.
    Roy, Bidisha
    2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 265 - 270
  • [29] Clustering Data Streams: A Complex Network Approach
    Porto, Sandy
    Quiles, Marcos G.
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2019, PT I: 19TH INTERNATIONAL CONFERENCE, SAINT PETERSBURG, RUSSIA, JULY 1-4, 2019, PROCEEDINGS, PT I, 2019, 11619 : 52 - 65
  • [30] Clustering Text Data Streams
    刘玉葆
    蔡嘉荣
    印鉴
    傅蔚慈
    Journal of Computer Science & Technology, 2008, 23 (01) : 112 - 128