StreamSVC: A New Approach To Cluster Large And High-Dimensional Data Streams

被引:0
|
作者
Saberi, Hasan [1 ]
Mehdiaghaei, Mohammadali [2 ]
机构
[1] ShahidBeheshti Univ Tehran, Dept ComputerSci, Tehran, Iran
[2] Azad Univ Tehran, Dept Comp Engn, Cental Branch, Tehran, Iran
来源
WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL III | 2011年
关键词
Data stream; Clustering; SVC; Labeling piece;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The data stream mining has been studied extensively in recent years. This paper is introducing a novel method to cluster high-dimensional data streams, based on famous SVC method, named StreamSVC. SVC projects the images of the data points in a high dimensional feature space, to search for the minimal enclosing sphere, then classifies the points with respect to the distance between each point's image and the central of feature sphere. In StreamSVC, for a single change in the data stream environment, the algorithm redoes the classification part. The algorithm involves only the parts of the data set which are affected during the change of stream and updates the classes in an appropriate time complexity order. Also, in order to update the clusters, in the stream process, we used some new improvements in the labeling piece of original SVC. These improvements are applied to reduce the computational costs for classification part and the cluster's labeling piece. The experimental results show both time efficiency and high accuracy for large data streams.
引用
收藏
页码:1865 / 1870
页数:6
相关论文
共 50 条
  • [1] Approximate Cluster Heat Maps of Large High-Dimensional Data
    Rathore, Punit
    Bezdek, James C.
    Kumar, Dheeraj
    Rajasegarar, Sutharshan
    Palaniswami, Marimuthu
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 195 - 200
  • [2] Monitoring of high-dimensional and high-frequency data streams: A nonparametric approach
    Wang, Zhiqiong
    Li, Xin
    Wang, Ying
    Ma, Yanhui
    Xue, Li
    QUALITY TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2024,
  • [3] Detecting Projected Outliers in High-Dimensional Data Streams
    Zhang, Ji
    Gao, Qigang
    Wang, Hai
    Liu, Qing
    Xu, Kai
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 629 - +
  • [4] Online Pattern Mining for High-Dimensional Data Streams
    Yamamoto, Yoshitaka
    Iwanuma, Koji
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2880 - 2882
  • [5] Generalized projected clustering in high-dimensional data streams
    Wang, T
    FRONTIERS OF WWW RESEARCH AND DEVELOPMENT - APWEB 2006, PROCEEDINGS, 2006, 3841 : 772 - 778
  • [6] Visualizing the Finer Cluster Structure of Large-Scale and High-Dimensional Data
    Liang, Yu
    Chaudhuri, Arin
    Wang, Haoyu
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 361 - 372
  • [7] A new approach of subgroup identification for high-dimensional longitudinal data
    Yue, Mu
    Huang, Lei
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2020, 90 (11) : 2098 - 2116
  • [8] On-line monitoring data quality of high-dimensional data streams
    Qi, Dequan
    Li, Zhonghua
    Wang, Zhaojun
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (11) : 2204 - 2216
  • [9] Approximate single linkage cluster analysis of large data sets in high-dimensional spaces
    Eddy, WF
    Mockus, A
    Oue, SG
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1996, 23 (01) : 29 - 43
  • [10] Parallel Clustering of High-Dimensional Social Media Data Streams
    Gao, Xiaoming
    Ferrara, Emilio
    Qiu, Judy
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 323 - 332