A smart intelligent approach based on hybrid group search and pelican optimization algorithm for data stream clustering

被引:0
|
作者
Swathi Agarwal
C. R. K. Reddy
机构
[1] Osmania University,Department of Computer Science and Engineering
[2] CVR College of Engineering,Department of IT
[3] Mahatma Gandhi Institute of Technology,Department of Computer Science and Engineering
来源
关键词
Data stream clustering; K-means clustering; Hybrid group search pelican optimization; Micro-cluster formation; Cluster organization; Cluster optimization;
D O I
暂无
中图分类号
学科分类号
摘要
Big data applications generate a huge range of evolving, real-time, and high-dimensional streaming data. In many applications, data stream clustering regarding efficiency and effectiveness becomes challenging. A major issue in data mining is clustering of data streams. The several clustering techniques were implemented for stream data, but they are mostly quite restricted approaches to cluster dynamics. Generally, the data stream is an arrival of data sequence and also several factors are added in the clustering, which is rather than the classical clustering. For every data point, the stream is mostly unbounded and also the data has been estimated atleast once. It leads to higher processing time and an additional requirement on memory. In addition, the clusters in each data and their statistical property vary over time, and streams can be noisy. To address these challenges, this research work aims to implement a novel data stream clustering which is developed with a hybrid meta-heuristic model. Initially, a data stream is collected, and the micro-clusters are formed by the K-Means Clustering (KMC) technique. Then, the formation of micro-clusters, merge and sorting of the data clusters, where the cluster optimization is performed by the Hybrid Group Search Pelican Optimization (HGSPO). The main objective of the clustering is performed to maximize the accuracy through the radius, distance and similarity measures and then, the thresholds of these metrics are optimized. In the training phase, a stream of clustering threshold is fixed for each cluster. When new data comes into this stream clustering model, the output of training data is measured with new data output that is decided to forward the data into the appropriate clusters based on the assigned threshold with minimum similarity. Through the performance analysis and the attained results, the clustering quality of the recommended system is ensured regarding standard performance metrics by estimating with various clustering and heuristic algorithms.
引用
收藏
页码:2467 / 2500
页数:33
相关论文
共 50 条
  • [21] An evaluation of k-means as a local search operator in hybrid memetic group search optimization for data clustering
    Luciano D. S. Pacifico
    Teresa B. Ludermir
    Natural Computing, 2021, 20 : 611 - 636
  • [22] An evaluation of k-means as a local search operator in hybrid memetic group search optimization for data clustering
    Pacifico, Luciano D. S.
    Ludermir, Teresa B.
    NATURAL COMPUTING, 2021, 20 (03) : 611 - 636
  • [23] A Support Vector and K-Means Based Hybrid Intelligent Data Clustering Algorithm
    Sun, Liang
    Yoshida, Shinichi
    Liang, Yanchun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (11) : 2234 - 2243
  • [24] Research on Data Stream Clustering Based on FCM Algorithm
    Gao, Tiancheng
    Li, Aihua
    Meng, Fan
    5TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2017, 2017, 122 : 595 - 602
  • [25] IMPROVED DENSITY BASED ALGORITHM FOR DATA STREAM CLUSTERING
    Mousavi, Maryam
    Abu Bakar, Azuraliza
    JURNAL TEKNOLOGI, 2015, 77 (18): : 73 - 77
  • [26] THE CLUSTERING ALGORITHM OF EVOLUTIONAL DATA STREAM BASED ON DENSITY
    Meng, Yuyu
    Zheng, Liying
    3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE (ITCS 2011), PROCEEDINGS, 2011, : 473 - 477
  • [27] Clustering Algorithm Based on Grid and Density for Data Stream
    Wang, Lang
    Li, Haiqing
    MATERIALS SCIENCE, ENERGY TECHNOLOGY, AND POWER ENGINEERING I, 2017, 1839
  • [28] Drifted Data Stream Clustering Based on ClusTree Algorithm
    Zgraja, Jakub
    Wozniak, Michal
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 338 - 349
  • [29] Application Research of Intelligent Search System Based on Clustering Algorithm
    Liu, Tao
    Hao, Lihong
    Huang, Saixiao
    Qin, Xiaoya
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (02) : 1362 - 1373
  • [30] A hybrid group search optimization: firefly algorithm-based big data framework for ancient script recognition
    Suganya, T. S.
    Murugavalli, S.
    SOFT COMPUTING, 2020, 24 (14) : 10933 - 10941