A smart intelligent approach based on hybrid group search and pelican optimization algorithm for data stream clustering

Cited by: 0
Authors
Swathi Agarwal
C. R. K. Reddy
Affiliations
[1] Osmania University,Department of Computer Science and Engineering
[2] CVR College of Engineering,Department of IT
[3] Mahatma Gandhi Institute of Technology,Department of Computer Science and Engineering
Keywords
Data stream clustering; K-means clustering; Hybrid group search pelican optimization; Micro-cluster formation; Cluster organization; Cluster optimization
DOI
Not available
Abstract
Big data applications generate large volumes of evolving, real-time, high-dimensional streaming data, and clustering such streams efficiently and effectively is a major challenge in data mining. Several clustering techniques have been proposed for stream data, but most handle cluster dynamics in only a restricted way. A data stream is a sequence of arriving data, and clustering it involves several factors that classical clustering does not face. The stream is mostly unbounded and every data point must be examined at least once, which increases processing time and memory requirements. In addition, the clusters and their statistical properties vary over time, and streams can be noisy. To address these challenges, this work proposes a novel data stream clustering approach built on a hybrid meta-heuristic model. First, a data stream is collected and micro-clusters are formed with the K-Means Clustering (KMC) technique. The micro-clusters are then merged and sorted, with cluster optimization performed by Hybrid Group Search Pelican Optimization (HGSPO). The objective is to maximize clustering accuracy through radius, distance, and similarity measures, whose thresholds are optimized. In the training phase, a clustering threshold is fixed for each cluster. When new data enters the stream clustering model, its output is compared with the output of the training data, and the data is forwarded to the appropriate cluster based on the assigned threshold with minimum similarity. A performance analysis against various clustering and heuristic algorithms, using standard performance metrics, confirms the clustering quality of the proposed system.
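
The pipeline outlined in the abstract (micro-clusters from k-means, then threshold-based routing of new points) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the HGSPO threshold optimization and the merge and sort steps are not reproduced, and the names `MicroCluster`, `assign`, `radius_factor`, and `min_radius` are hypothetical. The thresholds here are fixed by hand, whereas the paper optimizes them, and k-means is seeded with the first k points purely to keep the example deterministic.

```python
import math


def dist(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean(points):
    """Component-wise mean of a non-empty list of tuples."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def kmeans(points, k, iters=20):
    """Plain Lloyd's k-means; returns the non-empty groups of points.

    Assumption: centroids start at the first k points (deterministic,
    but not a robust initialization for real data).
    """
    centroids = list(points[:k])
    groups = []
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: dist(p, centroids[j]))
            groups[nearest].append(p)
        # Keep the old centroid if a group ended up empty.
        centroids = [mean(g) if g else centroids[j] for j, g in enumerate(groups)]
    return [g for g in groups if g]


class MicroCluster:
    """Hypothetical summary of a cluster: point count, centroid, radius."""

    def __init__(self, points):
        self.n = len(points)
        self.centroid = mean(points)
        self.radius = max(dist(p, self.centroid) for p in points)

    def absorb(self, p):
        # Incremental centroid update; the radius is left unchanged for simplicity.
        self.n += 1
        self.centroid = tuple(c + (x - c) / self.n for c, x in zip(self.centroid, p))


def assign(clusters, p, radius_factor=1.5, min_radius=0.5):
    """Route p to the nearest micro-cluster if it lies within that cluster's
    distance threshold; otherwise open a new micro-cluster.

    radius_factor and min_radius are hand-picked stand-ins for the
    thresholds that the paper tunes with HGSPO. Returns the cluster index.
    """
    idx = min(range(len(clusters)), key=lambda j: dist(p, clusters[j].centroid))
    nearest = clusters[idx]
    threshold = max(nearest.radius * radius_factor, min_radius)
    if dist(p, nearest.centroid) <= threshold:
        nearest.absorb(p)
        return idx
    clusters.append(MicroCluster([p]))
    return len(clusters) - 1
```

To use the sketch, build the initial micro-clusters from a training batch with `[MicroCluster(g) for g in kmeans(batch, k)]`, then call `assign(clusters, point)` for each arriving point. A point that falls outside every threshold starts a new micro-cluster, which is one simple way to accommodate clusters that appear over time.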
Pages: 2467-2500
Page count: 33