A smart intelligent approach based on hybrid group search and pelican optimization algorithm for data stream clustering

被引：0

作者：

Swathi Agarwal

C. R. K. Reddy

机构：

[1] Osmania University,Department of Computer Science and Engineering

[2] CVR College of Engineering,Department of IT

[3] Mahatma Gandhi Institute of Technology,Department of Computer Science and Engineering

来源：

Knowledge and Information Systems | 2024年 / 66卷

关键词：

Data stream clustering; K-means clustering; Hybrid group search pelican optimization; Micro-cluster formation; Cluster organization; Cluster optimization;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Big data applications generate a huge range of evolving, real-time, and high-dimensional streaming data. In many applications, data stream clustering regarding efficiency and effectiveness becomes challenging. A major issue in data mining is clustering of data streams. The several clustering techniques were implemented for stream data, but they are mostly quite restricted approaches to cluster dynamics. Generally, the data stream is an arrival of data sequence and also several factors are added in the clustering, which is rather than the classical clustering. For every data point, the stream is mostly unbounded and also the data has been estimated atleast once. It leads to higher processing time and an additional requirement on memory. In addition, the clusters in each data and their statistical property vary over time, and streams can be noisy. To address these challenges, this research work aims to implement a novel data stream clustering which is developed with a hybrid meta-heuristic model. Initially, a data stream is collected, and the micro-clusters are formed by the K-Means Clustering (KMC) technique. Then, the formation of micro-clusters, merge and sorting of the data clusters, where the cluster optimization is performed by the Hybrid Group Search Pelican Optimization (HGSPO). The main objective of the clustering is performed to maximize the accuracy through the radius, distance and similarity measures and then, the thresholds of these metrics are optimized. In the training phase, a stream of clustering threshold is fixed for each cluster. When new data comes into this stream clustering model, the output of training data is measured with new data output that is decided to forward the data into the appropriate clusters based on the assigned threshold with minimum similarity. Through the performance analysis and the attained results, the clustering quality of the recommended system is ensured regarding standard performance metrics by estimating with various clustering and heuristic algorithms.

引用

页码：2467 / 2500

页数：33

共 50 条

[21] An evaluation of k-means as a local search operator in hybrid memetic group search optimization for data clustering
Luciano D. S. Pacifico
Teresa B. Ludermir
Natural Computing, 2021, 20 : 611 - 636
[22] An evaluation of k-means as a local search operator in hybrid memetic group search optimization for data clustering
Pacifico, Luciano D. S.
Ludermir, Teresa B.
NATURAL COMPUTING, 2021, 20 (03) : 611 - 636
[23] A Support Vector and K-Means Based Hybrid Intelligent Data Clustering Algorithm
Sun, Liang
Yoshida, Shinichi
Liang, Yanchun
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (11) : 2234 - 2243
[24] Research on Data Stream Clustering Based on FCM Algorithm
Gao, Tiancheng
Li, Aihua
Meng, Fan
5TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2017, 2017, 122 : 595 - 602
[25] IMPROVED DENSITY BASED ALGORITHM FOR DATA STREAM CLUSTERING
Mousavi, Maryam
Abu Bakar, Azuraliza
JURNAL TEKNOLOGI, 2015, 77 (18): : 73 - 77
[26] THE CLUSTERING ALGORITHM OF EVOLUTIONAL DATA STREAM BASED ON DENSITY
Meng, Yuyu
Zheng, Liying
3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE (ITCS 2011), PROCEEDINGS, 2011, : 473 - 477
[27] Clustering Algorithm Based on Grid and Density for Data Stream
Wang, Lang
Li, Haiqing
MATERIALS SCIENCE, ENERGY TECHNOLOGY, AND POWER ENGINEERING I, 2017, 1839
[28] Drifted Data Stream Clustering Based on ClusTree Algorithm
Zgraja, Jakub
Wozniak, Michal
HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 338 - 349
[29] Application Research of Intelligent Search System Based on Clustering Algorithm
Liu, Tao
Hao, Lihong
Huang, Saixiao
Qin, Xiaoya
JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (02) : 1362 - 1373
[30] A hybrid group search optimization: firefly algorithm-based big data framework for ancient script recognition
Suganya, T. S.
Murugavalli, S.
SOFT COMPUTING, 2020, 24 (14) : 10933 - 10941

← 1 2 3 4 5 →