Efficient clustering in data mining applications based on harmony search and k-medoids

被引:0
作者
Ranjbar Noshari, Moein [1 ]
Azgomi, Hossein [2 ]
Asghari, Ali [1 ]
机构
[1] Department of Computer Engineering, Shafagh Institute of Higher Education, Tonekabon
[2] Department of Computer Engineering, Rasht Branch, Islamic Azad University, Rasht
关键词
Clustering; Data mining; Harmony search; k-Medoids; Metaheuristic algorithm;
D O I
10.1007/s00500-024-10337-6
中图分类号
学科分类号
摘要
There are now ever-increasing amounts of digital data stored daily. Processing such big data requires new sciences such as data mining. Data mining aims to find the information and knowledge hidden in these massive amounts of data. Data mining has various branches and applications, one of which is clustering. In clustering, data are classified into several clusters. The data of one cluster are similar to each other but different from those of other clusters. Processing time is considered a major challenge, given the use of big data in data mining applications. Metaheuristic algorithms can solve large-scale time-consuming problems in acceptable durations. The harmony search is one of such algorithms. This paper proposes a hybrid method for data clustering by integrating the harmony search algorithm (HSA) with k-medoids, which is a clustering algorithm. Based on k-medoids, the proposed method uses real data samples as cluster centroids to eliminate the effects of noise data. Furthermore, the HSA reduces the data clustering duration. In addition, trees social relations optimization algorithm (TSR) was used to increase the quality of the solutions in the proposed method. The proposed method was compared practically with the other techniques. According to the comparison results, the proposed method outperformed similar techniques in terms of the solution fitness and Dunn and Silhouette criteria. The temporal outputs of the proposed method were also acceptable. The best results obtained for the proposed method in the form of fitness of the final solution is equal to 1.02, execution time is equal to 2.9562 s, Dunn index is equal to 0.067 and Silhouette is equal to 1. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
引用
收藏
页码:13245 / 13268
页数:23
相关论文
共 37 条
[1]  
Abdulkhaleq M.T., Rashid T.A., Alsadoon A., Hassan B.A., Mohammadi M., Abdullah J.M., Chhabra A., Ali S.L., Othman R.N., Hasan H.A., Azad S., Vimal S., Harmony search: current studies and uses on healthcare systems, Artif Intell Med, (2022)
[2]  
Ala'a A., Alsewari A.A., Alamri H.S., Zamli K.Z., Comprehensive review of the development of the harmony search algorithm and its applications, IEEE Access, 7, pp. 14233-14245, (2019)
[3]  
Alam T., Qamar S., Dixit A., Benaida M., Genetic Algorithm: Reviews, Implementations, and Applications, (2020)
[4]  
Alimoradi M., Azgomi H., Asghari A., Trees social relations optimization algorithm: a new swarm-based metaheuristic technique to solve continuous and discrete optimization problems, Math Comput Simul, 194, pp. 629-664, (2022)
[5]  
Al-Otaibi S., Cherappa V., Thangarajan T., Shanmugam R., Ananth P., Arulswamy S., Hybrid K-medoids with energy-efficient sunflower optimization algorithm for wireless sensor networks, Sustainability, 15, 7, (2023)
[6]  
Azgomi H., Sohrabi M.K., A novel coral reefs optimization algorithm for materialized view selection in data warehouse environments, Appl Intell, 49, 11, pp. 3965-3989, (2019)
[7]  
Azgomi H., Sohrabi M.K., MR-MVPP: a map-reduce-based approach for creating MVPP in data warehouses for big data applications, Inf Sci, 570, pp. 200-224, (2021)
[8]  
Daliri A., Asghari A., Azgomi H., Alimoradi M., The water optimization algorithm: a novel metaheuristic for solving optimization problems, Appl Intell, 52, 15, pp. 17990-18029, (2022)
[9]  
Daviran M., Ghezelbash R., Niknezhad M., Maghsoudi A., Ghaeminejad H., Hybridizing K-means clustering algorithm with harmony search and artificial bee colony optimizers for intelligence mineral prospectivity mapping, Earth Sci Inform, (2023)
[10]  
Delahaye D., Chaimatanan S., Mongeau M., Simulated annealing: from basics to applications, Handb Metaheuristics, (2019)