Big Data Clustering Techniques Challenges and Perspectives: Review

被引:0
作者
Awad F.H. [1 ]
Hamad M.M. [1 ]
机构
[1] College of Computer Science and Information Technology, University of Anbar, Anbar
来源
Informatica (Slovenia) | 2023年 / 47卷 / 06期
关键词
Big data; clustering; data mining; machine learning;
D O I
10.31449/inf.v47i6.4445
中图分类号
学科分类号
摘要
Clustering in big data is considered a critical data mining and analysis technique. There are issues with adapting clustering algorithms to large amounts of data and new challenges brought by big data. As the size of big data is up to petabytes of data, and clustering methods have high processing costs, the challenge is how to handle this issue and utilize clustering techniques for big data efficiently. This study aims to investigate the recent advancement of clustering platforms and techniques to handle big data issues, from the early suggested techniques to today's novel solutions. The methodology and specific issues for building an effective clustering mechanism are presented and evaluated, followed by a discussion of the choices for enhancing clustering algorithms. A brief literature review of the recent advancement in clustering techniques has been presented to address each solution's main characteristics and drawbacks. © 2023 Slovene Society Informatika. All rights reserved.
引用
收藏
页码:203 / 218
页数:15
相关论文
共 77 条
  • [1] Alshamma Omran, Awad Fouad H, Alzubaidi Laith, Fadhel Mohammed A, Arkah Zinah Mohsin, Farhan Laith, Employment of multi-classifier and multi-domain features for pcg recognition, 2019 12th International Conference on Developments in eSystems Engineering (DeSE), pp. 321-325, (2019)
  • [2] Alzubaidi Laith, Al-Amidie Muthana, Al- Asadi Ahmed, Humaidi Amjad J, Al-Shamma Omran, Fadhel Mohammed A, Zhang Jinglan, Santamara J, Duan Ye, Novel transfer learning approach for medical imaging with limited labeled data, Cancers, 13, 7
  • [3] Alzubaidi Laith, Fadhel Mohammed A, Al- Shamma Omran, Zhang Jinglan, Duan Ye, Deep learning models for classification of red blood cells in microscopy images to aid in sickle cell anemia diagnosis, Electronics, 9, 3, (2020)
  • [4] Alzubaidi Laith, Fadhel Mohammed A, Al- Shamma Omran, Zhang Jinglan, Santamaria J, Duan Ye, Robust application of new deep learning tools: an experimental study in medical imaging, Multimedia Tools and Applications, pp. 1-29, (2021)
  • [5] Alzubaidi Laith, Hasan Reem Ibrahim, Awad Fouad H, Fadhel Mohammed A, Alshamma Omran, Zhang Jinglan, Multi-class breast cancer classification by a novel two-branch deep convolutional neural network architecture, 2019 12th International Conference on Developments in eSystems Engineering (DeSE), pp. 268-273, (2019)
  • [6] Alzubaidi Laith, Zhang Jinglan, Humaidi Amjad J, Al-Dujaili Ayad, Duan Ye, Al-Shamma Omran, Santamaria J, Fadhel Mohammed A, Al-Amidie Muthana, Farhan Laith, Review of deep learning: Concepts, cnn architectures, challenges, applications, future directions, Journal of big Data, 8, 1, pp. 1-74, (2021)
  • [7] Asri Hiba, Mousannif Hajar, Moatassime Hassan Al, Noel Thomas, Big data in healthcare: challenges and opportunities, pp. 1-7, (2015)
  • [8] Assefi Mehdi, Behravesh Ehsun, Liu Guangchi, Tafti Ahmad P, Big data machine learning using apache spark mllib, 2017 ieee international conference on big data (big data), pp. 3492-3498, (2017)
  • [9] Awad Fouad H, Hamad Murtadha M, Improved k-means clustering algorithm for big data based on distributed smartphone neural engine processor, Electronics, 11, 6, (2022)
  • [10] Awad Fouad H, Hamad Murtadha M, Alzubaidi Laith, Robust classification and detection of big medical data using advanced parallel k-means clustering, yolov4, and logistic regression, Life, 13, 3, (2023)