Clustering-based data placement in cloud computing: a predictive approach

被引:0
|
作者
Mokhtar Sellami
Haithem Mezni
Mohand Said Hacid
Mohamed Moshen Gammoudi
机构
[1] University of Jendouba,
[2] Taibah University,undefined
[3] SMART Lab,undefined
[4] ISG de Tunis,undefined
[5] Univ. Lyon,undefined
[6] University Claude Bernard Lyon 1,undefined
[7] LIRIS,undefined
[8] Higher Institute of Multimedia Arts of Manouba,undefined
[9] RIADI,undefined
来源
Cluster Computing | 2021年 / 24卷
关键词
Data placement; Resource usage; Intensive jobs; Prediction; Kernel Density Estimation; Fuzzy FCA; SOA; Autonomic computing;
D O I
暂无
中图分类号
学科分类号
摘要
Nowadays, cloud computing environments have become a natural choice to host and process a huge volume of data. The combination of cloud computing and big data frameworks is an effective way to run data-intensive applications and tasks. Also, an optimal arrangement of data partitions can improve the tasks executions, which is not the case in most big data frameworks. For example, the default distribution of data partitions in Hadoop-based clouds causes several problems, which are mainly related to the load balancing and the resource usage. In addition, most existing data placement solutions are static and lack precision in the placement of data partitions. To overcome these issues, we propose a data placement approach based on the prediction of the future resources usage. We exploit Kernel Density Estimation (KDE) and Fuzzy FCA techniques to, first, forecast the workers’ and tasks’ future resource consumption and, second, cluster data partitions and intensive jobs according to the estimated resource usage. Fuzzy FCA is also used to exclude partitions and jobs that require less resources, which will reduce the needless migrations. To allow monitoring and predicting the workers’ states and the data partitions’ consumption, we modeled the big data cluster as an autonomic service-based system. The obtained results have shown that our solution outperformed existing approaches in terms of migrations rate and resource consumption.
引用
收藏
页码:3311 / 3336
页数:25
相关论文
共 50 条
  • [31] Predictive service placement in cloud using deep learning and frequent subgraph mining
    Haithem Mezni
    Fatimetou Sidi Hamoud
    Faouzi Ben Charrada
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 11497 - 11516
  • [32] Predictive service placement in cloud using deep learning and frequent subgraph mining
    Mezni, Haithem
    Hamoud, Fatimetou Sidi
    Ben Charrada, Faouzi
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 14 (9) : 11497 - 11516
  • [33] Popularity-Based Data Placement With Load Balancing in Edge Computing
    Wei, Xinliang
    Wang, Yu
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (01) : 397 - 411
  • [34] A Novel Data Placement Strategy for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments
    Du, Xin
    Tang, Songtao
    Lu, Zhihui
    Wu, Jie
    Gai, Keke
    Hung, Patrick C. K.
    2020 IEEE 13TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2020), 2020, : 498 - 507
  • [35] Improving the Robustness of Local Network Alignment: Design and Extensive Assessment of a Markov Clustering-Based Approach
    Mina, Marco
    Guzzi, Pietro Hiram
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (03) : 561 - 572
  • [36] Manifold clustering-based prediction for dynamic multiobjective optimization
    Yan, Li
    Qi, Wenlong
    Qin, A. K.
    Yang, Shengxiang
    Gong, Dunwei
    Qu, Boyang
    Liang, Jing
    SWARM AND EVOLUTIONARY COMPUTATION, 2023, 77
  • [37] Clustering-Based Spatial Interpolation of Parametric Postprocessing Models
    Baran, Sandor
    Lakatos, Maria
    WEATHER AND FORECASTING, 2024, 39 (11) : 1591 - 1604
  • [38] Data Placement Strategy for Massive Data Applications based on FCA Approach
    Brahmi, Zaki
    Mili, Sahar
    Derouiche, Rihab
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [39] Design of SPRINT Parallelization of Data Mining Algorithms Based on Cloud Computing
    Song, Lei
    Zhang, Huajie
    Feng, Dongdong
    ENGINEERING LETTERS, 2022, 30 (02) : 399 - 405
  • [40] A novel cloud model based data placement strategy for data-intensive application in clouds
    Zhang, Xinxin
    Hu, Zhigang
    Zheng, Meiguang
    Li, Jia
    Yang, Liu
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 77 : 445 - 456