Clustering-based data placement in cloud computing: a predictive approach

被引:0
|
作者
Mokhtar Sellami
Haithem Mezni
Mohand Said Hacid
Mohamed Moshen Gammoudi
机构
[1] University of Jendouba,
[2] Taibah University,undefined
[3] SMART Lab,undefined
[4] ISG de Tunis,undefined
[5] Univ. Lyon,undefined
[6] University Claude Bernard Lyon 1,undefined
[7] LIRIS,undefined
[8] Higher Institute of Multimedia Arts of Manouba,undefined
[9] RIADI,undefined
来源
Cluster Computing | 2021年 / 24卷
关键词
Data placement; Resource usage; Intensive jobs; Prediction; Kernel Density Estimation; Fuzzy FCA; SOA; Autonomic computing;
D O I
暂无
中图分类号
学科分类号
摘要
Nowadays, cloud computing environments have become a natural choice to host and process a huge volume of data. The combination of cloud computing and big data frameworks is an effective way to run data-intensive applications and tasks. Also, an optimal arrangement of data partitions can improve the tasks executions, which is not the case in most big data frameworks. For example, the default distribution of data partitions in Hadoop-based clouds causes several problems, which are mainly related to the load balancing and the resource usage. In addition, most existing data placement solutions are static and lack precision in the placement of data partitions. To overcome these issues, we propose a data placement approach based on the prediction of the future resources usage. We exploit Kernel Density Estimation (KDE) and Fuzzy FCA techniques to, first, forecast the workers’ and tasks’ future resource consumption and, second, cluster data partitions and intensive jobs according to the estimated resource usage. Fuzzy FCA is also used to exclude partitions and jobs that require less resources, which will reduce the needless migrations. To allow monitoring and predicting the workers’ states and the data partitions’ consumption, we modeled the big data cluster as an autonomic service-based system. The obtained results have shown that our solution outperformed existing approaches in terms of migrations rate and resource consumption.
引用
收藏
页码:3311 / 3336
页数:25
相关论文
共 50 条
  • [21] A clustering-based adaptive undersampling ensemble method for highly unbalanced data classification
    Yuan, Xiaohan
    Sun, Chuan
    Chen, Shuyu
    APPLIED SOFT COMPUTING, 2024, 159
  • [22] DCCP: an effective data placement strategy for data-intensive computations in distributed cloud computing systems
    Tao Wang
    Shihong Yao
    Zhengquan Xu
    Shan Jia
    The Journal of Supercomputing, 2016, 72 : 2537 - 2564
  • [23] DCCP: an effective data placement strategy for data-intensive computations in distributed cloud computing systems
    Wang, Tao
    Yao, Shihong
    Xu, Zhengquan
    Jia, Shan
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (07) : 2537 - 2564
  • [24] Graphical-based data placement algorithm for cloud workflow
    Zhang, Peng
    Wang, Guiling
    Han, Yanbo
    Wang, Jing
    High Technology Letters, 2014, 20 (02) : 179 - 186
  • [25] Graphical-based data placement algorithm for cloud workflow
    张鹏
    Wang Guiling
    Han Yanbo
    Wang Jing
    HighTechnologyLetters, 2014, 20 (02) : 179 - 186
  • [26] A clustering based coscheduling strategy for efficient scientific workflow execution in cloud computing
    Deng, Kefeng
    Ren, Kaijun
    Song, Junqiang
    Yuan, Dong
    Xiang, Yang
    Chen, Jinjun
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (18) : 2523 - 2539
  • [27] Forecasting Bus Passenger Flows by Using a Clustering-Based Support Vector Regression Approach
    Li, Chuan
    Wang, Xiaodan
    Cheng, Zhiwei
    Bai, Yun
    IEEE ACCESS, 2020, 8 : 19717 - 19725
  • [28] A novel clustering-based purity and distance imputation for handling medical data with missing values
    Cheng, Ching-Hsue
    Huang, Shu-Fen
    SOFT COMPUTING, 2021, 25 (17) : 11781 - 11801
  • [29] Tourists Flow Prediction by Clustering-Based GRNN
    Hu, Yuting
    Xie, Rong
    Zhang, Wenjun
    ADVANCES ON DIGITAL TELEVISION AND WIRELESS MULTIMEDIA COMMUNICATIONS, 2012, 331 : 396 - 402
  • [30] An Energy-saving Data Transmission Approach based on Migrating Virtual Machine Technology to Cloud Computing
    Reddy, Pundru Chandra Shaker
    Sucharitha, Yadala
    RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2024, 17 (06) : 573 - 581