Clustering-based data placement in cloud computing: a predictive approach

被引:0
|
作者
Mokhtar Sellami
Haithem Mezni
Mohand Said Hacid
Mohamed Moshen Gammoudi
机构
[1] University of Jendouba,
[2] Taibah University,undefined
[3] SMART Lab,undefined
[4] ISG de Tunis,undefined
[5] Univ. Lyon,undefined
[6] University Claude Bernard Lyon 1,undefined
[7] LIRIS,undefined
[8] Higher Institute of Multimedia Arts of Manouba,undefined
[9] RIADI,undefined
来源
Cluster Computing | 2021年 / 24卷
关键词
Data placement; Resource usage; Intensive jobs; Prediction; Kernel Density Estimation; Fuzzy FCA; SOA; Autonomic computing;
D O I
暂无
中图分类号
学科分类号
摘要
Nowadays, cloud computing environments have become a natural choice to host and process a huge volume of data. The combination of cloud computing and big data frameworks is an effective way to run data-intensive applications and tasks. Also, an optimal arrangement of data partitions can improve the tasks executions, which is not the case in most big data frameworks. For example, the default distribution of data partitions in Hadoop-based clouds causes several problems, which are mainly related to the load balancing and the resource usage. In addition, most existing data placement solutions are static and lack precision in the placement of data partitions. To overcome these issues, we propose a data placement approach based on the prediction of the future resources usage. We exploit Kernel Density Estimation (KDE) and Fuzzy FCA techniques to, first, forecast the workers’ and tasks’ future resource consumption and, second, cluster data partitions and intensive jobs according to the estimated resource usage. Fuzzy FCA is also used to exclude partitions and jobs that require less resources, which will reduce the needless migrations. To allow monitoring and predicting the workers’ states and the data partitions’ consumption, we modeled the big data cluster as an autonomic service-based system. The obtained results have shown that our solution outperformed existing approaches in terms of migrations rate and resource consumption.
引用
收藏
页码:3311 / 3336
页数:25
相关论文
共 50 条
  • [41] TEMPERATURE MATRIX-BASED DATA PLACEMENT OPTIMIZATION IN EDGE COMPUTING ENVIRONMENT
    Wang, Pengwei
    Zhao, Yuying
    Huang, Hengdi
    Zhang, Zhaohui
    COMPUTING AND INFORMATICS, 2022, 41 (06) : 1465 - 1490
  • [42] iFogRep: An intelligent consistent approach for replication and placement of IoT based on fog computing
    Saleh, Safa'a S.
    Alansari, Iman
    Hamiaz, Mounira Kezadri
    Ead, Waleed
    Tarabishi, Rana A.
    Khater, Hatem
    EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (02) : 327 - 339
  • [43] BDAP: A Big Data Placement Strategy for Cloud-Based Scientific Workflows
    Ebrahimi, Mahdi
    Mohan, Aravind
    Kashlev, Andrey
    Lu, Shiyong
    2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 105 - 114
  • [44] Improving Cloud-based Online Social Network Data Placement and Replication
    Khalajzadeh, Hourieh
    Yuan, Dong
    Grundy, John
    Yang, Yun
    PROCEEDINGS OF 2016 IEEE 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2016, : 678 - 685
  • [45] Optimal Data Placement for Scientific Workflows in Cloud
    Shrivastava, Manish
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2024, 64 (04) : 501 - 517
  • [46] A prediction-Based VM consolidation approach in IaaS Cloud Data Centers
    Mandhi, Tarek
    Mezni, Haithem
    JOURNAL OF SYSTEMS AND SOFTWARE, 2018, 146 : 263 - 285
  • [47] Cellular Clustering-Based Interference-Aware Data Transmission Protocol for Underwater Acoustic Sensor Networks
    Zhang, Jie
    Cai, Mengying
    Han, Guangjie
    Qian, Yujie
    Shu, Lei
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (03) : 3217 - 3230
  • [48] A Data Placement Strategy for Data-Intensive Cloud Storage
    Ding, Jie
    Han, Haiyun
    Zhou, Aihua
    PROGRESS IN POWER AND ELECTRICAL ENGINEERING, PTS 1 AND 2, 2012, 354-355 : 896 - 900
  • [49] A data placement strategy for data-intensive applications in cloud
    Zheng P.
    Cui L.-Z.
    Wang H.-Y.
    Xu M.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (08): : 1472 - 1480
  • [50] Data Placement for Multi-Tenant Data Federation on the Cloud
    Liu, Ji
    Mo, Lei
    Yang, Sijia
    Zhou, Jingbo
    Ji, Shilei
    Xiong, Haoyi
    Dou, Dejing
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (02) : 1414 - 1429