A Novel Data Placement Strategy for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments

被引:54
|
作者
Du, Xin [1 ]
Tang, Songtao [1 ]
Lu, Zhihui [1 ,2 ]
Wu, Jie [1 ,5 ]
Gai, Keke [3 ]
Hung, Patrick C. K. [4 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[2] Shanghai Blockchain Engn Res Ctr, Shanghai, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[4] Univ Ontario Inst Technol, Fac Business & IT, Oshawa, ON, Canada
[5] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai, Peoples R China
来源
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2020) | 2020年
基金
中国国家自然科学基金;
关键词
Heterogeneous edge-cloud computing environments; data placement; data-sharing; scientific workflows; MODEL;
D O I
10.1109/ICWS49710.2020.00073
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
deployment of datasets in the heterogeneous edge-cloud computing paradigm has received increasing attention in state-of-the-art research. However, due to their large sizes and the existence of private scientific datasets, finding an optimal data placement strategy that can minimize data transmission as well as improve performance, remains a persistent problem. In this study, the advantages of both edge and cloud computing are combined to construct a data placement model that works for multiple scientific workflows. Apparently, the most difficult research challenge is to provide a data placement strategy to consider shared datasets, both within individual and among multiple workflows, across various geographically distributed environments. According to the constructed model, not only the storage capacity of edge micro-datacenters, but also the data transfer between multiple clouds across regions must be considered. To address this issue, we considered the characteristics of this model and identified the factors that are causing the transmission delay. The authors propose using a discrete particle swarm optimization algorithm with differential evolution (DE-DPSO) to distribute dataset during workflow execution. Based on this, a new data placement strategy named DE-DPSO-DPS is proposed. DE-DPSO-DPS is evaluated using several experiments designed in simulated heterogeneous edge-cloud computing environments. The results demonstrate that our data placement strategy can effectively reduce the data transmission time and achieve superior performance as compared to traditional strategies for data-sharing scientific workflows.
引用
收藏
页码:498 / 507
页数:10
相关论文
共 49 条
  • [21] Task scheduling strategy based on data replication in scientific Cloud workflows
    Djebbar, Esma Insaf
    Belalem, Ghalem
    Benadda, Merien
    MULTIAGENT AND GRID SYSTEMS, 2016, 12 (01) : 55 - 67
  • [22] A data placement strategy for big data based on DCC in cloud computing systems
    Wang, Tao
    Yao, Shihong
    Xu, Zhengquan
    Jia, Shan
    Xu, Qiang
    2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 623 - 630
  • [23] The Design and Evaluation of a Strategy of Data Placement in Cloud Computing Platform
    Guo, Wei
    Luo, Kaibo
    Wang, Xinjun
    Cui, Lizhen
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2014, 7 (01) : 13 - 30
  • [24] Improved Harris Hawks Optimization Algorithm Based Data Placement Strategy for Integrated Cloud and Edge Computing
    Nivethitha, V.
    Aghila, G.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (01) : 887 - 904
  • [25] Performance Analysis of Blockchain-Enabled Data-Sharing Scheme in Cloud-Edge Computing-Based IoT Networks
    Okegbile, Samuel D.
    Cai, Jun
    Alfa, Attahiru S.
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (21): : 21520 - 21536
  • [26] Privacy Aware Access Control for Data Sharing in Cloud Computing Environments
    Takabi, Hassan
    SCC'14: PROCEEDINGS OF THE 2ND INTERNATIONAL WORKSHOP ON SECURITY IN CLOUD COMPUTING, 2014, : 27 - 34
  • [27] Context-Aware Data and Task Placement in Edge Computing Environments
    Breitbach, Martin
    Schaefer, Dominik
    Edinger, Janick
    Becker, Christian
    2019 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS (PERCOM), 2019,
  • [28] Fault Tolerant and Data Oriented Scientific Workflows Management and Scheduling System in Cloud Computing
    Ahmad, Zulfiqar
    Jehangiri, Ali Imran
    Mohamed, Nader
    Othman, Mohamed
    Umar, Arif Iqbal
    IEEE ACCESS, 2022, 10 : 77614 - 77632
  • [29] Imposing Cache: Busy-Aware Cooperative Data Caching in Edge-Cloud Environments
    Lim, Gyujeong
    Kang, Jihun
    Yu, Heonchang
    PROCEEDINGS OF THE 4TH EUROPEAN SYMPOSIUM ON SOFTWARE ENGINEERING, ESSE 2023, 2024, : 61 - 69
  • [30] Multi-objective Optimization for Data Placement Strategy in Cloud Computing
    Guo, Lizheng
    He, Zongyao
    Zhao, Shuguang
    Zhang, Na
    Wang, Junhao
    Jiang, Changyun
    INFORMATION COMPUTING AND APPLICATIONS, PT 2, 2012, 308 : 119 - 126