A Novel Data Placement Strategy for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments

被引:54
|
作者
Du, Xin [1 ]
Tang, Songtao [1 ]
Lu, Zhihui [1 ,2 ]
Wu, Jie [1 ,5 ]
Gai, Keke [3 ]
Hung, Patrick C. K. [4 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[2] Shanghai Blockchain Engn Res Ctr, Shanghai, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[4] Univ Ontario Inst Technol, Fac Business & IT, Oshawa, ON, Canada
[5] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai, Peoples R China
来源
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2020) | 2020年
基金
中国国家自然科学基金;
关键词
Heterogeneous edge-cloud computing environments; data placement; data-sharing; scientific workflows; MODEL;
D O I
10.1109/ICWS49710.2020.00073
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
deployment of datasets in the heterogeneous edge-cloud computing paradigm has received increasing attention in state-of-the-art research. However, due to their large sizes and the existence of private scientific datasets, finding an optimal data placement strategy that can minimize data transmission as well as improve performance, remains a persistent problem. In this study, the advantages of both edge and cloud computing are combined to construct a data placement model that works for multiple scientific workflows. Apparently, the most difficult research challenge is to provide a data placement strategy to consider shared datasets, both within individual and among multiple workflows, across various geographically distributed environments. According to the constructed model, not only the storage capacity of edge micro-datacenters, but also the data transfer between multiple clouds across regions must be considered. To address this issue, we considered the characteristics of this model and identified the factors that are causing the transmission delay. The authors propose using a discrete particle swarm optimization algorithm with differential evolution (DE-DPSO) to distribute dataset during workflow execution. Based on this, a new data placement strategy named DE-DPSO-DPS is proposed. DE-DPSO-DPS is evaluated using several experiments designed in simulated heterogeneous edge-cloud computing environments. The results demonstrate that our data placement strategy can effectively reduce the data transmission time and achieve superior performance as compared to traditional strategies for data-sharing scientific workflows.
引用
收藏
页码:498 / 507
页数:10
相关论文
共 49 条
  • [1] Scientific Workflows in IoT Environments: A Data Placement Strategy Based on Heterogeneous Edge-Cloud Computing
    Du, Xin
    Tang, Songtao
    Lu, Zhihui
    Gai, Keke
    Wu, Jie
    Hung, Patrick C. K.
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2022, 13 (04)
  • [2] A Novel Workflow-Level Data Placement Strategy for Data-Sharing Scientific Cloud Workflows
    Li, Xuejun
    Zhang, Lei
    Wu, Yang
    Liu, Xiao
    Zhu, Erzhou
    Yi, Huikang
    Wang, Futian
    Zhang, Cheng
    Yang, Yun
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2019, 12 (03) : 370 - 383
  • [3] A Data Placement Strategy for Data-Intensive Scientific Workflows in Cloud
    Zhao, Qing
    Xiong, Congcong
    Zhao, Xi
    Yu, Ce
    Xiao, Jian
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 928 - 934
  • [5] A Time-Driven Data Placement Strategy for a Scientific Workflow Combining Edge Computing and Cloud Computing
    Lin, Bing
    Zhu, Fangning
    Zhang, Jianshan
    Chen, Jiaqing
    Chen, Xing
    Xiong, Naixue N.
    Mauri, Jaime Lloret
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (07) : 4254 - 4265
  • [6] A Novel Cost-aware Data Placement Strategy for Edge-Cloud Collaborative Smart Systems
    Zhang, Yifei
    Xu, Jia
    Liu, Xiao
    Pan, Wuzhen
    Li, Xuejun
    2023 IEEE 16TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD, 2023, : 450 - 456
  • [7] Security-aware intermediate data placement strategy in scientific cloud workflows
    Liu, Wei
    Peng, Su
    Du, Wei
    Wang, Wei
    Zeng, Guo Sun
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 423 - 447
  • [8] Time-driven data placement strategy for scientific workflows in hybrid cloud
    Lin B.
    Xiang T.
    Chen G.
    Chen X.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (04): : 909 - 919
  • [9] BDAP: A Big Data Placement Strategy for Cloud-Based Scientific Workflows
    Ebrahimi, Mahdi
    Mohan, Aravind
    Kashlev, Andrey
    Lu, Shiyong
    2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 105 - 114
  • [10] Security-aware intermediate data placement strategy in scientific cloud workflows
    Wei Liu
    Su Peng
    Wei Du
    Wei Wang
    Guo Sun Zeng
    Knowledge and Information Systems, 2014, 41 : 423 - 447