A Novel Data Placement Strategy for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments

被引:54
|
作者
Du, Xin [1 ]
Tang, Songtao [1 ]
Lu, Zhihui [1 ,2 ]
Wu, Jie [1 ,5 ]
Gai, Keke [3 ]
Hung, Patrick C. K. [4 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[2] Shanghai Blockchain Engn Res Ctr, Shanghai, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[4] Univ Ontario Inst Technol, Fac Business & IT, Oshawa, ON, Canada
[5] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai, Peoples R China
来源
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2020) | 2020年
基金
中国国家自然科学基金;
关键词
Heterogeneous edge-cloud computing environments; data placement; data-sharing; scientific workflows; MODEL;
D O I
10.1109/ICWS49710.2020.00073
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
deployment of datasets in the heterogeneous edge-cloud computing paradigm has received increasing attention in state-of-the-art research. However, due to their large sizes and the existence of private scientific datasets, finding an optimal data placement strategy that can minimize data transmission as well as improve performance, remains a persistent problem. In this study, the advantages of both edge and cloud computing are combined to construct a data placement model that works for multiple scientific workflows. Apparently, the most difficult research challenge is to provide a data placement strategy to consider shared datasets, both within individual and among multiple workflows, across various geographically distributed environments. According to the constructed model, not only the storage capacity of edge micro-datacenters, but also the data transfer between multiple clouds across regions must be considered. To address this issue, we considered the characteristics of this model and identified the factors that are causing the transmission delay. The authors propose using a discrete particle swarm optimization algorithm with differential evolution (DE-DPSO) to distribute dataset during workflow execution. Based on this, a new data placement strategy named DE-DPSO-DPS is proposed. DE-DPSO-DPS is evaluated using several experiments designed in simulated heterogeneous edge-cloud computing environments. The results demonstrate that our data placement strategy can effectively reduce the data transmission time and achieve superior performance as compared to traditional strategies for data-sharing scientific workflows.
引用
收藏
页码:498 / 507
页数:10
相关论文
共 49 条
  • [31] A Data Placement Strategy Based on Genetic Algorithm in Cloud Computing Platform
    Guo, Wei
    Wang, Xinjun
    2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 369 - 372
  • [32] SSDP: A Slot-sensitive Data Placement Strategy in Cloud Computing
    Tian, Tian
    Liu, Peng
    Kuang, HuaXing
    2015 THIRD INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, 2015, : 205 - 212
  • [33] Novel data-placement scheme for improving the data locality of Hadoop in heterogeneous environments
    Bae, Minho
    Yeo, Sangho
    Park, Gyudong
    Oh, Sangyoon
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (18)
  • [34] A data placement strategy based on clustering and consistent hashing algorithm in Cloud Computing
    Li, Qiang
    Wang, Kun
    Wei, Suwei
    Han, Xuefeng
    Xu, Lili
    Gao, Min
    2014 9TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA (CHINACOM), 2014, : 478 - 483
  • [35] A Novel Structure-Based Data Sharing Scheme in Cloud Computing
    Zheng, Huiyao
    Shen, Jian
    Cho, Youngju
    Su, Chunhua
    Moh, Sangman
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (02) : 222 - 229
  • [36] DCCP: an effective data placement strategy for data-intensive computations in distributed cloud computing systems
    Tao Wang
    Shihong Yao
    Zhengquan Xu
    Shan Jia
    The Journal of Supercomputing, 2016, 72 : 2537 - 2564
  • [37] DCCP: an effective data placement strategy for data-intensive computations in distributed cloud computing systems
    Wang, Tao
    Yao, Shihong
    Xu, Zhengquan
    Jia, Shan
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (07) : 2537 - 2564
  • [38] A Secure Data-Sharing Scheme for Privacy-Preserving Supporting Node-Edge-Cloud Collaborative Computation
    Zheng, Kaifa
    Ding, Caiyang
    Wang, Jinchen
    ELECTRONICS, 2023, 12 (12)
  • [39] Novel Data-Distribution Technique for Hadoop in Heterogeneous Cloud Environments
    Ubarhande, Vrushali
    Popescu, Alina-Madalina
    Gonzalez-Velez, Horacio
    2015 9TH INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS CISIS 2015, 2015, : 217 - 224
  • [40] A novel cloud model based data placement strategy for data-intensive application in clouds
    Zhang, Xinxin
    Hu, Zhigang
    Zheng, Meiguang
    Li, Jia
    Yang, Liu
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 77 : 445 - 456