A Novel Data Placement Strategy for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments

被引:54
|
作者
Du, Xin [1 ]
Tang, Songtao [1 ]
Lu, Zhihui [1 ,2 ]
Wu, Jie [1 ,5 ]
Gai, Keke [3 ]
Hung, Patrick C. K. [4 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[2] Shanghai Blockchain Engn Res Ctr, Shanghai, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[4] Univ Ontario Inst Technol, Fac Business & IT, Oshawa, ON, Canada
[5] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai, Peoples R China
来源
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2020) | 2020年
基金
中国国家自然科学基金;
关键词
Heterogeneous edge-cloud computing environments; data placement; data-sharing; scientific workflows; MODEL;
D O I
10.1109/ICWS49710.2020.00073
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
deployment of datasets in the heterogeneous edge-cloud computing paradigm has received increasing attention in state-of-the-art research. However, due to their large sizes and the existence of private scientific datasets, finding an optimal data placement strategy that can minimize data transmission as well as improve performance, remains a persistent problem. In this study, the advantages of both edge and cloud computing are combined to construct a data placement model that works for multiple scientific workflows. Apparently, the most difficult research challenge is to provide a data placement strategy to consider shared datasets, both within individual and among multiple workflows, across various geographically distributed environments. According to the constructed model, not only the storage capacity of edge micro-datacenters, but also the data transfer between multiple clouds across regions must be considered. To address this issue, we considered the characteristics of this model and identified the factors that are causing the transmission delay. The authors propose using a discrete particle swarm optimization algorithm with differential evolution (DE-DPSO) to distribute dataset during workflow execution. Based on this, a new data placement strategy named DE-DPSO-DPS is proposed. DE-DPSO-DPS is evaluated using several experiments designed in simulated heterogeneous edge-cloud computing environments. The results demonstrate that our data placement strategy can effectively reduce the data transmission time and achieve superior performance as compared to traditional strategies for data-sharing scientific workflows.
引用
收藏
页码:498 / 507
页数:10
相关论文
共 49 条
  • [41] Migrating Scientific Workflows to the Cloud Through Graph-partitioning, Scheduling and Peer-to-Peer Data Sharing
    Srirama, Satish Narayana
    Viil, Jaagup
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 1105 - 1112
  • [42] A Heterogeneous IoT Data Analysis Framework with Collaboration of Edge-Cloud Computing: Focusing on Indoor PM10 and PM2.5 Status Prediction
    Moon, Jaewon
    Kum, Seungwoo
    Lee, Sangwon
    SENSORS, 2019, 19 (14)
  • [43] IoV data sharing scheme based on the hybrid architecture of blockchain and cloud-edge computing
    Tiange Zheng
    Junhua Wu
    Guangshun Li
    Journal of Cloud Computing, 12
  • [44] IoV data sharing scheme based on the hybrid architecture of blockchain and cloud-edge computing
    Zheng, Tiange
    Wu, Junhua
    Li, Guangshun
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2023, 12 (01):
  • [45] Flexible heterogeneous data fusion strategy for object positioning applications in edge computing environment
    Li, Chunlin
    Zhang, Yong
    Luo, Youlong
    COMPUTER NETWORKS, 2022, 212
  • [46] Temperature Matrix-Based Data Placement Using Improved Hungarian Algorithm in Edge Computing Environments
    Zhao, Yuying
    Wang, Pengwei
    Huang, Hengdi
    Zhang, Zhaohui
    PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT 2021, 2022, 13148 : 237 - 248
  • [47] FCA-based Energy Aware-data Placement Strategy for Intensive Workflow in Cloud Computing.
    Derouiche, Rihab
    Brahmi, Zaki
    Gammoudi, Mohamed Mohsen
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019), 2019, 159 : 387 - 397
  • [48] RDP: A storage-tier-aware Robust Data Placement strategy for Hadoop in a Cloud-based Heterogeneous Environment
    Qureshi, Nawab Muhammad Faseeh
    Shin, Dong Ryeol
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (09): : 4063 - 4086
  • [49] A User Mobility-Based Data Placement Strategy in a Hybrid Cloud/Edge Environment Using a Causal-Aware Deep Learning Network
    Symvoulidis, Chrysostomos
    Kiourtis, Athanasios
    Marinos, George
    Tom-Ata, Jean-Didier Totow
    Manias, George
    Mavrogiorgou, Argyro
    Kyriazis, Dimosthenis
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (12) : 3603 - 3616