SciLance: Mitigate Load Imbalance for Parallel Scientific Applications in Cloud Environments

被引:0
作者
Wang, Xinying [1 ]
Wan, Lipeng [2 ]
Klasky, Scott [3 ]
Zhao, Dongfang [4 ]
Yan, Feng [5 ]
机构
[1] Univ Nevada, Reno, NV 89557 USA
[2] Georgia State Univ, Atlanta, GA USA
[3] Oak Ridge Natl Lab, Oak Ridge, TN USA
[4] Univ Washington, Tacoma, WA USA
[5] Univ Houston, Houston, TX USA
来源
2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, CLUSTER | 2023年
基金
美国国家科学基金会;
关键词
load balancing; resource management; parallel computing;
D O I
10.1109/CLUSTER52292.2023.00012
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Elastic cloud computing provides new opportunities for accelerating the process of scientific discovery. However, unlike high-performance computing (HPC) systems that are built and optimized for workloads with intensive inter-node communication demands, the low-latency and high bandwidth communication capability is only enabled on a few very expensive high-end instance types in the cloud, which leads to poor cost-effectiveness. In addition, re-balancing the workload through extra data movement among compute nodes is a common way to mitigate the load imbalance issue in many scientific simulations, which further amplifies the communication pressure and makes it challenging to efficiently use cloud resources. To this end, we propose SciLance, which addresses the workload imbalance challenge by utilizing the heterogeneous and elastic resources offered by cloud platforms. Particularly, instead of moving data excessively among compute instances to balance the workload, SciLance dynamically adjusts the computer instances used for running parallel tasks based on the runtime imbalance identified through profiling. We prototype SciLance and perform extensive evaluation using adaptive mesh refinement (AMR) based scientific applications. The evaluation results demonstrate that SciLance can achieve up to 36.63% better performance with 16.91% lower cost for AMR-based simulation codes.
引用
收藏
页码:49 / 59
页数:11
相关论文
共 50 条
  • [31] Priority-based Virtual Machine Load Balancing in a Scientific Federated Cloud
    Jaikar, Amol Hindurao
    Huang, Dada
    Kim, Gyeong-Ryoon
    Noh, Seo-Young
    2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (CLOUDNET), 2014, : 248 - 254
  • [32] Automatic runtime load balancing of dedicated applications in heterogeneous environments
    Höfinger, S
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2002, 2474 : 62 - 69
  • [33] Proposing A Load Balancing Algorithm For The Optimization Of Cloud Computing Applications
    Shafiq, Dalia Abdulkareem
    Jhanjhi, N. Z.
    Abdullah, Azween
    2019 13TH INTERNATIONAL CONFERENCE ON MATHEMATICS, ACTUARIAL SCIENCE, COMPUTER SCIENCE AND STATISTICS (MACS-13), 2019,
  • [34] Honey bee behavior inspired load balancing of tasks in cloud computing environments
    Babu, Dhinesh L. D.
    Krishna, P. Venkata
    APPLIED SOFT COMPUTING, 2013, 13 (05) : 2292 - 2303
  • [35] Intelligent Decision-Making of Load Balancing Using Deep Reinforcement Learning and Parallel PSO in Cloud Environment
    Pradhan, Arabinda
    Bisoy, Sukant Kishoro
    Kautish, Sandeep
    Jasser, Muhammed Basheer
    Mohamed, Ali Wagdy
    IEEE ACCESS, 2022, 10 : 76939 - 76952
  • [36] A Parallel Fuzzy Load Balancing Algorithm for Distributed Nodes Over a Cloud System
    Hamdani, Mostefa
    Aklouf, Youcef
    Bouarara, Hadj Ahmed
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2022, 14 (01)
  • [37] Parallel dynamic load balancing strategies for adaptive irregular applications
    Biswas, R
    Das, SK
    Harvey, DJ
    Oliker, L
    APPLIED MATHEMATICAL MODELLING, 2000, 25 (02) : 109 - 122
  • [38] A Scientific Workflow Management System for orchestration of parallel components in a cloud of large-scale parallel processing services
    Silva, Jefferson de Carvalho
    de Oliveira Dantas, Allberson Bruno
    de Carvalho Junior, Francisco Heron
    SCIENCE OF COMPUTER PROGRAMMING, 2019, 173 : 95 - 127
  • [39] Parallel extremal optimization in processor load balancing for distributed applications
    De Falco, Ivanoe
    Laskowski, Eryk
    Olejnik, Richard
    Scafuri, Umberto
    Tarantino, Ernesto
    Tudruj, Marek
    APPLIED SOFT COMPUTING, 2016, 46 : 187 - 203
  • [40] Mobile agents based load balancing method for parallel applications
    Hninn Aye Thant
    Khaing Moe San
    Khin Mar Lar Tun
    Thinn Thu Naing
    Nilar Thein
    APSITT 2005: 6TH ASIA-PACIFIC SYMPOSIUM ON INFORMATION AND TELECOMMUNICATION TECHNOLOGIES, PROCEEDINGS, 2005, : 77 - 82