Reliability/Performance-Aware Scheduling for Parallel Applications With Energy Constraints on Heterogeneous Computing Systems

被引:9
作者
Peng, Jiwu [1 ]
Li, Kenli
Chen, Jianguo [2 ]
Li, Keqin [1 ,3 ,4 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 117684, Singapore
[3] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[4] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
来源
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING | 2022年 / 7卷 / 03期
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Reliability; Task analysis; Energy consumption; Schedules; Scheduling; Program processors; Scheduling algorithms; DVFS; energy consumption constrained; energy demand rate; parallel application scheduling; performance and reliability; reliability performance ratio; MAXIMIZING RELIABILITY; CONSERVATION; ALGORITHM;
D O I
10.1109/TSUSC.2022.3146138
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous Computing Systems (HCSs) have developed rapidly due to their high performance and low cost, and have been adopted by more and more applications. Energy consumption, reliability, and schedule length are the core issues of HCSs. Due to the negative correlation between frequency and reliability, DVFS-supported HCSs requires high energy consumption and a long schedule length to obtain high reliability, which resulting in performance degradation. In this paper, we focus on the reliability and performance-aware scheduling for energy-constrained parallel applications on HCSs. First, we design an energy pre-allocation mechanism based on Energy Demand Rate (EDR) to pre-allocate energy reasonably. Second, we propose an EDR-aware Maximizing Reliability of Energy-Constrained parallel applications (EMREC) scheduling algorithm. Third, considering that maximize reliability will cause the schedule length to be too long and unacceptable, we further highlight the concept of Reliability Performance Ratio (RPR). Finally, we propose a Maximizing RPR with Energy-Constrained parallel applications (MRPEC) scheduling algorithm, which enables parallel applications have a smaller schedule length while with high reliability. Extensive experimental results in real-world and randomly generated applications show the effectiveness of the proposed algorithms under different conditions.
引用
收藏
页码:681 / 695
页数:15
相关论文
共 50 条
[31]   A Reliability-aware Task Scheduling Algorithm Based on Replication on Heterogeneous Computing Systems [J].
Wang, Shuli ;
Li, Kenli ;
Mei, Jing ;
Xiao, Guoqing ;
Li, Keqin .
JOURNAL OF GRID COMPUTING, 2017, 15 (01) :23-39
[32]   A reliability-aware and energy-aware task scheduling algorithm for heterogeneous multi-core systems [J].
Liu, Dongyang ;
Chen, Junhua ;
Huang, Xueda ;
Hong, Haojun .
PROCEEDINGS OF THE 36TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC 2024, 2024, :3212-3217
[33]   Energy-Aware Profit Maximizing Scheduling Algorithm for Heterogeneous Computing Systems [J].
Tarplee, Kyle M. ;
Maciejewski, Anthony A. ;
Siegel, Howard Jay .
2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2014, :595-603
[34]   GOLGI: Performance-Aware, Resource-Efficient Function Scheduling for Serverless Computing [J].
Li, Suyi ;
Wang, Wei ;
Yang, Jun ;
Chen, Guangzhen ;
Lu, Daohe .
PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON CLOUD COMPUTING, SOCC 2023, 2023, :32-47
[35]   Research on High-Reliability Energy-Aware Scheduling Strategy for Heterogeneous Distributed Systems [J].
Chen, Ziyu ;
Wu, Jing ;
Cheng, Lin ;
Tao, Tao .
BIG DATA AND COGNITIVE COMPUTING, 2025, 9 (06)
[36]   Reliability aware Scheduling In Cloud Computing [J].
Malik, Sheheryar ;
Huet, Fabrice ;
Caramel, Denis .
2012 INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS, 2012, :194-201
[37]   A DVFS-Weakly Dependent Energy-Efficient Scheduling Approach for Deadline-Constrained Parallel Applications on Heterogeneous Systems [J].
Huang, Jing ;
Li, Renfa ;
An, Jiyao ;
Zeng, Haibo ;
Chang, Wanli .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2021, 40 (12) :2481-2494
[38]   Roofline Model Based Performance-Aware Energy Management for Scientific Computing [J].
Wang, Yunlan ;
Zhao, Tianhai ;
Li, Lu ;
Hou, Zhengxiong ;
Gu, Jianhua .
2018 9TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP 2018), 2018, :74-80
[39]   Reliability Enhancement Strategies for Workflow Scheduling Under Energy Consumption Constraints in Clouds [J].
Zhang, Longxin ;
Ai, Minghui ;
Liu, Ke ;
Chen, Jianguo ;
Li, Kenli .
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2024, 9 (02) :155-169
[40]   Energy aware DAG scheduling on heterogeneous systems [J].
Sanjeev Baskiyar ;
Rabab Abdel-Kader .
Cluster Computing, 2010, 13 :373-383