Reliability/Performance-Aware Scheduling for Parallel Applications With Energy Constraints on Heterogeneous Computing Systems

被引:9
作者
Peng, Jiwu [1 ]
Li, Kenli
Chen, Jianguo [2 ]
Li, Keqin [1 ,3 ,4 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 117684, Singapore
[3] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[4] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
来源
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING | 2022年 / 7卷 / 03期
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Reliability; Task analysis; Energy consumption; Schedules; Scheduling; Program processors; Scheduling algorithms; DVFS; energy consumption constrained; energy demand rate; parallel application scheduling; performance and reliability; reliability performance ratio; MAXIMIZING RELIABILITY; CONSERVATION; ALGORITHM;
D O I
10.1109/TSUSC.2022.3146138
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous Computing Systems (HCSs) have developed rapidly due to their high performance and low cost, and have been adopted by more and more applications. Energy consumption, reliability, and schedule length are the core issues of HCSs. Due to the negative correlation between frequency and reliability, DVFS-supported HCSs requires high energy consumption and a long schedule length to obtain high reliability, which resulting in performance degradation. In this paper, we focus on the reliability and performance-aware scheduling for energy-constrained parallel applications on HCSs. First, we design an energy pre-allocation mechanism based on Energy Demand Rate (EDR) to pre-allocate energy reasonably. Second, we propose an EDR-aware Maximizing Reliability of Energy-Constrained parallel applications (EMREC) scheduling algorithm. Third, considering that maximize reliability will cause the schedule length to be too long and unacceptable, we further highlight the concept of Reliability Performance Ratio (RPR). Finally, we propose a Maximizing RPR with Energy-Constrained parallel applications (MRPEC) scheduling algorithm, which enables parallel applications have a smaller schedule length while with high reliability. Extensive experimental results in real-world and randomly generated applications show the effectiveness of the proposed algorithms under different conditions.
引用
收藏
页码:681 / 695
页数:15
相关论文
共 33 条
[1]   A Survey on Scheduling Strategies for Workflows in Cloud Environment and Emerging Trends [J].
Adhikari, Mainak ;
Amgoth, Tarachand ;
Srirama, Satish Narayana .
ACM COMPUTING SURVEYS, 2019, 52 (04)
[2]  
[Anonymous], 2015, TASK GRAPH GENERATOR
[3]   Simultaneous Management of Peak-Power and Reliability in Heterogeneous Multicore Embedded Systems [J].
Ansari, Mohsen ;
Saber-Latibari, Javad ;
Pasandideh, Mostafa ;
Ejlali, Alireza .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (03) :623-633
[4]   Modelling DVFS and UFS for Region-Based Energy Aware Tuning of HPC Applications [J].
Chadha, Mohak ;
Gerndt, Michael .
2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, :805-814
[5]  
Chen Q., 2017, TASK SCHEDULING FORM
[6]   IPPTS: An Efficient Algorithm for Scientific Workflow Scheduling in Heterogeneous Computing Systems [J].
Djigal, Hamza ;
Feng, Jun ;
Lu, Jiamin ;
Ge, Jidong .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (05) :1057-1071
[7]   A Dynamic Programming Framework for DVFS-Based Energy-Efficiency in Multicore Systems [J].
Hajiamini, Shervin ;
Shirazi, Behrooz ;
Crandall, Aaron ;
Ghasemzadeh, Hassan .
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2020, 5 (01) :1-12
[8]   Energy-Efficient Resource Utilization for Heterogeneous Embedded Computing Systems [J].
Huang, Jing ;
Li, Renfa ;
An, Jiyao ;
Ntalasha, Derrick ;
Yang, Fan ;
Li, Keqin .
IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (09) :1518-1531
[9]   Performance Evaluation and Energy Efficiency of High-Density HPC Platforms Based on Intel, AMD and ARM Processors [J].
Jarus, Mateusz ;
Varrette, Sebastien ;
Oleksiak, Ariel ;
Bouvry, Pascal .
ENERGY EFFICIENCY IN LARGE SCALE DISTRIBUTED SYSTEMS, EE-LSDS 2013, 2013, 8046 :182-200
[10]   Cooperative Transmission of Energy-Constrained IoT Devices in Wireless-Powered Communication Networks [J].
Jeong, Cheol ;
Son, Hyukmin .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) :3972-3982