Binary Search-Based Fast Scheduling Algorithms for Reliability-Aware Energy-Efficient Task Graph Scheduling With Fault Tolerance

被引:1
作者
Biswas, Sajib K. [1 ]
Muhuri, Pranab K. [1 ]
Roy, Uttam K. [1 ]
机构
[1] South Asian Univ, Dept Comp Sci, Delhi 110068, India
来源
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING | 2024年 / 9卷 / 03期
关键词
Reliability; Task analysis; Schedules; Energy consumption; Fault tolerant systems; Fault tolerance; Energy efficiency; Directed acyclic graph; dynamic voltage and frequency scaling; energy-efficient scheduling; fault tolerance; greedy binary search; heterogeneous computing systems; reliability; search-space; RELIABLE PARALLEL APPLICATIONS; PRECEDENCE CONSTRAINED TASKS; SYSTEMS; OPTIMIZATION; CONSUMPTION;
D O I
10.1109/TSUSC.2023.3295939
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Among the available processor-level energy savings schemes, dynamic voltage and frequency scaling (DVFS) is very popular and effective due to its widespread cross-platform use in designing energy-efficient scheduling algorithms. However, rapid frequency switching by DVFS based algorithms while minimizing the energy consumptions may result transient failures in the system. To avoid such failures and their catastrophic consequences, energy-efficient scheduling algorithms with the capabilities to provide more reliable task schedules are always in demand. Therefore, this paper introduces two novel low complexity energy-efficient task scheduling algorithms for heterogeneous computing environments. We term the first algorithm as 'binary search-based energy-efficient scheduling with reliability goal (BSESRG)' for running parallel task graphs in heterogeneous computing systems. We show that the proposed BSESRG has the capability to reduce energy consumption, and shorten the total schedule length by meeting the reliability goals up to a certain threshold. Then, we present our second algorithm, the 'binary search-based energy-efficient fault-tolerant scheduling with reliability goal (BSESRG-FT), which ensures meeting the reliability goals with simultaneous consideration of fault tolerance. The proposed BSESRG-FT is able to reach higher reliability goals, reduce energy consumption, and shorten the total schedule length of a parallel task graph on heterogeneous platforms. We demonstrate the working of both BSESRG and BSESRG-FT through simulation experiments considering real-world task graphs, and show the supremacy of the two proposed algorithms over their respective peers (viz., ESRG and EFSRG) in terms of energy savings, schedule lengths, run times and reliability goals. The superiority of the proposed BSESRG and BSESRG-FT over their respective competitors are also validated on the real benchmark MiBench. Moreover, from the complexity analysis, we respectively find the time complexities of BSESRG and BSESRG-FT as O(|X| x |P| x log(2)|F|) and O (|X| x |P|(2 )x log(2)|F|) confirming their better computational efficiency than the respective peers.
引用
收藏
页码:433 / 451
页数:19
相关论文
共 49 条
  • [1] Thermal-Aware Standby-Sparing Technique on Heterogeneous Real-Time Embedded Systems
    Ansari, Mohsen
    Safari, Sepideh
    Yari-Karin, Sina
    Gohari-Nazari, Pourya
    Khdr, Heba
    Shafique, Muhammad
    Henkel, Joerg
    Ejlali, Alireza
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2022, 10 (04) : 1883 - 1897
  • [2] Peak-Power-Aware Primary-Backup Technique for Efficient Fault-Tolerance in Multicore Embedded Systems
    Ansari, Mohsen
    Salehi, Mohammad
    Safari, Sepideh
    Ejlali, Alireza
    Shafique, Muhammad
    [J]. IEEE ACCESS, 2020, 8 (08): : 142843 - 142857
  • [3] Simultaneous Management of Peak-Power and Reliability in Heterogeneous Multicore Embedded Systems
    Ansari, Mohsen
    Saber-Latibari, Javad
    Pasandideh, Mostafa
    Ejlali, Alireza
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (03) : 623 - 633
  • [4] List Scheduling Algorithm for Heterogeneous Systems by an Optimistic Cost Table
    Arabnejad, Hamid
    Barbosa, Jorge G.
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (03) : 682 - 694
  • [5] Energy-Saving Task Scheduling Based on Hard Reliability Requirements: A Novel Approach with Low Energy Consumption and High Reliability
    Chen, Qingfeng
    Han, Yu
    Wu, Jing
    Gan, Yu
    [J]. SUSTAINABILITY, 2022, 14 (11)
  • [6] Near-optimal energy-efficient partial-duplication task mapping of real-time parallel applications
    Cui, Minyu
    Kritikakou, Angeliki
    Mo, Lei
    Casseau, Emmanuel
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2023, 134
  • [7] Fault-Tolerant Mapping of Real-Time Parallel Applications under multiple DVFS schemes
    Cui, Minyu
    Kritikakou, Angeliki
    Mo, Lei
    Casseau, Emmanuel
    [J]. 2021 IEEE 27TH REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATIONS SYMPOSIUM (RTAS 2021), 2021, : 387 - 399
  • [8] Slow Replica and Shared Protection: Energy-Efficient and Reliable Task Assignment in Cloud Data Centers
    Fan, Yuqi
    Wang, Chen
    Wu, Weili
    Znati, Taieb
    Du, Dingzhu
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2021, 70 (03) : 931 - 943
  • [9] Energy-Efficient Scheduling Algorithms with Reliability Goal on Heterogeneous Embedded Systems
    Han, Yu
    Hu, Wei
    Liu, Jing
    Gan, Yu
    [J]. 19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 555 - 562
  • [10] On Reliability Management of Energy-Aware Real-Time Systems Through Task Replication
    Haque, Mohammad A.
    Aydin, Hakan
    Zhu, Dakai
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (03) : 813 - 825