Qin: Unified Hierarchical Cluster-Node Scheduling for Heterogeneous Datacenters

被引:0
作者
Guan, Wenkai [1 ]
Ababei, Cristinel [2 ]
机构
[1] Univ Minnesota, Div Sci & Math, Morris, MN 56267 USA
[2] Marquette Univ, Dept Elect & Comp Engn, Milwaukee, WI 53233 USA
来源
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING | 2025年 / 10卷 / 01期
关键词
Servers; Optimization; Collaboration; Interference; Scheduling; Motion pictures; Scheduling algorithms; Collaborative filtering; datacenters; heterogeneity; Kalman filtering; scheduling; ALLOCATION; EFFICIENT;
D O I
10.1109/TSUSC.2024.3392480
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Energy efficiency is among the most important challenges for computing. There has been an increasing gap between the rate at which the performance of processors has been improving and the lower rate of improvement in energy efficiency. This paper answers the question of how to reduce energy usage in heterogeneous datacenters. It proposes a unified hierarchical scheduling using a D-Choices technique, which considers interference and heterogeneity. Heterogeneity comes from servers' continuous upgrades and the integrated high-performance "big" and energy-efficient "little" cores. This results in datacenters becoming more heterogeneous and traditional job scheduling algorithms become suboptimal. To this end, we present a two-level hierarchical scheduler for datacenters that exploits increased server heterogeneity. It combines in a unified approach cluster and node level scheduling algorithms, and it can consider specific optimization objectives including job completion time, energy usage, and energy-delay-product (EDP). Its novelty lies in the unified approach and in modeling interference and heterogeneity. Experiments on a research cluster found that the proposed approach outperforms state-of-the-art schedulers by around 10% in job completion time, 39% in energy usage, and 42% in EDP. This paper demonstrated a unified approach as a promising direction in optimizing energy and performance for heterogeneous datacenters.
引用
收藏
页码:39 / 56
页数:18
相关论文
共 73 条
  • [1] Power and Thermal-Aware Workload Allocation in Heterogeneous Data Centers
    Al-Qawasmeh, Abdulla M.
    Pasricha, Sudeep
    Maciejewski, Anthony A.
    Siegel, Howard Jay
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (02) : 477 - 491
  • [2] Amazon Elastic Kubernetes Service (EKS), 2023, ABOUT US
  • [3] AmazonAWS admission control, 2022, ABOUT US
  • [4] [Anonymous], 2015, PARSEC 3.0-Beta
  • [5] [Anonymous], 2023, US Energy Information Administration "Electricity Data Browser"
  • [6] [Anonymous], 2022, Google Cloud Platform
  • [7] Azure Kubernetes Service (AKS), 2023, ABOUT US
  • [8] Run-Time Adaptive Workload Estimation for Dynamic Voltage Scaling
    Bang, Sung-Yong
    Bang, Kwanhu
    Yoon, Sungroh
    Chung, Eui-Young
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2009, 28 (09) : 1334 - 1347
  • [9] Barroso LuizAndre., 2009, The Datacenter as a Computer, DOI [DOI 10.2200/S00193ED1V01Y200905CAC006, 10.2200/s00193ed1v01y200905cac006]
  • [10] Balanced allocations:: The heavily loaded case
    Berenbrink, P
    Czumaj, A
    Steger, A
    Vöcking, B
    [J]. SIAM JOURNAL ON COMPUTING, 2006, 35 (06) : 1350 - 1385