Efficient Scheduling Policies for Microsecond-Scale Tasks

被引:0
|
作者
McClure, Sarah [1 ]
Ousterhout, Amy [1 ]
Shenker, Scott [1 ,2 ]
Ratnasamy, Sylvia [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] ICSI, Berkeley, CA USA
来源
PROCEEDINGS OF THE 19TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION (NSDI '22) | 2022年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Datacenter operators today strive to support microsecond-latency applications while also using their limited CPU resources as efficiently as possible. To achieve this, several recent systems allow multiple applications to run on the same server, granting each a dedicated set of cores and reallocating cores across applications over time as load varies. Unfortunately, many of these systems do a poor job of navigating the tradeoff between latency and efficiency, sacrificing one or both, especially when handling tasks as short as 1 mu s. While the implementations of these systems (threading libraries, network stacks, etc.) have been heavily optimized, the policy choices that they make have received less scrutiny. Most systems implement a single choice of policy for allocating cores across applications and for load-balancing tasks across cores within an application. In this paper, we use simulations to compare these different policy options and explore which yield the best combination of latency and efficiency. We conclude that work stealing performs best among loadbalancing policies, multiple policies can perform well for core allocations, and, surprisingly, static core allocations often outperform reallocation with small tasks. We implement the best-performing policy choices by building on Caladan, an existing core-allocating system, and demonstrate that they can yield efficiency improvements of up to 13-22% without degrading (median or tail) latency.
引用
收藏
页码:1 / 18
页数:18
相关论文
共 50 条
  • [1] Aquifer: Transparent Microsecond-Scale Scheduling for vRAN Workloads
    Jia, Yunshan
    Zhong, Yinmin
    Wang, Meng
    Gao, Jiaqi
    Zhang, Pengyu
    Liu, Xuanzhe
    Jin, Xin
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (06) : 3171 - 3184
  • [2] ZygOS: Achieving Low Tail Latency for Microsecond-scale Networked Tasks
    Prekas, George
    Kogias, Marios
    Bugnion, Edouard
    PROCEEDINGS OF THE TWENTY-SIXTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES (SOSP '17), 2017, : 325 - 341
  • [3] Microsecond-Scale Core Reallocation
    Queue, 2023, 21 (02): : 3 - 4
  • [4] Achieving Microsecond-Scale Tail Latency Efficiently with Approximate Optimal Scheduling
    Iyer, Rishabh
    Unal, Musa
    Kogias, Marios
    Candea, George
    PROCEEDINGS OF THE TWENTY-NINTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, SOSP 2023, 2023, : 466 - 481
  • [5] RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers
    Zhu, Hang
    Kaffes, Kostis
    Chen, Zixu
    Liu, Zhenming
    Kozyrakis, Christos
    Stoica, Ion
    Jin, Xin
    PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20), 2020, : 1225 - 1240
  • [6] uBFT: Microsecond-Scale BFT using Disaggregated Memory
    Aguilera, Marcos K.
    Ben-David, Naama
    Guerraoui, Rachid
    Murat, Antoine
    Xygkis, Athanasios
    Zablotchi, Igor
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 2, ASPLOS 2023, 2023, : 862 - 877
  • [7] Ultrafast cooling reveals microsecond-scale biomolecular dynamics
    Polinkovsky, Mark E.
    Gambin, Yann
    Banerjee, Priya R.
    Erickstad, Michael J.
    Groisman, Alex
    Deniz, Ashok A.
    NATURE COMMUNICATIONS, 2014, 5
  • [8] Ultrafast cooling reveals microsecond-scale biomolecular dynamics
    Mark E. Polinkovsky
    Yann Gambin
    Priya R. Banerjee
    Michael J. Erickstad
    Alex Groisman
    Ashok A. Deniz
    Nature Communications, 5
  • [9] DORADD: Deterministic Parallel Execution in the Era of Microsecond-Scale Computing
    Liu, Zhengqing
    Unal, Musa
    Parkinson, Matthew J.
    Kogias, Marios
    PROCEEDINGS OF THE 2025 THE 30TH ACM SIGPLAN ANNUAL SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, PPOPP 2025, 2025, : 282 - 296
  • [10] Nu: Achieving Microsecond-Scale Resource Fungibility with Logical Processes
    Ruan, Zhenyuan
    Park, Seo Jin
    Aguilera, Marcos K.
    Belay, Adam
    Schwarzkopf, Malte
    PROCEEDINGS OF THE 20TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, NSDI 2023, 2023, : 1409 - 1427