RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers

被引:0
|
作者
Zhu, Hang [1 ]
Kaffes, Kostis [2 ]
Chen, Zixu [1 ]
Liu, Zhenming [3 ]
Kozyrakis, Christos [2 ]
Stoica, Ion [4 ]
Jin, Xin [1 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Stanford Univ, Stanford, CA 94305 USA
[3] Coll William & Mary, Williamsburg, VA 23187 USA
[4] Univ Calif Berkeley, Berkeley, CA USA
来源
PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20) | 2020年
关键词
TAIL;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Low-latency online services have strict Service Level Objectives (SLOs) that require datacenter systems to support high throughput at microsecond-scale tail latency. Dataplane operating systems have been designed to scale up multi-core servers with minimal overhead for such SLOs. However, as application demands continue to increase, scaling up is not enough, and serving larger demands requires these systems to scale out to multiple servers in a rack. We present RackSched, the first rack-level microsecond-scale scheduler that provides the abstraction of a rack-scale computer (i.e., a huge server with hundreds to thousands of cores) to an external service with network-system co-design. The core of RackSched is a two-layer scheduling framework that integrates inter-server scheduling in the top-of-rack (ToR) switch with intra-server scheduling in each server. We use a combination of analytical results and simulations to show that it provides near-optimal performance as centralized scheduling policies, and is robust for both low-dispersion and high-dispersion workloads. We design a custom switch data plane for the inter-server scheduler, which realizes power-of-k-choices, ensures request affinity, and tracks server loads accurately and efficiently. We implement a RackSched prototype on a cluster of commodity servers connected by a Barefoot Tofino switch. End-to-end experiments on a twelve-server testbed show that RackSched improves the throughput by up to 1.44 x , and scales out the throughput near linearly, while maintaining the same tail latency as one server until the system is saturated.
引用
收藏
页码:1225 / 1240
页数:16
相关论文
共 50 条
  • [21] Dynamic architecture of protein kinases from microsecond-scale simulations
    McClendon, Christopher L.
    Kornev, Alexandr P.
    Gilson, Michael K.
    Taylor, Susan S.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2014, 248
  • [22] Microsecond-Scale Timing Precision in Rodent Trigeminal Primary Afferents
    Bale, Michael R.
    Campagner, Dario
    Erskine, Andrew
    Petersen, Rasmus S.
    JOURNAL OF NEUROSCIENCE, 2015, 35 (15): : 5935 - 5940
  • [23] Ionic Mechanisms of Microsecond-Scale Spike Timing in Single Cells
    Markham, Michael R.
    Zakon, Harold H.
    JOURNAL OF NEUROSCIENCE, 2014, 34 (19): : 6668 - 6678
  • [24] The fracture energy of materials under pulse microsecond-scale loading
    A. A. Gruzdkov
    S. I. Krivosheev
    Yu. V. Petrov
    Physics of the Solid State, 2003, 45 : 886 - 889
  • [25] KRCORE: A Microsecond-scale RDMA Control Plane for Elastic Computing
    Wei, Xingda
    Lu, Fangming
    Chen, Rong
    Chen, Haibo
    PROCEEDINGS OF THE 2022 USENIX ANNUAL TECHNICAL CONFERENCE, 2022, : 121 - 136
  • [26] The fracture energy of materials under pulse microsecond-scale loading
    Gruzdkov, AA
    Krivosheev, SI
    Petrov, YV
    PHYSICS OF THE SOLID STATE, 2003, 45 (05) : 886 - 889
  • [27] Microsecond-scale fast fusing and cutout characteristics of high current fuse
    Zhang, Yu
    Su, Jiancang
    Zheng, Lei
    Li, Rui
    Yu, Binxiong
    Qiu, Xudong
    Zeng, Bo
    Cheng, Jie
    Xu, Xiudong
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2017, 11 (08) : 2094 - 2101
  • [28] STANlite - a database engine for secure data processing at rack-scale level
    Sartakov, Vasily A.
    Weichbrodt, Nico
    Krieter, Sebastian
    Leich, Thomas
    Kapitza, Ruediger
    2018 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2018), 2018, : 23 - 33
  • [29] Rack-scale Disaggregated cloud data centers: The dReDBox project vision
    Katrinis, K.
    Syrivelis, D.
    Pnevmatikatos, D.
    Zervas, G.
    Theodoropoulos, D.
    Koutsopoulos, I.
    Hasharoni, K.
    Raho, D.
    Pinto, C.
    Espina, F.
    Lopez-Buedo, S.
    Chen, Q.
    Nemirovsky, M.
    Roca, D.
    Klos, H.
    Berends, T.
    PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 690 - 695
  • [30] The Case for RackOut: Scalable Data Serving Using Rack-Scale Systems
    Novakovic, Stanko
    Daglis, Alexandros
    Bugnion, Edouard
    Falsafi, Babak
    Grot, Boris
    PROCEEDINGS OF THE SEVENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC 2016), 2016, : 182 - 195