RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers

被引:0
|
作者
Zhu, Hang [1 ]
Kaffes, Kostis [2 ]
Chen, Zixu [1 ]
Liu, Zhenming [3 ]
Kozyrakis, Christos [2 ]
Stoica, Ion [4 ]
Jin, Xin [1 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Stanford Univ, Stanford, CA 94305 USA
[3] Coll William & Mary, Williamsburg, VA 23187 USA
[4] Univ Calif Berkeley, Berkeley, CA USA
来源
PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20) | 2020年
关键词
TAIL;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Low-latency online services have strict Service Level Objectives (SLOs) that require datacenter systems to support high throughput at microsecond-scale tail latency. Dataplane operating systems have been designed to scale up multi-core servers with minimal overhead for such SLOs. However, as application demands continue to increase, scaling up is not enough, and serving larger demands requires these systems to scale out to multiple servers in a rack. We present RackSched, the first rack-level microsecond-scale scheduler that provides the abstraction of a rack-scale computer (i.e., a huge server with hundreds to thousands of cores) to an external service with network-system co-design. The core of RackSched is a two-layer scheduling framework that integrates inter-server scheduling in the top-of-rack (ToR) switch with intra-server scheduling in each server. We use a combination of analytical results and simulations to show that it provides near-optimal performance as centralized scheduling policies, and is robust for both low-dispersion and high-dispersion workloads. We design a custom switch data plane for the inter-server scheduler, which realizes power-of-k-choices, ensures request affinity, and tracks server loads accurately and efficiently. We implement a RackSched prototype on a cluster of commodity servers connected by a Barefoot Tofino switch. End-to-end experiments on a twelve-server testbed show that RackSched improves the throughput by up to 1.44 x , and scales out the throughput near linearly, while maintaining the same tail latency as one server until the system is saturated.
引用
收藏
页码:1225 / 1240
页数:16
相关论文
共 50 条
  • [41] An Ultra-Low Latency and Compatible PCIe Interconnect for Rack-scale Communication
    Huang, Yibo
    Huang, Yukai
    Yan, Ming
    Hu, Jiayu
    Liang, Cunming
    Xu, Yang
    Zou, Wenxiong
    Zhang, Yiming
    Zhang, Rui
    Huang, Chunpu
    Wu, Jie
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES, CONEXT 2022, 2022, : 232 - 244
  • [42] Evolution characteristics of shock wave in microsecond-scale underwater electrical wire explosion
    Huang, Yifan
    Gu, Tianyi
    Chen, Yao
    Zeng, Hanshi
    Han, Xiaotao
    PHYSICS OF FLUIDS, 2024, 36 (12)
  • [43] Characterization of shallow acceptors in GaAs by microsecond-scale time-resolved photoluminescence
    Gilinsky, AM
    Zhuravlev, KS
    APPLIED PHYSICS LETTERS, 1996, 68 (03) : 373 - 375
  • [44] Microsecond-scale staircase voltammetry for measuring the electrical conductivity of highly conductive liquids
    Kim, Jong-Yun
    Choi, Yong Suk
    Bae, Sang-Eun
    Park, Tae -Hong
    Kim, Tae-Hyeong
    JOURNAL OF INDUSTRIAL AND ENGINEERING CHEMISTRY, 2023, 120 : 90 - 95
  • [45] Microsecond-scale X-ray imaging with Controlled-Drift Detectors
    Castoldi, A
    Galimberti, A
    Guazzoni, C
    Rehak, P
    Strüder, L
    NUCLEAR PHYSICS B-PROCEEDINGS SUPPLEMENTS, 2006, 150 : 150 - 154
  • [46] Microsecond-scale formation of Ni-Ti intermetallics in thermal spray coatings
    Tran, A. T. T.
    Goutier, S.
    Vardelle, A.
    Hyland, M. M.
    SURFACE & COATINGS TECHNOLOGY, 2017, 321 : 425 - 437
  • [47] Board- and Rack-scale optical interconnection architectures for disaggregated Data Centers
    Terzenidis, N.
    Moralis-Pegios, M.
    Pitris, S.
    Mourgias-Alexandris, G.
    Mitsolidou, C.
    Fotiadis, K.
    Vyrsokinos, K.
    Alexoudi, T.
    Pleros, N.
    OPTICAL INTERCONNECTS XX, 2020, 11286
  • [48] Characterization of shallow acceptors in GaAs by microsecond-scale time-resolved photoluminescence
    Sibierian Branch of Russian Acad of, Sciences, Novosibirsk, Russia
    Appl Phys Lett, 3 (373-375):
  • [49] MICROSECOND-SCALE ELECTRIC-FIELD PULSES IN-CLOUD LIGHTNING DISCHARGES
    VILLANUEVA, Y
    RAKOV, VA
    UMAN, MA
    BROOK, M
    JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 1994, 99 (D7) : 14353 - 14360
  • [50] Scalable Low-Power High-Performance Rack-Scale Optical Network
    Feng, Jun
    Wang, Zhehui
    Wang, Zhifei
    Chen, Xuanqi
    Chen, Shixi
    Zhang, Jiaxu
    Xu, Jiang
    PROCEEDINGS OF 2019 IEEE/ACM WORKSHOP ON PHOTONICS-OPTICS TECHNOLOGY ORIENTED NETWORKING, INFORMATION AND COMPUTING SYSTEMS (PHOTONICS2019), 2019, : 1 - 6