Scavenger: A Black-Box Batch Workload Resource Manager for Improving Utilization in Cloud Environments

被引:37
作者
Javadi, Seyyed Ahmad [1 ]
Suresh, Amoghavarsha [1 ]
Wajahat, Muhammad [1 ]
Gandhi, Anshul [1 ]
机构
[1] SUNY Stony Brook, Dept Comp Sci, PACE Lab, Stony Brook, NY 11794 USA
来源
PROCEEDINGS OF THE 2019 TENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '19) | 2019年
基金
美国国家科学基金会;
关键词
Cloud computing; resource utiliization; background workload; HIGH-PERFORMANCE; QOS;
D O I
10.1145/3357223.3362734
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Resource rider-utilization is common in cloud data centers. Prior works have proposed improving utilization by running provider workloads in the background, colocated with tenant workloads. However, an important challenge that has still not been addressed is considering the tenant workloads as a black-box. We present Scavenger, a batch workload manager that opportunistically runs containerized batch jobs next to black-box tenant VMs to improve utilization. Scavenger is designed to work without requiring any offline profiling or prior information about the tenant workload. To meet the tenant VMs' resource demand at all Limes, Scavenger dynamically regulates the resource usage of batch jobs, including processor usage, memory capacity, and network bandwidth. We experimentally evaluate Scavenger on two different testbeds using latency-sensitive tenant workloads colocated with Spark jobs in the background and show that Scavenger significantly increases resource usage without compromising the resource demands of tenant VMs.
引用
收藏
页码:272 / 285
页数:14
相关论文
共 55 条
[1]   PARTIES: QoS-Aware Resource Partitioning for Multiple Interactive Services [J].
Chen, Shuang ;
Delimitrou, Christina ;
Martinez, Jose F. .
TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, :107-120
[2]  
Cooper Brian F, 2010, P 1 ACM S CLOUD COMP, P143, DOI DOI 10.1145/1807128.1807152
[3]   Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms [J].
Cortez, Eli ;
Bonde, Anand ;
Muzio, Alexandre ;
Russinovich, Mark ;
Fontoura, Marcus ;
Bianchini, Ricardo .
PROCEEDINGS OF THE TWENTY-SIXTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES (SOSP '17), 2017, :153-167
[4]  
DeCandia Giuseppe, 2007, Operating Systems Review, V41, P205, DOI 10.1145/1323293.1294281
[5]   Tarcil: Reconciling Scheduling Speed and Quality in Large Shared Clusters [J].
Delimitrou, Christina ;
Sanchez, Daniel ;
Kozyrakis, Christos .
ACM SoCC'15: Proceedings of the Sixth ACM Symposium on Cloud Computing, 2015, :97-110
[6]   Quasar: Resource-Efficient and QoS-Aware Cluster Management [J].
Delimitrou, Christina ;
Kozyrakis, Christos .
ACM SIGPLAN NOTICES, 2014, 49 (04) :127-143
[7]   Paragon: QoS-Aware Scheduling for Heterogeneous Datacenters [J].
Delimitrou, Christina ;
Kozyrakis, Christos .
ACM SIGPLAN NOTICES, 2013, 48 (04) :77-88
[8]  
Douglas C., 2013, P 4 ANN S
[9]   Fairness via Source Throttling: A Configurable and High-Performance Fairness Substrate for Multi-Core Memory Systems [J].
Ebrahimi, Eiman ;
Lee, Chang Joo ;
Mutlu, Onur ;
Patt, Yale N. .
ACM SIGPLAN NOTICES, 2010, 45 (03) :335-346
[10]   Clearing the Clouds A Study of Emerging Scale-out Workloads on Modern Hardware [J].
Ferdman, Michael ;
Adileh, Almutaz ;
Kocberber, Onur ;
Volos, Stavros ;
Alisafaee, Mohammad ;
Jevdjic, Djordje ;
Kaynak, Cansu ;
Popescu, Adrian Daniel ;
Ailamaki, Anastasia ;
Falsafi, Babak .
ACM SIGPLAN NOTICES, 2012, 47 (04) :37-47