ResourceExchange: Latency-Aware Scheduling in Virtualized Environments with High Performance Fabrics

被引：3

作者：

Ranadive, Adit ^{[1
]}

Gavrilovska, Ada ^{[1
]}

Schwan, Karsten ^{[1
]}

机构：

[1] Georgia Inst Technol, CERCS, Atlanta, GA 30332 USA

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) | 2011年

关键词：

D O I：

10.1109/CLUSTER.2011.14

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Virtualized infrastructures have seen strong acceptance in data center systems and applications, but have not yet seen adoptance for latency-sensitive codes which require I/O to arrive predictability, or response times to be generated within certain timeliness guarantees. Examples of such applications include certain classes of parallel HPC codes, server systems performing phonecall or multimedia delivery, or financial services in electronic trading platforms, like ICE and CME. In this paper, we argue that the use of high-performance, VMM-bypass capable devices can help create the virtualized infrastructures needed for the latency-sensitive applications listed above. However, to enable consolidation, problems to be solved go beyond efficient I/O virtualization, and include dealing with the shared use of I/O and compute resource, in ways that minimize or eliminate interference. Toward this end, we describe ResEx - a resource management approach for virtualized RDMA-based platforms which incorporates concepts from supply-demand theory and congestion pricing to dynamically control the allocation of CPU and I/O resources of guest VMs. ResEx and its mechanisms and abstractions allow multiple 'pricing policies' to be deployed on these types of virtualized platforms, including such which reduce interference and enhance isolation by identifying and taxing VMs responsible for resource congestion. While the main ideas behind ResEx are more general, the design presented in this paper is specific for InfiniBand RDMA-based virtualized platforms due to the use of asynchronous monitoring needed to determine the VMs' I/O usage, and the methods to establish the trading rate for the underlying CPU and I/O resources. The latter is particularly necessary since the hypervisor's only mechanism to control I/O usage is by making appropriate adjustments in the VM's CPU resources. The experimental evaluation of our solution uses InfiniBand platforms virtualized with the open source Xen hypervisor, and an RDMA-based latency-sensitive benchmark, BenchEx, based on a model of a financial trading platform. The results demonstrate the utility of the ResEx approach in making RDMA-based virtualized platforms more manageable and better suited for hosting even latency-sensitive workloads. ResEx can reduce the latency interference by as much as 30% in some cases as shown.

引用

页码：45 / 53

页数：9

共 50 条

[21] Performance Analysis of Latency-Aware Data Management in Industrial IoT Networks
Raptis, Theofanis P.
Passarella, Andrea
Conti, Marco
SENSORS, 2018, 18 (08)
[22] Latency-Aware Function Placement, Routing, and Scheduling in TSN-based Industrial Networks
Bhattacharjee, Sushmit
Alexandris, Konstantinos
Hansen, Emil
Pop, Paul
Bauschert, Thomas
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 4248 - 4254
[23] Latency-Aware Container Scheduling in Edge Cluster Upgrades: A Deep Reinforcement Learning Approach
Cui, Hanshuai
Tang, Zhiqing
Lou, Jiong
Jia, Weijia
Zhao, Wei
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 2530 - 2543
[24] Energy and Latency-Aware Scheduling Under Channel Uncertainties in LTE Networks for Massive IoT
Mohammad Reza Mardani
Salman Mohebi
Mohammad Ghanbari
Wireless Personal Communications, 2018, 103 : 2137 - 2154
[25] Quality/Latency-Aware Real-time Scheduling of Distributed Streaming IoT Applications
Barijough, Kamyar Mirzazad
Zhao, Zhuoran
Gerstlauer, Andreas
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2019, 18 (05)
[26] TOPOSCH: Latency-Aware Scheduling Based on Critical Path Analysis on Shared YARN Clusters
Hu, Chunming
Zhu, Jianyong
Yang, Renyu
Peng, Hao
Wo, Tianyu
Xue, Shiqing
Yu, Xiaoqiang
Xu, Jie
Ranjan, Rajiv
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020), 2020, : 619 - 627
[27] Energy and Latency-Aware Scheduling Under Channel Uncertainties in LTE Networks for Massive IoT
Mardani, Mohammad Reza
Mohebi, Salman
Ghanbari, Mohammad
WIRELESS PERSONAL COMMUNICATIONS, 2018, 103 (03) : 2137 - 2154
[28] Practical Latency-aware Scheduling for Low-latency Elephant VR Flows in Wi-Fi Networks
Lu, Shao-Jung
Chen, Wei-Xun
Su, Yu-Shao
Chang, Yu-Shou
Liu, Yao-Wen
Li, Chi-Yu
Tu, Guan-Hua
2024 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS, PERCOM, 2024, : 57 - 68
[29] Error source and latency-aware read performance optimization scheme for aged SSDs
Nie, Shiqiang
Zhang, Chi
Zhang, Chen
Zheng, Xuda
Wu, Weiguo
IEICE ELECTRONICS EXPRESS, 2021, 18 (08):
[30] Latency-Aware Neural Architecture Performance Predictor With Query-to-Tier Technique
Guo, Bicheng
Xu, Lilin
Chen, Tao
Ye, Peng
He, Shibo
Liu, Haoyu
Chen, Jiming
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5868 - 5883

← 1 2 3 4 5 →