ResourceExchange: Latency-Aware Scheduling in Virtualized Environments with High Performance Fabrics

被引:3
|
作者
Ranadive, Adit [1 ]
Gavrilovska, Ada [1 ]
Schwan, Karsten [1 ]
机构
[1] Georgia Inst Technol, CERCS, Atlanta, GA 30332 USA
关键词
D O I
10.1109/CLUSTER.2011.14
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Virtualized infrastructures have seen strong acceptance in data center systems and applications, but have not yet seen adoptance for latency-sensitive codes which require I/O to arrive predictability, or response times to be generated within certain timeliness guarantees. Examples of such applications include certain classes of parallel HPC codes, server systems performing phonecall or multimedia delivery, or financial services in electronic trading platforms, like ICE and CME. In this paper, we argue that the use of high-performance, VMM-bypass capable devices can help create the virtualized infrastructures needed for the latency-sensitive applications listed above. However, to enable consolidation, problems to be solved go beyond efficient I/O virtualization, and include dealing with the shared use of I/O and compute resource, in ways that minimize or eliminate interference. Toward this end, we describe ResEx - a resource management approach for virtualized RDMA-based platforms which incorporates concepts from supply-demand theory and congestion pricing to dynamically control the allocation of CPU and I/O resources of guest VMs. ResEx and its mechanisms and abstractions allow multiple 'pricing policies' to be deployed on these types of virtualized platforms, including such which reduce interference and enhance isolation by identifying and taxing VMs responsible for resource congestion. While the main ideas behind ResEx are more general, the design presented in this paper is specific for InfiniBand RDMA-based virtualized platforms due to the use of asynchronous monitoring needed to determine the VMs' I/O usage, and the methods to establish the trading rate for the underlying CPU and I/O resources. The latter is particularly necessary since the hypervisor's only mechanism to control I/O usage is by making appropriate adjustments in the VM's CPU resources. The experimental evaluation of our solution uses InfiniBand platforms virtualized with the open source Xen hypervisor, and an RDMA-based latency-sensitive benchmark, BenchEx, based on a model of a financial trading platform. The results demonstrate the utility of the ResEx approach in making RDMA-based virtualized platforms more manageable and better suited for hosting even latency-sensitive workloads. ResEx can reduce the latency interference by as much as 30% in some cases as shown.
引用
收藏
页码:45 / 53
页数:9
相关论文
共 50 条
  • [21] Performance Analysis of Latency-Aware Data Management in Industrial IoT Networks
    Raptis, Theofanis P.
    Passarella, Andrea
    Conti, Marco
    SENSORS, 2018, 18 (08)
  • [22] Latency-Aware Function Placement, Routing, and Scheduling in TSN-based Industrial Networks
    Bhattacharjee, Sushmit
    Alexandris, Konstantinos
    Hansen, Emil
    Pop, Paul
    Bauschert, Thomas
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 4248 - 4254
  • [23] Latency-Aware Container Scheduling in Edge Cluster Upgrades: A Deep Reinforcement Learning Approach
    Cui, Hanshuai
    Tang, Zhiqing
    Lou, Jiong
    Jia, Weijia
    Zhao, Wei
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 2530 - 2543
  • [24] Energy and Latency-Aware Scheduling Under Channel Uncertainties in LTE Networks for Massive IoT
    Mohammad Reza Mardani
    Salman Mohebi
    Mohammad Ghanbari
    Wireless Personal Communications, 2018, 103 : 2137 - 2154
  • [25] Quality/Latency-Aware Real-time Scheduling of Distributed Streaming IoT Applications
    Barijough, Kamyar Mirzazad
    Zhao, Zhuoran
    Gerstlauer, Andreas
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2019, 18 (05)
  • [26] TOPOSCH: Latency-Aware Scheduling Based on Critical Path Analysis on Shared YARN Clusters
    Hu, Chunming
    Zhu, Jianyong
    Yang, Renyu
    Peng, Hao
    Wo, Tianyu
    Xue, Shiqing
    Yu, Xiaoqiang
    Xu, Jie
    Ranjan, Rajiv
    2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020), 2020, : 619 - 627
  • [27] Energy and Latency-Aware Scheduling Under Channel Uncertainties in LTE Networks for Massive IoT
    Mardani, Mohammad Reza
    Mohebi, Salman
    Ghanbari, Mohammad
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 103 (03) : 2137 - 2154
  • [28] Practical Latency-aware Scheduling for Low-latency Elephant VR Flows in Wi-Fi Networks
    Lu, Shao-Jung
    Chen, Wei-Xun
    Su, Yu-Shao
    Chang, Yu-Shou
    Liu, Yao-Wen
    Li, Chi-Yu
    Tu, Guan-Hua
    2024 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS, PERCOM, 2024, : 57 - 68
  • [29] Error source and latency-aware read performance optimization scheme for aged SSDs
    Nie, Shiqiang
    Zhang, Chi
    Zhang, Chen
    Zheng, Xuda
    Wu, Weiguo
    IEICE ELECTRONICS EXPRESS, 2021, 18 (08):
  • [30] Latency-Aware Neural Architecture Performance Predictor With Query-to-Tier Technique
    Guo, Bicheng
    Xu, Lilin
    Chen, Tao
    Ye, Peng
    He, Shibo
    Liu, Haoyu
    Chen, Jiming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5868 - 5883