ResourceExchange: Latency-Aware Scheduling in Virtualized Environments with High Performance Fabrics

被引:3
|
作者
Ranadive, Adit [1 ]
Gavrilovska, Ada [1 ]
Schwan, Karsten [1 ]
机构
[1] Georgia Inst Technol, CERCS, Atlanta, GA 30332 USA
关键词
D O I
10.1109/CLUSTER.2011.14
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Virtualized infrastructures have seen strong acceptance in data center systems and applications, but have not yet seen adoptance for latency-sensitive codes which require I/O to arrive predictability, or response times to be generated within certain timeliness guarantees. Examples of such applications include certain classes of parallel HPC codes, server systems performing phonecall or multimedia delivery, or financial services in electronic trading platforms, like ICE and CME. In this paper, we argue that the use of high-performance, VMM-bypass capable devices can help create the virtualized infrastructures needed for the latency-sensitive applications listed above. However, to enable consolidation, problems to be solved go beyond efficient I/O virtualization, and include dealing with the shared use of I/O and compute resource, in ways that minimize or eliminate interference. Toward this end, we describe ResEx - a resource management approach for virtualized RDMA-based platforms which incorporates concepts from supply-demand theory and congestion pricing to dynamically control the allocation of CPU and I/O resources of guest VMs. ResEx and its mechanisms and abstractions allow multiple 'pricing policies' to be deployed on these types of virtualized platforms, including such which reduce interference and enhance isolation by identifying and taxing VMs responsible for resource congestion. While the main ideas behind ResEx are more general, the design presented in this paper is specific for InfiniBand RDMA-based virtualized platforms due to the use of asynchronous monitoring needed to determine the VMs' I/O usage, and the methods to establish the trading rate for the underlying CPU and I/O resources. The latter is particularly necessary since the hypervisor's only mechanism to control I/O usage is by making appropriate adjustments in the VM's CPU resources. The experimental evaluation of our solution uses InfiniBand platforms virtualized with the open source Xen hypervisor, and an RDMA-based latency-sensitive benchmark, BenchEx, based on a model of a financial trading platform. The results demonstrate the utility of the ResEx approach in making RDMA-based virtualized platforms more manageable and better suited for hosting even latency-sensitive workloads. ResEx can reduce the latency interference by as much as 30% in some cases as shown.
引用
收藏
页码:45 / 53
页数:9
相关论文
共 50 条
  • [1] JANUS: Latency-Aware Traffic Scheduling for IoT Data Streaming in Edge Environments
    Wen, Zhenyu
    Yang, Renyu
    Qian, Bin
    Xuan, Yubo
    Lu, Lingling
    Wang, Zheng
    Peng, Hao
    Xu, Jie
    Zomaya, Albert Y.
    Ranjan, Rajiv
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (06) : 4302 - 4316
  • [2] Toward Latency-Aware Dynamic Middlebox Scheduling
    Duan, Pengfei
    Li, Qing
    Jiang, Yong
    Xia, Shu-Tao
    24TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS ICCCN 2015, 2015,
  • [3] Latency-aware Virtualized Network Function provisioning for distributed edge clouds
    Son, Jungmin
    Buyya, Rajkumar
    JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 152 : 24 - 31
  • [4] Latency-aware Scheduling in the Cloud-Edge Continuum
    Chiaro, Cristopher
    Monaco, Doriana
    Sacco, Alessio
    Casetti, Claudio
    Marchetto, Guido
    PROCEEDINGS OF 2024 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, NOMS 2024, 2024,
  • [5] Latency-Aware Kubernetes Scheduling for Microservices Orchestration at the Edge
    Centofanti, C.
    Tiberti, W.
    Marotta, A.
    Graziosi, F.
    Cassioli, D.
    2023 IEEE 9TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT, 2023, : 426 - 431
  • [6] Energy and Latency-aware Resource Reconfiguration in Fog Environments
    Godinho, Noe
    Silva, Henrique
    Curado, Marilia
    Paquete, Luis
    2020 IEEE 19TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2020,
  • [7] A Latency-Aware Scheduling Strategy for NR DCP DRX Mechanism
    Wu, Yuh-Tser
    Liu, He-Hsuan
    Lin, Kuang-Hsun
    Wei, Hung-Yu
    2022 IEEE VTS ASIA PACIFIC WIRELESS COMMUNICATIONS SYMPOSIUM, APWCS, 2022, : 51 - 55
  • [8] Quokka: Latency-Aware Middlebox Scheduling with dynamic resource allocation
    Li, Qing
    Jiang, Yong
    Duan, Pengfei
    Xu, Mingwei
    Xiao, Xi
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2017, 78 : 253 - 266
  • [9] Latency-Aware Application Module Management for Fog Computing Environments
    Mahmud, Redowan
    Ramamohanarao, Kotagiri
    Buyya, Rajkumar
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2019, 19 (01)
  • [10] FLASH: Foresighted latency-aware scheduling heuristic for processors with customized datapaths
    Kudlur, M
    Fan, K
    Chu, M
    Ravindran, R
    Clark, N
    Mahlke, S
    CGO 2004: INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2004, : 201 - 212