Fast VMM-based overlay networking for bridging the cloud and high performance computing

Cited by: 0
Authors
Lei Xia
Zheng Cui
John Lange
Yuan Tang
Peter Dinda
Patrick Bridges
Affiliations
[1] Northwestern University
[2] University of New Mexico
[3] University of Pittsburgh
[4] University of Electronic Science and Technology of China
Source
Cluster Computing | 2014 / Vol. 17
Keywords
Overlay networks; Virtualization; HPC; Scalability
DOI
Not available
Abstract
A collection of virtual machines (VMs) interconnected with an overlay network with a layer 2 abstraction has proven to be a powerful, unifying abstraction for adaptive distributed and parallel computing on loosely-coupled environments. It is now feasible to allow VMs hosting high performance computing (HPC) applications to seamlessly bridge distributed cloud resources and tightly-coupled supercomputing and cluster resources. However, to achieve the application performance that the tightly-coupled resources are capable of, it is important that the overlay network not introduce significant overhead relative to the native hardware, which is not the case for current user-level tools, including our own existing VNET/U system. In response, we describe the design, implementation, and evaluation of a virtual networking system that has negligible latency and bandwidth overheads in 1–10 Gbps networks. Our system, VNET/P, is directly embedded into our publicly available Palacios virtual machine monitor (VMM). VNET/P achieves native performance on 1 Gbps Ethernet networks and very high performance on 10 Gbps Ethernet networks. The NAS benchmarks generally achieve over 95 % of their native performance on both 1 and 10 Gbps. We have further demonstrated that VNET/P can operate successfully over more specialized tightly-coupled networks, such as Infiniband and Cray Gemini. Our results suggest it is feasible to extend a software-based overlay network designed for computing at wide-area scales into tightly-coupled environments.
Pages: 39-59
Number of pages: 20