Fast VMM-based overlay networking for bridging the cloud and high performance computing

Cited by: 0
Authors
Lei Xia
Zheng Cui
John Lange
Yuan Tang
Peter Dinda
Patrick Bridges
Affiliations
[1] Northwestern University
[2] University of New Mexico
[3] University of Pittsburgh
[4] University of Electronic Science and Technology of China
Source
Cluster Computing | 2014 / Vol. 17
Keywords
Overlay networks; Virtualization; HPC; Scalability
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
A collection of virtual machines (VMs) interconnected with an overlay network with a layer 2 abstraction has proven to be a powerful, unifying abstraction for adaptive distributed and parallel computing on loosely-coupled environments. It is now feasible to allow VMs hosting high performance computing (HPC) applications to seamlessly bridge distributed cloud resources and tightly-coupled supercomputing and cluster resources. However, to achieve the application performance that the tightly-coupled resources are capable of, it is important that the overlay network not introduce significant overhead relative to the native hardware, which is not the case for current user-level tools, including our own existing VNET/U system. In response, we describe the design, implementation, and evaluation of a virtual networking system that has negligible latency and bandwidth overheads in 1–10 Gbps networks. Our system, VNET/P, is directly embedded into our publicly available Palacios virtual machine monitor (VMM). VNET/P achieves native performance on 1 Gbps Ethernet networks and very high performance on 10 Gbps Ethernet networks. The NAS benchmarks generally achieve over 95% of their native performance on both 1 and 10 Gbps networks. We have further demonstrated that VNET/P can operate successfully over more specialized tightly-coupled networks, such as InfiniBand and Cray Gemini. Our results suggest it is feasible to extend a software-based overlay network designed for computing at wide-area scales into tightly-coupled environments.
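To make the "layer 2 abstraction" in the abstract concrete, the following is a minimal sketch of how an overlay might carry a guest's Ethernet frame inside a datagram payload between hosts. It is an illustration under stated assumptions, not VNET/P's actual wire format: the 8-byte overlay header, the `MAGIC` value, the `link_id` field, and all function names are hypothetical.

```python
import struct
from typing import Tuple

# Hypothetical overlay header: 4-byte magic + 4-byte link id.
# This layout is an assumption for illustration, not VNET/P's real format.
OVERLAY_HEADER = struct.Struct("!4sI")
MAGIC = b"OVL0"


def build_ethernet_frame(dst_mac: bytes, src_mac: bytes,
                         ethertype: int, payload: bytes) -> bytes:
    """Build a raw Ethernet II frame (no preamble, no FCS)."""
    if len(dst_mac) != 6 or len(src_mac) != 6:
        raise ValueError("MAC addresses must be 6 bytes")
    return dst_mac + src_mac + struct.pack("!H", ethertype) + payload


def encapsulate(frame: bytes, link_id: int) -> bytes:
    """Prefix the guest's frame with the overlay header.

    The result would be sent as the payload of a datagram to the remote host.
    """
    return OVERLAY_HEADER.pack(MAGIC, link_id) + frame


def decapsulate(datagram: bytes) -> Tuple[int, bytes]:
    """Strip the overlay header and return (link_id, original frame)."""
    if len(datagram) < OVERLAY_HEADER.size:
        raise ValueError("datagram too short")
    magic, link_id = OVERLAY_HEADER.unpack_from(datagram)
    if magic != MAGIC:
        raise ValueError("not an overlay datagram")
    return link_id, datagram[OVERLAY_HEADER.size:]


if __name__ == "__main__":
    frame = build_ethernet_frame(b"\xff" * 6, b"\x02\x00\x00\x00\x00\x01",
                                 0x0800, b"hello")
    link_id, recovered = decapsulate(encapsulate(frame, link_id=7))
    print(link_id, recovered == frame)
```

Because the guest's frame travels unmodified, the VMs see an ordinary Ethernet segment regardless of the physical network underneath. Every byte of header and every copy on this path is overhead relative to native hardware, which is the cost the abstract says a VMM-embedded design aims to minimize.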
Pages: 39-59
Page count: 20