Supporting GPU Sharing in Cloud Environments with a Transparent Runtime Consolidation Framework

被引:0
作者
Ravi, Vignesh T. [1 ]
Becchi, Michela
Agrawal, Gagan [1 ]
Chakradhar, Srimat
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
来源
HPDC 11: PROCEEDINGS OF THE 20TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING | 2011年
关键词
Cloud Computing; Virtualization; GPU; CUDA;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Driven by the emergence of GPUs as a major player in high performance computing and the rapidly growing popularity of cloud environments, GPU instances are now being offered by cloud providers. The use of GPUs in a cloud environment, however, is still at initial stages, and the challenge of making GPU a true shared resource in the cloud has not yet been addressed. This paper presents a framework to enable applications executing within virtual machines to transparently share one or more GPUs. Our contributions are twofold: we extend an open source GPU virtualization software to include efficient GPU sharing, and we propose solutions to the conceptual problem of GPU kernel consolidation. In particular, we introduce a method for computing the affinity score between two or more kernels, which provides an indication of potential performance improvements upon kernel consolidation. In addition, we explore molding as a means to achieve efficient GPU sharing also in the case of kernels with high or conflicting resource requirements. We use these concepts to develop an algorithm to efficiently map a set of kernels on a pair of GPUs. We extensively evaluate our framework using eight popular GPU kernels and two Fermi GPUs. We find that even when contention is high our consolidation algorithm is effective in improving the throughput, and that the runtime overhead of our framework is low.
引用
收藏
页码:217 / 228
页数:12
相关论文
共 32 条
  • [1] [Anonymous], 2009, IPDPS
  • [2] [Anonymous], 2009, Proceedings of the 3rd ACM Workshop on System-Level Virtualization for High Performance Computing, HPCVirt'09, page, DOI DOI 10.1145/1519138.1519141
  • [3] Baghsorkhi Sara., 2008, LCPC 2008
  • [4] Becchi M, 2010, SPAA '10: PROCEEDINGS OF THE TWENTY-SECOND ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, P82
  • [5] Chi-Keung Luk, 2009, Proceedings of the 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2009), P45
  • [6] Using moldability to improve the performance of supercomputer jobs
    Cirne, W
    Berman, F
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2002, 62 (10) : 1571 - 1601
  • [7] Deelman Ewa., 2008, SC 08, P1
  • [8] DIAO Y, 2002, AM CONTR C ACC ANCH, P4922
  • [9] Duato Jose, 2010, 2010 International Conference on High Performance Computing & Simulation (HPCS 2010), P224, DOI 10.1109/HPCS.2010.5547126
  • [10] Feitelson DG, 2005, LECT NOTES COMPUT SC, V3277, P1