Characterizing Multi-Instance GPU for Machine Learning Workloads

Cited by: 9
Authors
Li, Baolin [1]
Gadepally, Vijay [2]
Samsi, Siddharth [2]
Tiwari, Devesh [1]
Affiliations
[1] Northeastern Univ, Boston, MA 02115 USA
[2] MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02173 USA
Keywords
Machine Learning; GPU; Characterization
DOI
10.1109/IPDPSW55747.2022.00124
Chinese Library Classification number
TP3 [Computing Technology, Computer Technology]
Discipline classification code
0812
Abstract
As machine learning (ML) becomes more and more popular, datacenter operators use hardware accelerators such as GPUs to tackle the high computation demand of ML workloads. However, recent studies show that user-submitted jobs often underutilize the GPU streaming multiprocessor (SM) cores, resulting in hardware resource wastage. Motivated by this observation, GPU vendors have released software and hardware support for GPU resource sharing, for example, the NVIDIA Multi-Instance GPU (MIG) technique on A100 Tensor Core GPUs. In this work, we use several state-of-the-art deep learning (DL) models from various application areas to characterize the performance and energy consumption of the A100 GPU MIG mode operation. Our characterization reveals valuable insights into operating a MIG-enabled GPU datacenter.
Pages: 724-731
Page count: 8