Characterizing Multi-Instance GPU for Machine Learning Workloads

被引:9
|
作者
Li, Baolin [1 ]
Gadepally, Viiay [2 ]
Samsi, Siddharth [2 ]
Tiwari, Devesh [1 ]
机构
[1] Northeastern Univ, Boston, MA 02115 USA
[2] MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02173 USA
关键词
Machine Learning; GPU; Characterization;
D O I
10.1109/IPDPSW55747.2022.00124
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As machine learning (ML) becomes more and more popular, datacenter operators use hardware accelerators such as GPUs to tackle the high computation demand of ML workloads. However, recent studies show that user-submitted jobs often underutilize the GPU streaming multiprocessor (SM) cores, resulting in hardware resource wastage. Motivated by this observation, GPU vendors have released software and hardware support for GPU resource sharing, for example, the NVIDIA Multi-Instance GPU (MIG) technique on A100 Tensor Core GPUs. In this work, we use several state-of-the-art deep learning (DL) models from various application areas to characterize the performance and energy consumption of the A100 GPU MIG mode operation. Our characterization reveals valuable insights into operating a MIG-enabled GPU datacenter.
引用
收藏
页码:724 / 731
页数:8
相关论文
共 50 条
  • [41] Multi-Instance Learning with Any Hypothesis Class
    Sabato, Sivan
    Tishby, Naftali
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 2999 - 3039
  • [42] Multi-Instance Learning from Supervised View
    Zhi-Hua Zhou
    Journal of Computer Science and Technology, 2006, 21 : 800 - 809
  • [43] Multi-Instance Learning Based Web Mining
    Zhi-Hua Zhou
    Kai Jiang
    Ming Li
    Applied Intelligence, 2005, 22 : 135 - 147
  • [44] Multi-instance learning from supervised view
    Zhou, Zhi-Hua
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2006, 21 (05) : 800 - 809
  • [45] Query-Driven Multi-Instance Learning
    Hsu, Yen-Chi
    Hong, Cheng-Yao
    Lee, Ming-Sui
    Liu, Tyng-Luh
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4158 - 4165
  • [46] A multi-instance multi-label learning algorithm based on instance correlations
    Liu, Chanjuan
    Chen, Tongtong
    Ding, Xinmiao
    Zou, Hailin
    Tong, Yan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (19) : 12263 - 12284
  • [47] Bag dissimilarity regularized multi-instance learning
    Huang, Shiluo
    Liu, Zheng
    Jin, Wei
    Mu, Ying
    PATTERN RECOGNITION, 2022, 126
  • [48] Multi-instance learning based web mining
    Zhou, ZH
    Jiang, K
    Li, M
    APPLIED INTELLIGENCE, 2005, 22 (02) : 135 - 147
  • [49] Domain transfer multi-instance dictionary learning
    Wang, Ke
    Liu, Jiayong
    Gonzalez, Daniel
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S983 - S992
  • [50] Domain transfer multi-instance dictionary learning
    Ke Wang
    Jiayong Liu
    Daniel González
    Neural Computing and Applications, 2017, 28 : 983 - 992