Characterizing Multi-Instance GPU for Machine Learning Workloads

被引:9
|
作者
Li, Baolin [1 ]
Gadepally, Viiay [2 ]
Samsi, Siddharth [2 ]
Tiwari, Devesh [1 ]
机构
[1] Northeastern Univ, Boston, MA 02115 USA
[2] MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02173 USA
关键词
Machine Learning; GPU; Characterization;
D O I
10.1109/IPDPSW55747.2022.00124
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As machine learning (ML) becomes more and more popular, datacenter operators use hardware accelerators such as GPUs to tackle the high computation demand of ML workloads. However, recent studies show that user-submitted jobs often underutilize the GPU streaming multiprocessor (SM) cores, resulting in hardware resource wastage. Motivated by this observation, GPU vendors have released software and hardware support for GPU resource sharing, for example, the NVIDIA Multi-Instance GPU (MIG) technique on A100 Tensor Core GPUs. In this work, we use several state-of-the-art deep learning (DL) models from various application areas to characterize the performance and energy consumption of the A100 GPU MIG mode operation. Our characterization reveals valuable insights into operating a MIG-enabled GPU datacenter.
引用
收藏
页码:724 / 731
页数:8
相关论文
共 50 条
  • [31] Multi-instance Learning based on Instance Consistency for Image Retrieval
    Zhang, Miao
    Wu, Zhize
    Wan, Shouhong
    Yue, Lihua
    Yin, Bangjie
    NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [32] Instance Explainable Multi-instance Learning for ROI of Various Data
    Zhao, Xu
    Wang, Zihao
    Zhang, Yong
    Xing, Chunxiao
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT II, 2020, 12113 : 107 - 124
  • [33] Instance-Level Label Propagation with Multi-Instance Learning
    Wang, Qifan
    Chechik, Gal
    Sun, Chen
    Shen, Bin
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2943 - 2949
  • [34] Learnability of multi-instance multi-label learning
    WANG Wei & ZHOU ZhiHua National Key Laboratory for Novel Software Technology
    Chinese Science Bulletin, 2012, 57 (19) : 2492 - 2495
  • [35] Multi-Instance Multi-Label Active Learning
    Huang, Sheng-Jun
    Gao, Nengneng
    Chen, Songcan
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1886 - 1892
  • [36] Fast Multi-Instance Multi-Label Learning
    Huang, Sheng-Jun
    Gao, Wei
    Zhou, Zhi-Hua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (11) : 2614 - 2627
  • [37] Active Multi-Instance Multi-Label Learning
    Retz, Robert
    Schwenker, Friedhelm
    ANALYSIS OF LARGE AND COMPLEX DATA, 2016, : 91 - 101
  • [38] Fast Multi-Instance Multi-Label Learning
    Huang, Sheng-Jun
    Gao, Wei
    Zhou, Zhi-Hua
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1868 - 1874
  • [39] Robust and Discriminative Distance for Multi-Instance Learning
    Wang, Hua
    Nie, Feiping
    Huang, Heng
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2919 - 2924
  • [40] Multi-Instance Learning With Emerging Novel Class
    Wei, Xiu-Shen
    Ye, Han-Jia
    Mu, Xin
    Wu, Jianxin
    Shen, Chunhua
    Zhou, Zhi-Hua
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (05) : 2109 - 2120