Characterizing Multi-Instance GPU for Machine Learning Workloads

被引:9
|
作者
Li, Baolin [1 ]
Gadepally, Viiay [2 ]
Samsi, Siddharth [2 ]
Tiwari, Devesh [1 ]
机构
[1] Northeastern Univ, Boston, MA 02115 USA
[2] MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02173 USA
关键词
Machine Learning; GPU; Characterization;
D O I
10.1109/IPDPSW55747.2022.00124
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As machine learning (ML) becomes more and more popular, datacenter operators use hardware accelerators such as GPUs to tackle the high computation demand of ML workloads. However, recent studies show that user-submitted jobs often underutilize the GPU streaming multiprocessor (SM) cores, resulting in hardware resource wastage. Motivated by this observation, GPU vendors have released software and hardware support for GPU resource sharing, for example, the NVIDIA Multi-Instance GPU (MIG) technique on A100 Tensor Core GPUs. In this work, we use several state-of-the-art deep learning (DL) models from various application areas to characterize the performance and energy consumption of the A100 GPU MIG mode operation. Our characterization reveals valuable insights into operating a MIG-enabled GPU datacenter.
引用
收藏
页码:724 / 731
页数:8
相关论文
共 50 条
  • [1] Optimal Margin Distribution Machine for Multi-Instance Learning
    Zhang, Teng
    Jin, Hai
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2383 - 2389
  • [2] Improving Multi-Instance Multi-Label Learning by Extreme Learning Machine
    Yin, Ying
    Zhao, Yuhai
    Li, Chengguang
    Zhang, Bin
    APPLIED SCIENCES-BASEL, 2016, 6 (06):
  • [3] Optimal Representative Distribution Margin Machine for Multi-Instance Learning
    Luan, Tianxiang
    Luo, Tingjin
    Zhuge, Wenzhang
    Hou, Chenping
    IEEE ACCESS, 2020, 8 (08): : 74864 - 74874
  • [4] Chemical complexity challenge: Is multi-instance machine learning a solution?
    Zankov, Dmitry
    Madzhidov, Timur
    Varnek, Alexandre
    Polishchuk, Pavel
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2024, 14 (01)
  • [5] Scalable Multi-Instance Learning
    Wei, Xiu-Shen
    Wu, Jianxin
    Zhou, Zhi-Hua
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 1037 - 1042
  • [6] Multi-Instance Learning with Key Instance Shift
    Zhang, Ya-Lin
    Zhou, Zhi-Hua
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3441 - 3447
  • [7] EFFICIENT INSTANCE ANNOTATION IN MULTI-INSTANCE LEARNING
    Pham, Anh T.
    Raich, Raviv
    Fern, Xiaoli Z.
    2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 137 - 140
  • [8] Multi-Objective Multi-Instance Learning: A New Approach to Machine Learning for eSports
    Birant, Kokten Ulas
    Birant, Derya
    ENTROPY, 2023, 25 (01)
  • [9] MIGER: Integrating Multi-Instance GPU and Multi-Process Service for Deep Learning Clusters
    Zhang, Bowen
    Li, Shuxin
    Li, Zhuozhao
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 504 - 513
  • [10] Instance Annotation for Multi-Instance Multi-Label Learning
    Briggs, Forrest
    Fern, Xiaoli Z.
    Raich, Raviv
    Lou, Qi
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2013, 7 (03)