Efficient Use of GPU Memory for Large-Scale Deep Learning Model Training

被引:13
|
作者
Choi, Hyeonseong [1 ]
Lee, Jaehwan [1 ]
机构
[1] Korea Aerosp Univ, Sch Elect & Informat Engn, Goyang Si 10540, South Korea
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 21期
基金
新加坡国家研究基金会;
关键词
deep learning; large-scale model; CUDA Unified Memory; PyTorch;
D O I
10.3390/app112110377
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
To achieve high accuracy when performing deep learning, it is necessary to use a large-scale training model. However, due to the limitations of GPU memory, it is difficult to train large-scale training models within a single GPU. NVIDIA introduced a technology called CUDA Unified Memory with CUDA 6 to overcome the limitations of GPU memory by virtually combining GPU memory and CPU memory. In addition, in CUDA 8, memory advise options are introduced to efficiently utilize CUDA Unified Memory. In this work, we propose a newly optimized scheme based on CUDA Unified Memory to efficiently use GPU memory by applying different memory advise to each data type according to access patterns in deep learning training. We apply CUDA Unified Memory technology to PyTorch to see the performance of large-scale learning models through the expanded GPU memory. We conduct comprehensive experiments on how to efficiently utilize Unified Memory by applying memory advises when performing deep learning. As a result, when the data used for deep learning are divided into three types and a memory advise is applied to the data according to the access pattern, the deep learning execution time is reduced by 9.4% compared to the default Unified Memory.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Near-Field Beam Training for Extremely Large-Scale MIMO Based on Deep Learning
    Nie, Jiali
    Cui, Yuanhao
    Yang, Zhaohui
    Yuan, Weijie
    Jing, Xiaojun
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (01) : 352 - 362
  • [22] Near-Field Beam Training Based on Deep Learning for Extremely Large-Scale MIMO
    Jiang, Guoli
    Qi, Chenhao
    IEEE COMMUNICATIONS LETTERS, 2023, 27 (08) : 2063 - 2067
  • [23] Designing an Efficient Framework for Large-Scale Data Processing and Analysis Based on Deep Learning Technology
    Liu, Qian
    Wang, Xingda
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 269 - 274
  • [24] Efficient Scheduling in Training Deep Convolution Networks at Large Scale
    Que, Can
    Zhang, Xinming
    IEEE ACCESS, 2018, 6 : 61452 - 61456
  • [25] Efficient Multi-Training Framework of Image Deep Learning on GPU Cluster
    Chen, Chun-Fu
    Lee, Gwo Giun
    Xia, Yinglong
    Lin, W. Sabrina
    Suzumura, Toyotaro
    Lin, Ching-Yung
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 489 - 494
  • [26] Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems
    Dryden, Nikoli
    Maruyama, Naoya
    Moon, Tim
    Benson, Tom
    Yoo, Andy
    Snir, Marc
    Van Essen, Brian
    PROCEEDINGS OF 2018 IEEE/ACM MACHINE LEARNING IN HPC ENVIRONMENTS (MLHPC 2018), 2018, : 1 - 13
  • [27] Scheduling Large-scale Distributed Training via Reinforcement Learning
    Peng, Zhanglin
    Ren, Jiamin
    Zhang, Ruimao
    Wu, Lingyun
    Wang, Xinjiang
    Luo, Ping
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1797 - 1806
  • [28] DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training
    Wang, Lipeng
    Ye, Songgao
    Yang, Baichen
    Lu, Youyou
    Zhang, Hequan
    Yan, Shengen
    PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,
  • [29] A FAST AND PRECISE METHOD FOR LARGE-SCALE LAND-USE MAPPING BASED ON DEEP LEARNING
    Yang, Xuan
    Chen, Zhengchao
    Li, Baipeng
    Peng, Dailiang
    Chen, Pan
    Zhang, Bing
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 5913 - 5916
  • [30] Large-Scale Deep Learning for Building Intelligent Computer Systems
    Dean, Jeff
    PROCEEDINGS OF THE NINTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'16), 2016, : 1 - 1