UVM Discard: Eliminating Redundant Memory Transfers for Accelerators

被引:3
作者
Zhu, Weixi [1 ]
Cox, Guilherme [2 ]
Vesely, Jan [2 ]
Hairgrove, Mark [2 ]
Cox, Alan L. [1 ]
Rixner, Scott [1 ]
机构
[1] Rice Univ, Houston, TX 77251 USA
[2] NVIDIA, Santa Clara, CA USA
来源
2022 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2022) | 2022年
关键词
D O I
10.1109/IISWC55918.2022.00013
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An increasing number of applications benefit from heterogeneous hardware accelerators. Such accelerators often require the application to manually manage memory buffers on devices and transfer data between host and device buffers. A programming model that unifies the virtual address space across the host and devices is appealing because it enables automatic memory transfers and simplifies application-level programming. However, the automatic memory transfers can sometimes be redundant, which decreases performance. NVIDIA's UVM (unified virtual memory) driver provides a unified virtual address space for CPU-GPU programming. This paper identifies redundant memory transfers (RMTs) as a common performance issue with UVM. To address this issue, this paper proposes a data discard directive, and evaluates two implementations of that directive, UvmDiscard and UvmDiscardLazy. This directive exploits application-level knowledge to avoid RMTs. The implementations were integrated with NVIDIA's open-source UVM driver to demonstrate their usefulness on real-world CUDA UVM applications. For example, the use of the discard directive increases training throughput by 61.2% on a large deep learning application that oversubscribes GPU memory.
引用
收藏
页码:27 / 38
页数:12
相关论文
共 50 条
  • [41] Eliminating the memory blocking effect
    Leynes, P. Andrew
    Rass, Olga
    Landau, Joshua D.
    MEMORY, 2008, 16 (08) : 852 - 872
  • [42] A protocol-independent technique for eliminating redundant network traffic
    Spring, NT
    Wetherall, D
    ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2000, 30 (04) : 87 - 95
  • [43] Information filtering based on eliminating redundant diffusion and compensating balance
    Liu, Xiangchun
    Su, Xin
    Ma, Jinming
    Zhu, Yuxiao
    Zhu, Xuzhen
    Tian, Hui
    INTERNATIONAL JOURNAL OF MODERN PHYSICS B, 2019, 33 (13):
  • [45] Eliminating redundant range checks in GNAT using symbolic evaluation
    Blieberger, J
    Burgstaller, B
    RELIABLE SOFTWARE TECHNOLOGIES - ADA-EUROPE 2003, 2003, 2655 : 153 - 167
  • [46] Fast computational architectures to decrease redundant calculations - Eliminating redundant digit calculation and excluding useless data
    Imai, M
    Nozawa, T
    Fujibayashi, M
    Kotani, K
    Ohmi, T
    IEICE TRANSACTIONS ON ELECTRONICS, 1999, E82C (09) : 1707 - 1714
  • [47] Early-Adaptor: An Adaptive Framework for Proactive UVM Memory Management
    Go, Seokjin
    Lee, Hyunwuk
    Kim, Junsung
    Lee, Jiwon
    Yoon, Myung Kuk
    Ro, Won Woo
    2023 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, ISPASS, 2023, : 248 - 258
  • [48] ENHANCING MEMORY WITH REDUNDANT LABELS
    DALE, HCA
    MCGLAUGHLIN, A
    NATURE, 1970, 227 (5256) : 411 - +
  • [49] A memory for two - Virtual transfers
    Chauvel, P
    REVUE FRANCAISE DE PSYCHANALYSE, 1998, 62 (02): : 639 - 644
  • [50] Impact of Memory Bandwidth on the Performance of Accelerators
    Mishra, Sambit
    Chakravorty, Dhruva K.
    Perez, Lisa M.
    Dang, Francis
    Liu, Honggao
    Witherden, Freddie David
    PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING 2024, PEARC 2024, 2024,