UVM Discard: Eliminating Redundant Memory Transfers for Accelerators

Cited by: 3
Authors
Zhu, Weixi [1]
Cox, Guilherme [2]
Vesely, Jan [2]
Hairgrove, Mark [2]
Cox, Alan L. [1]
Rixner, Scott [1]
Affiliations
[1] Rice Univ, Houston, TX 77251 USA
[2] NVIDIA, Santa Clara, CA USA
Source
2022 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2022) | 2022
Keywords
DOI
10.1109/IISWC55918.2022.00013
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
An increasing number of applications benefit from heterogeneous hardware accelerators. Such accelerators often require the application to manually manage memory buffers on devices and transfer data between host and device buffers. A programming model that unifies the virtual address space across the host and devices is appealing because it enables automatic memory transfers and simplifies application-level programming. However, the automatic memory transfers can sometimes be redundant, which decreases performance. NVIDIA's UVM (unified virtual memory) driver provides a unified virtual address space for CPU-GPU programming. This paper identifies redundant memory transfers (RMTs) as a common performance issue with UVM. To address this issue, this paper proposes a data discard directive, and evaluates two implementations of that directive, UvmDiscard and UvmDiscardLazy. This directive exploits application-level knowledge to avoid RMTs. The implementations were integrated with NVIDIA's open-source UVM driver to demonstrate their usefulness on real-world CUDA UVM applications. For example, the use of the discard directive increases training throughput by 61.2% on a large deep learning application that oversubscribes GPU memory.
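The idea of a redundant memory transfer can be illustrated with a toy model. The sketch below is not the paper's implementation and does not use the UVM driver or any CUDA API; the class `ToyUnifiedMemory`, its methods, and the 4096-byte page size are hypothetical names and values chosen for illustration. It assumes a simplified world in which moving a page between CPU and GPU copies its contents unless the application has declared those contents dead with a discard call.

```python
from dataclasses import dataclass, field

PAGE_SIZE = 4096  # bytes; illustrative value, not taken from the paper


@dataclass
class ToyUnifiedMemory:
    """Toy model: each page lives on 'cpu' or 'gpu'; moving a live page costs a copy."""
    location: dict = field(default_factory=dict)  # page id -> "cpu" or "gpu"
    discarded: set = field(default_factory=set)   # pages whose contents are dead
    bytes_transferred: int = 0

    def allocate(self, pages, where="gpu"):
        for p in pages:
            self.location[p] = where

    def write(self, pages):
        # Writing new data makes the contents live again.
        self.discarded.difference_update(pages)

    def discard(self, pages):
        # Hypothetical stand-in for a discard directive: the application
        # states that it no longer needs the current contents.
        self.discarded.update(pages)

    def migrate(self, pages, dest):
        for p in pages:
            if self.location[p] == dest:
                continue
            if p not in self.discarded:
                # Contents are live, so they must be copied.
                self.bytes_transferred += PAGE_SIZE
            self.location[p] = dest


def run(use_discard):
    mem = ToyUnifiedMemory()
    scratch = range(8)                  # an 8-page temporary buffer
    mem.allocate(scratch, where="gpu")
    mem.write(scratch)                  # a GPU kernel fills the buffer
    if use_discard:
        mem.discard(scratch)            # application knows the data is dead
    mem.migrate(scratch, "cpu")         # e.g. eviction under memory pressure
    return mem.bytes_transferred


if __name__ == "__main__":
    print("without discard:", run(False), "bytes copied")
    print("with discard:   ", run(True), "bytes copied")
```

In this model the copy made without the discard call is redundant because nothing ever reads the evicted data again; the discard call is what lets the memory manager know that.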
Pages: 27-38
Page count: 12