Accurately modeling the on-chip and off-chip GPU memory subsystem

被引:12
|
作者
Candel, Francisco [1 ]
Petit, Salvador [1 ]
Sahuquillo, Julio [1 ]
Duato, Jose [1 ]
机构
[1] Univ Politecn Valencia, Dept Comp Engn, Valencia 46012, Spain
关键词
Applied modeling and simulation; On-chip memory subsystem; Main memory controller; GDDR; Cache coherence protocol;
D O I
10.1016/j.future.2017.02.012
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Research on GPU architecture is becoming pervasive in both the academia and the industry because these architectures offer much more performance per watt than typical CPU architectures. This is the main reason why massive deployment of GPU multiprocessors is considered one of the most feasible solutions to attain exascale computing capabilities. The memory hierarchy of the GPU is a critical research topic, since its design goals widely differ from those of conventional CPU memory hierarchies. Researchers typically use detailed microarchitectural simulators to explore novel designs to better support GPGPU computing as well as to improve the performance of GPU and CPU-GPU systems. In this context, the memory hierarchy is a critical and continuously evolving subsystem. Unfortunately, the fast evolution of current memory subsystems deteriorates the accuracy of existing state-of-the-art simulators. This paper focuses on accurately modeling the entire (both on-chip and off-chip) GPU memory subsystem. For this purpose, we identify four main memory related components that impact on the overall performance accuracy. Three of them belong to the on-chip memory hierarchy: (i) memory request coalescing mechanisms, (ii) miss status holding registers, and (iii) cache coherence protocol; while the fourth component refers to the memory controller and GDDR memory working activity. To evaluate and quantify our claims, we accurately modeled the aforementioned memory components in an extended version of the state-of-the-art Multi2Sim heterogeneous CPUGPU processor simulator. Experimental results show important deviations, which can vary the final system performance provided by the simulation framework up to a factor of three. The proposed GPU model has been compared and validated against the original framework and the results from a real AMD Southern-Islands 7870HD GPU. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:510 / 519
页数:10
相关论文
共 50 条
  • [1] Optimum voltage swing on on-chip and off-chip interconnect
    Svensson, C
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2001, 36 (07) : 1108 - 1112
  • [2] Hypermultiplexed Off-Chip Hologram by On-Chip Integrated Metasurface
    Liu, Xianjin
    Ma, Zhanying
    Zhang, Dasen
    Bao, Qiwen
    Liu, Zhenzhen
    Xiao, Jun-Jun
    ADVANCED OPTICAL MATERIALS, 2024, 12 (28):
  • [3] On-chip versus off-chip test: An artificial dichotomy
    Aitken, RC
    INTERNATIONAL TEST CONFERENCE 1998, PROCEEDINGS, 1998, : 1146 - 1146
  • [4] Accurately Modeling the GPU Memory Subsystem
    Candel, Francisco
    Petit, Salvador
    Sahuquillo, Julio
    Duato, Jose
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2015), 2015, : 179 - 186
  • [5] Introduction to the Special Section on On-Chip and Off-Chip Network Architectures
    Cardo, Jose Flich
    Palesi, Maurizio
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2013, 12 (04)
  • [6] Design of On-chip and off-chip interfaces for a GALS NoC architecture
    Beigne, E.
    Vivet, P.
    12TH IEEE INTERNATIONAL SYMPOSIUM ON ASYNCHRONOUS CIRCUITS AND SYSTEMS, PROCEEDINGS, 2006, : 172 - 181
  • [7] New debug strategies combine on-chip and off-chip tools
    Garrett, B
    Won, MS
    EDN, 2000, 45 (17) : 105 - +
  • [8] Materials and Devices for On-Chip and Off-Chip Peltier Cooling: A Review
    Nimmagadda, Lakshmi Amulya
    Mahmud, Rifat
    Sinha, Sanjiv
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2021, 11 (08): : 1267 - 1281
  • [9] AIREN: A Novel Integration of On-Chip and Off-Chip FPGA Networks
    Schmidt, Andrew G.
    Kritikos, William V.
    Sharma, Rahul R.
    Sass, Ron
    PROCEEDINGS OF THE 2009 17TH IEEE SYMPOSIUM ON FIELD PROGRAMMABLE CUSTOM COMPUTING MACHINES, 2009, : 271 - 274
  • [10] Comparison of on-chip and off-chip microfluidic kinase assay formats
    Dunne, J
    Reardon, H
    Trinh, V
    Li, E
    Farinas, J
    ASSAY AND DRUG DEVELOPMENT TECHNOLOGIES, 2004, 2 (02) : 121 - 129