DUB: Dynamic Underclocking and Bypassing in NoCs for Heterogeneous GPUWorkloads

被引:6
作者
Bharadwaj, Srikant [1 ,2 ]
Das, Shomit [1 ]
Eckert, Yasuko [1 ]
Oskin, Mark [3 ]
Krishna, Tushar [2 ]
机构
[1] Adv Micro Devices Inc, Santa Clara, CA 95054 USA
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
[3] Univ Washington, Seattle, WA 98195 USA
来源
2021 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS 2021) | 2021年
关键词
Dynamic Voltage Frequency Scaling (DVFS); Power Efficiency;
D O I
10.1145/3479876.3481590
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of graphics processing units (GPU) workloads can be sensitive to the various clock domains which are dynamically tunable in modern GPUs. In this work, we observe that GPU application performance is sensitive towards NoC clock frequencies and the sensitivity varies during the execution of GPU kernels. We note that this heterogeneity is not adapted well by traditional dynamic voltage frequency scaling (DVFS) techniques. To that end, we introduce DUB, Dynamic Underclocking and Bypassing technique, for such heterogeneous GPU workloads. We enable bypassing retimer flops and routers while underclocking the NoC frequency thus enabling high power savings at minimal performance loss. Compared to baseline we observe a 26% improvement in power savings with only 3% degradation in performance beating oracular DVFS techniques.
引用
收藏
页码:49 / 54
页数:6
相关论文
共 18 条
  • [1] Alsop J, 2019, I S WORKL CHAR PROC, P243, DOI 10.1109/IISWC47752.2019.9041977
  • [2] AMD, 2017, Radeons Next-generation Vega Architecture
  • [3] Kite: A Family of Heterogeneous Interposer Topologies Enabled via Accurate Interconnect Modeling
    Bharadwaj, Srikant
    Yin, Jieming
    Beckmann, Bradford
    Krishna, Tushar
    [J]. PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,
  • [4] Scalable Distributed Last-Level TLBs Using Low-Latency Interconnects
    Bharadwaj, Srikant
    Cox, Guilherme
    Krishna, Tushar
    Bhattacharjee, Abhishek
    [J]. 2018 51ST ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2018, : 271 - 284
  • [5] Binkert Nathan, 2011, Computer Architecture News, V39, P1, DOI 10.1145/2024716.2024718
  • [6] Choquette Jack, IEEE MICRO
  • [7] Dynamic Voltage and Frequency Scaling in NoCs with Supervised and Reinforcement Learning Techniques
    Fettes, Quintin
    Clark, Mark
    Bunescu, Razvan
    Karanth, Avinash
    Louri, Ahmed
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (03) : 375 - 389
  • [8] Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level
    Gutierrez, Anthony
    Beckmann, Bradford M.
    Dutu, Alexandru
    Gross, Joseph
    Kalamatianos, John
    Kayiran, Onur
    LeBeane, Michael
    Poremba, Matthew
    Potter, Brandon
    Puthoor, Sooraj
    Sinclair, Matthew D.
    Wyse, Mark
    Yin, Jieming
    Zhang, Xianwei
    Jain, Akshay
    Rogers, Timothy G.
    [J]. 2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2018, : 608 - 619
  • [9] Hesse Robert, 2015, PROC 9 INT S NETW CH, P1
  • [10] Jerger Natalie D. Enright, 2017, ONCHIP NETWORKS