DUB: Dynamic Underclocking and Bypassing in NoCs for Heterogeneous GPUWorkloads

被引:6
作者
Bharadwaj, Srikant [1 ,2 ]
Das, Shomit [1 ]
Eckert, Yasuko [1 ]
Oskin, Mark [3 ]
Krishna, Tushar [2 ]
机构
[1] Adv Micro Devices Inc, Santa Clara, CA 95054 USA
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
[3] Univ Washington, Seattle, WA 98195 USA
来源
2021 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS 2021) | 2021年
关键词
Dynamic Voltage Frequency Scaling (DVFS); Power Efficiency;
D O I
10.1145/3479876.3481590
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of graphics processing units (GPU) workloads can be sensitive to the various clock domains which are dynamically tunable in modern GPUs. In this work, we observe that GPU application performance is sensitive towards NoC clock frequencies and the sensitivity varies during the execution of GPU kernels. We note that this heterogeneity is not adapted well by traditional dynamic voltage frequency scaling (DVFS) techniques. To that end, we introduce DUB, Dynamic Underclocking and Bypassing technique, for such heterogeneous GPU workloads. We enable bypassing retimer flops and routers while underclocking the NoC frequency thus enabling high power savings at minimal performance loss. Compared to baseline we observe a 26% improvement in power savings with only 3% degradation in performance beating oracular DVFS techniques.
引用
收藏
页码:49 / 54
页数:6
相关论文
共 18 条
  • [11] Kar M, 2017, ICCAD-IEEE ACM INT, P743, DOI 10.1109/ICCAD.2017.8203851
  • [12] Krishna T, 2013, INT S HIGH PERF COMP, P378, DOI 10.1109/HPCA.2013.6522334
  • [13] Li T, 2014, IEEE INT SOC CONF, P130, DOI 10.1109/SOCC.2014.6948913
  • [14] Lowe-Power Jason, 2020, ARXIV200703152CSAR
  • [15] Naffziger Samuel, 2021, ACM IEEE ISCA, P57, DOI [10.1109/ISCA52012.2021.00014, DOI 10.1109/ISCA52012.2021.00014]
  • [16] Xi Chen, 2012, 2012 Sixth IEEE/ACM International Symposium on Networks-on-Chip (NoCS), P43, DOI 10.1109/NOCS.2012.12
  • [17] Yao Y, 2016, DES AUT TEST EUROPE, P1433
  • [18] Zhang Xianwei, 2020, DELTA VALIDATE GPU M, P97, DOI [10.1145/3422575.3422784, DOI 10.1145/3422575.3422784]