Sustainable AI Processing at the Edge

被引:3
作者
Ollivier, Sebastien [1 ]
Li, Sheng [2 ]
Tang, Yue [3 ]
Cahoon, Stephen [4 ]
Caginalp, Ryan [3 ]
Chaudhuri, Chayanika [3 ]
Zhou, Peipei [5 ]
Tang, Xulong [2 ]
Hu, Jingtong [5 ]
Jones, Alex K. [6 ]
机构
[1] Univ Pittsburgh, Pittsburgh, PA 15260 USA
[2] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
[3] Univ Pittsburgh, Elect & Comp Engn, Pittsburgh, PA 15260 USA
[4] Univ Pittsburgh, Comp Engn, Pittsburgh, PA 15260 USA
[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA
[6] Univ Pittsburgh, Elect & Comp Engn & Comp Sci, Pittsburgh, PA 15260 USA
基金
美国国家科学基金会;
关键词
Artificial intelligence; Edge computing; Memory management; Sustainable development; Measurement; Fabrication; Costs; PERFORMANCE; ENERGY;
D O I
10.1109/MM.2022.3220399
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Edge computing is a popular paradigm for accelerating light- to medium-weight machine learning algorithms initiated from mobile devices without requiring the long communication latencies to send them to remote datacenters in the cloud. Edge servers primarily consider traditional concerns, such as size, weight, and power constraints for their installations. However, such metrics are not entirely sufficient to consider environmental impacts from computing given the significant contributions from embodied energy and carbon. In this article we explore the tradeoffs of hardware strategies for convolutional neural network acceleration engines considering inference and online training. In particular, we explore the use of mobile graphics processing unit (GPU) accelerators, recently released edge-class field-programmable gate arrays, and novel processing in memory (PIM) using dynamic random-access memory (DRAM) and emerging Racetrack memory. Given edge servers already employ DRAM and sometimes GPU accelerators, we consider the sustainability implications using breakeven analysis of replacing or augmenting DDR3 with Racetrack memory. We also consider the implications for provisioning edge servers with different accelerators using indifference analysis. While mobile GPUs are typically much more energy efficient, their significant embodied energy can make them less sustainable than PIM solutions in certain scenarios that consider activity time and compute effort.
引用
收藏
页码:19 / 28
页数:10
相关论文
共 23 条
  • [1] DTCO including Sustainability: Power-Performance-Area-Cost-Environmental score (PPACE) Analysis for Logic Technologies
    Bardon, M. Garcia
    Wuytens, P.
    Ragnarsson, L-A
    Mirabelli, G.
    Jang, D.
    Willems, G.
    Mallik, A.
    Spessot, A.
    Ryckaert, J.
    Parvais, B.
    [J]. 2020 IEEE INTERNATIONAL ELECTRON DEVICES MEETING (IEDM), 2020,
  • [2] Bayram I., 2016, 2016 7 INT GREEN SUS, P1
  • [3] Bennis R., 2019, LIFE CYCLE ASSESSMEN
  • [4] Magnetic Racetrack Memory: From Physics to the Cusp of Applications Within a Decade
    Blasing, Robin
    Khan, Asif Ali
    Filippou, Panagiotis Ch
    Garg, Chirag
    Hameed, Fazal
    Castrillon, Jeronimo
    Parkin, Stuart S. P.
    [J]. PROCEEDINGS OF THE IEEE, 2020, 108 (08) : 1303 - 1321
  • [5] Boyd Sarah B., 2011, Life-cycle Assessment of Semiconductors
  • [6] Brunvand E., 2018, 2018 Ninth International Green and Sustainable Computing Conference, DOI DOI 10.1109/IGCC.2018.8752110
  • [7] NVSim: A Circuit-Level Performance, Energy, and Area Model for Emerging Nonvolatile Memory
    Dong, Xiangyu
    Xu, Cong
    Xie, Yuan
    Jouppi, Norman P.
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2012, 31 (07) : 994 - 1007
  • [8] Higgs T, 2009, IEEE I SYMP SUST SYS, P228, DOI 10.1109/ISSST.2009.5156786
  • [9] ISO14044, 2006, ENV MAN LIF CYCL ASS
  • [10] Jones AK, 2013, ICCAD-IEEE ACM INT, P206, DOI 10.1109/ICCAD.2013.6691120