Efficient Management of Scratch-Pad Memories in Deep Learning Accelerators

被引:1
|
作者
Pal, Subhankar [1 ]
Venkataramani, Swagath [2 ]
Srinivasan, Viji [2 ]
Gopalakrishnan, Kailash [2 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
来源
2021 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2021) | 2021年
关键词
PERFORMANCE;
D O I
10.1109/ISPASS51385.2021.00046
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A prevalent challenge for Deep Learning (DL) accelerators is how they are programmed to sustain utilization without impacting end-user productivity. Little prior effort has been devoted to the effective management of their on-chip Scratch-Pad Memory (SPM) across the DL operations of a Deep Neural Network (DNN). This is especially critical due to trends in complex network topologies and the emergence of eager execution. This work demonstrates that there exists up to a 5.2x performance gap in DL inference to be bridged using SPM management, on a set of image, object and language networks. We propose OnSRAM, a novel SPM management framework integrated with a DL accelerator runtime. OnSRAM has two variants, viz. OnSRAM-Static, which works on static graphs to identify data structures that should be held on-chip based on their properties, and OnSRAM-Eager, which targets an eager execution model (no graph) and uses a speculative scheme to hold/discard data structures. On a prototypical DL accelerator, OnSRAM-Static and OnSRAM-Eager achieve reductions in inference latency (batch size of 1) of 1.02-4.8x and 1.02-3.1x, respectively, over a baseline with no SPM management.
引用
收藏
页码:240 / 242
页数:3
相关论文
共 50 条
  • [41] Physics-informed deep learning and linear programming for efficient optimization of combined cycle power plants
    Hosseini, Mohammad Mehdi
    Meguerdijian, Saro
    Golmohammadi, Azarang
    ELECTRIC POWER SYSTEMS RESEARCH, 2024, 232
  • [42] Efficient geospatial mapping of buildings, woodlands, water and roads from aerial imagery using deep learning
    Abbas, Sidra
    Almadhor, Ahmad
    Sampedro, Gabriel Avelino
    Alsubai, Shtwai
    Al Hejaili, Abdullah
    Straovska, Lubomira
    Zaidi, Monji Mohamed
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [43] Energy-Efficient Joint Task Assignment and Migration in Data Centers: A Deep Reinforcement Learning Approach
    Lou, Jiong
    Tang, Zhiqing
    Jia, Weijia
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (02): : 961 - 973
  • [44] A Study on Thermal Load Management in a Deep Geological Repository for Efficient Disposal of High Level Radioactive Waste
    Lee, Jongyoul
    Choi, Heuijoo
    Cho, Dongkeun
    JOURNAL OF NUCLEAR FUEL CYCLE AND WASTE TECHNOLOGY, 2022, 20 (04): : 469 - 488
  • [45] Holistic cold-start management in serverless computing cloud with deep learning for time series
    Nguyen, Tam n.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 153 : 312 - 325
  • [46] A Novel RMS-Driven Deep Reinforcement Learning for Optimized Portfolio Management in Stock Trading
    Sattar, Asma
    Sarwar, Amna
    Gillani, Saira
    Bukhari, Maryam
    Rho, Seungmin
    Faseeh, Muhammad
    IEEE ACCESS, 2025, 13 : 42813 - 42835
  • [47] Leveraging deep learning and computer vision technologies to enhance management of coastal fisheries in the Pacific region
    Shedrawi, George
    Magron, Franck
    Vigga, Bernard
    Bosserelle, Pauline
    Gislard, Sebastien
    Halford, Andrew R.
    Tiitii, Sapeti
    Fepuleai, Faasulu
    Molai, Chris
    Rota, Manibua
    Jalam, Shivam
    Fatongiatau, Viliami
    Sami, Abel P.
    Nikiari, Beia
    Sokach, Ada H. M.
    Joy, Lucy A.
    Li, Owen
    Steenbergen, Dirk J.
    Andrew, Neil L.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [48] Deep Learning in Building Management Systems over NDN: use case of Forwarding & HVAC Control
    Ayadi, Mohamed Issam
    Maizate, Abderrahim
    Ouzzif, Mohamed
    Mahmoudi, Charif
    2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2019, : 1192 - 1198
  • [49] A Study on the Forecast of Earning Management Based on Deep Learning by Reflecting Information on Corporate Litigation Cases
    Kang, Hyeon
    Kim, Hyungjoon
    Na, Hyung Jong
    IEEE ACCESS, 2024, 12 : 139097 - 139112
  • [50] Systematic Review on Deep Reinforcement Learning-Based Energy Management for Different Building Types
    Shaqour, Ayas
    Hagishima, Aya
    ENERGIES, 2022, 15 (22)