Efficient Management of Scratch-Pad Memories in Deep Learning Accelerators

被引:1
|
作者
Pal, Subhankar [1 ]
Venkataramani, Swagath [2 ]
Srinivasan, Viji [2 ]
Gopalakrishnan, Kailash [2 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
来源
2021 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2021) | 2021年
关键词
PERFORMANCE;
D O I
10.1109/ISPASS51385.2021.00046
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A prevalent challenge for Deep Learning (DL) accelerators is how they are programmed to sustain utilization without impacting end-user productivity. Little prior effort has been devoted to the effective management of their on-chip Scratch-Pad Memory (SPM) across the DL operations of a Deep Neural Network (DNN). This is especially critical due to trends in complex network topologies and the emergence of eager execution. This work demonstrates that there exists up to a 5.2x performance gap in DL inference to be bridged using SPM management, on a set of image, object and language networks. We propose OnSRAM, a novel SPM management framework integrated with a DL accelerator runtime. OnSRAM has two variants, viz. OnSRAM-Static, which works on static graphs to identify data structures that should be held on-chip based on their properties, and OnSRAM-Eager, which targets an eager execution model (no graph) and uses a speculative scheme to hold/discard data structures. On a prototypical DL accelerator, OnSRAM-Static and OnSRAM-Eager achieve reductions in inference latency (batch size of 1) of 1.02-4.8x and 1.02-3.1x, respectively, over a baseline with no SPM management.
引用
收藏
页码:240 / 242
页数:3
相关论文
共 50 条
  • [1] DRDU: A data reuse analysis technique for efficient scratch-pad memory management
    Issenin, Ilya
    Brockmeyer, Erik
    Miranda, Miguel
    Dutt, Nikil
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2007, 12 (02)
  • [2] Efficient Dynamic Heap Allocation of Scratch-Pad Memory
    Mcllroy, Ross
    Dickman, Peter
    Sventek, Joe
    ISMM'08: PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON MEMORY MANAGEMENT, 2008, : 31 - +
  • [3] Compiler-Directed Leakage Energy Reduction for Instruction Scratch-Pad Memories
    Yijie Huangfu
    Wei Zhang
    PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2014), 2015, : 392 - 399
  • [4] Memory Allocation for Embedded Systems with a Compile-Time-Unknown Scratch-Pad Size
    Nguyen, Nghi
    Dominguez, Angel
    Barua, Rajeev
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2009, 8 (03)
  • [5] OnSRAM: Efficient Inter-Node On-Chip Scratchpad Management in Deep Learning Accelerators
    Pal, Subhankar
    Venkataramani, Swagath
    Srinivasan, Viji
    Gopalakrishnan, Kailash
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (06)
  • [6] Adaptive Scratch Pad Memory Management for Dynamic Behavior of Multimedia Applications
    Cho, Doosan
    Pasricha, Sudeep
    Issenin, Ilya
    Dutt, Nikil D.
    Ahn, Minwook
    Paek, Yunheung
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2009, 28 (04) : 554 - 567
  • [7] A review of emerging trends in photonic deep learning accelerators
    Atwany, Mohammad
    Pardo, Sarah
    Serunjogi, Solomon
    Rasras, Mahmoud
    FRONTIERS IN PHYSICS, 2024, 12
  • [8] Towards Energy Efficient Hybrid On-chip Scratch Pad Memory with Non-Volatile Memory
    Hu, Jingtong
    Xue, Chun Jason
    Zhuge, Qingfeng
    Tseng, Wei-Che
    Sha, Edwin H. -M.
    2011 DESIGN, AUTOMATION & TEST IN EUROPE (DATE), 2011, : 746 - 751
  • [9] Efficient Compression Technique for NoC-based Deep Neural Network Accelerators
    Lorandel, Jordane
    Lahdhiri, Habiba
    Bourdel, Emmanuelle
    Monteleone, Salvatore
    Palesi, Maurizio
    2020 23RD EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD 2020), 2020, : 174 - 179
  • [10] AutoMap: Automatic Mapping of Neural Networks to Deep Learning Accelerators for Edge Devices
    Wang, Yanhong
    Zhao, Zihao
    Jin, Xu
    Zheng, Haotian
    Nie, Maohua
    Zou, Qiaosha
    Shi, C. -J. Richard
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (09) : 2994 - 3006