Bus Width Aware Off-Chip Memory Access Minimization for CNN Accelerators

Cited by: 5
Authors
Tewari, Saurabh [1]
Kumar, Anshul [1]
Paul, Kolin [1]
Affiliation
[1] Indian Inst Technol Delhi, Dept Comp Sci & Engn, New Delhi, India
Source
2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020) | 2020
Keywords
Neural Network Accelerators; CNN; Energy Efficiency; Off-Chip Memory Accesses; Bus Width;
DOI
10.1109/ISVLSI49217.2020.00051
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Convolutional Neural Network (CNN) accelerators have gained popularity due to their ability to speed up CNN-based applications. However, their limited energy efficiency restricts their use in energy-constrained devices. A significant fraction of their energy consumption results from off-chip memory accesses. To achieve high throughput, these accelerators connect to off-chip memory through a wide data bus. However, accessing data whose size is not a multiple of the bus width wastes energy. We observed that off-chip memory accesses can be reduced significantly by partitioning the data so that it optimally utilizes the bus width and increases the number of aligned accesses. In this work, we propose a bus-width-aware approach to determine the optimal partition of the convolution layers and thereby reduce off-chip memory accesses. Our tool evaluates the off-chip memory accesses for different data partitions and data reuse schemes to find the optimal partition. We experimented with two popular CNNs, VGG16 and AlexNet. Compared to the state-of-the-art approach, our approach reduces the off-chip memory accesses of VGG16 by 16% and 29% and of AlexNet by 9% and 16% on 64-bit and 128-bit data buses, respectively.
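The alignment effect described in the abstract can be illustrated with a minimal sketch. This is not the authors' tool: the function names `bus_transactions` and `tile_row_accesses`, the 1-byte element size, and the simple row-tiling model are all assumptions made for illustration. The sketch only counts how many bus-wide transfers are needed to read one row of data when it is split into tiles of a given width.

```python
def bus_transactions(start_byte: int, num_bytes: int, bus_bytes: int) -> int:
    """Count bus-wide transfers needed to read num_bytes starting at start_byte.

    Hypothetical model: every transfer moves exactly one aligned bus word,
    so a request that straddles a word boundary costs an extra transfer.
    """
    if num_bytes <= 0:
        return 0
    first_word = start_byte // bus_bytes
    last_word = (start_byte + num_bytes - 1) // bus_bytes
    return last_word - first_word + 1


def tile_row_accesses(row_len: int, tile_width: int,
                      elem_bytes: int, bus_bytes: int) -> int:
    """Total transfers to read a row of row_len elements, one tile at a time.

    Each tile is fetched as a separate request (illustrative assumption).
    """
    total = 0
    for start in range(0, row_len, tile_width):
        width = min(tile_width, row_len - start)
        total += bus_transactions(start * elem_bytes, width * elem_bytes, bus_bytes)
    return total


if __name__ == "__main__":
    # 224-element row, 1-byte elements, 64-bit (8-byte) bus.
    for tile_width in (7, 8):
        count = tile_row_accesses(224, tile_width, elem_bytes=1, bus_bytes=8)
        print(f"tile width {tile_width}: {count} bus transfers")
```

Under these assumptions, a tile width of 7 elements needs 56 transfers for the row, while a tile width of 8 (a multiple of the bus width) needs 28, which is the minimum for 224 bytes on an 8-byte bus.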
Pages: 240-245
Number of pages: 6
Related papers
50 in total
  • [1] Minimizing Off-Chip Memory Access for CNN Accelerators
    Tewari, Saurabh
    Kumar, Anshul
    Paul, Kolin
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2022, 11 (03) : 95 - 104
  • [2] An Off-Chip Attack on Hardware Enclaves via the Memory Bus
    Lee, Dayeol
    Jung, Dongha
    Fang, Ian T.
    Tsai, Chia-Che
    Popa, Raluca Ada
    PROCEEDINGS OF THE 29TH USENIX SECURITY SYMPOSIUM, 2020, : 487 - 504
  • [3] Exploring Wireless Technology for Off-Chip Memory Access
    Sikder, Md Ashif I.
    DiTomaso, Dominic
    Kodi, Avinash
    Kaya, Savas
    Rayess, William
    Matolak, David
    2016 IEEE 24TH ANNUAL SYMPOSIUM ON HIGH-PERFORMANCE INTERCONNECTS (HOTI), 2016, : 92 - 99
  • [4] SmartShuttle: Optimizing Off-Chip Memory Accesses for Deep Learning Accelerators
    Li, Jiajun
    Yan, Guihai
    Lu, Wenyan
    Jiang, Shuhao
    Gong, Shijun
    Wu, Jingya
    Li, Xiaowei
    PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 343 - 348
  • [5] Off-chip bus power minimization using serialization with cache-based encoding
    Mohammad, Khader
    Kabeer, Ahsan
    Taha, Tarek M.
    Owaida, Muhsen
    Washha, Mandi
    MICROELECTRONICS JOURNAL, 2016, 54 : 138 - 149
  • [6] EGCN: An Efficient GCN Accelerator for Minimizing Off-Chip Memory Access
    Han, Yunki
    Park, Kangkyu
    Jung, Youngbeom
    Kim, Lee-Sup
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (12) : 3127 - 3139
  • [7] Analysis of the Effect of Off-chip Memory Access on the Performance of an NPU System
    Lee, Keonjoo
    Kang, Donghyun
    Kang, Duseok
    Ha, Soonhoi
    PROCEEDINGS OF THE TWENTY THIRD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2022), 2022, : 13 - 18
  • [8] Optimizing Off-Chip Memory Access for Deep Neural Network Accelerator
    Zheng, Yong
    Yang, Haigang
    Shu, Yi
    Jia, Yiping
    Huang, Zhihong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (04) : 2316 - 2320
  • [9] SACC: Split and Combine Approach to Reduce the Off-chip Memory Accesses of LSTM Accelerators
    Tewari, Saurabh
    Kumar, Anshul
    Paul, Kolin
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 580 - 583
  • [10] POMMEL: Exploring Off-Chip Memory Energy & Power Consumption in Convolutional Neural Network Accelerators
    Montgomerie-Corcoran, Alexander
    Bouganis, Christos-Savvas
    2021 24TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD 2021), 2021, : 442 - 448