Nebula: A Scalable and Flexible Accelerator for DNN Multi-Branch Blocks on Embedded Systems

被引:0
作者
Yang, Dawei [1 ]
Li, Xinlei [2 ]
Qi, Lizhe [1 ]
Zhang, Wenqiang [1 ]
Jiang, Zhe [3 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[2] Shanghai Univ Int Business & Econ, Sch Stat & Informat, Shanghai 201620, Peoples R China
[3] Univ Cambridge, Comp Sci, Cambridge CB3 0FD, England
关键词
DNN accelerators; multi-branch network; energy-efficient accelerators;
D O I
10.3390/electronics11040505
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks (DNNs) are widely used in many artificial intelligence applications; many specialized DNN-inference accelerators have been proposed. However, existing DNN accelerators rely heavily on certain types of DNN operations (such as Conv, FC, and ReLU, etc.), which are either less used or likely to become out of date in future, posing challenges of flexibility and compatibility to existing work. This paper designs a flexible DNN accelerator from a more generic perspective rather than speeding up certain types of DNN operations. Our proposed Nebula exploits the width property of DNNs and gains a significant improvement in system throughput and energy efficiency over multi-branch architectures. Nebula is a first-of-its-kind framework for multi-branch DNNs.
引用
收藏
页数:13
相关论文
共 33 条
  • [21] Mashimo S., P 2019 INT C FIELD P
  • [22] SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
    Parashar, Angshuman
    Rhu, Minsoo
    Mukkara, Anurag
    Puglielli, Antonio
    Venkatesan, Rangharajan
    Khailany, Brucek
    Emer, Joel
    Keckler, Stephen W.
    Dally, William J.
    [J]. 44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, : 27 - 40
  • [23] Plumbridge G., 2014, ACM SIGARCH Comput. Architecture News, V41, P107, DOI 10.1145/2641361.2641379
  • [24] Ramachandran P, 2017, Searching for activation functions
  • [25] Sharma H., P 2016 49 ANN IEEE A, P1
  • [26] Szegedy C., P 2015 IEEE C COMP V, P1
  • [27] Vaswani A, 2017, ADV NEUR IN, V30
  • [28] Wang H., P IEEE CVF C COMP VI, P5463
  • [29] Aggregated Residual Transformations for Deep Neural Networks
    Xie, Saining
    Girshick, Ross
    Dollar, Piotr
    Tu, Zhuowen
    He, Kaiming
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5987 - 5995
  • [30] Xu P., P 2020 ACM SIGDA INT