Heterogeneous Multi-Chiplets Neural Network Accelerator

被引:0
|
作者
Zhu G. [1 ]
Ma S. [1 ]
Zhang C. [1 ]
Wang B. [1 ]
机构
[1] College of Computer, National University of Defense Technology, Changsha
来源
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics | 2023年 / 35卷 / 05期
关键词
accelerator; chiplet; heterogeneous; neural network;
D O I
10.3724/SP.J.1089.2023.19451
中图分类号
学科分类号
摘要
With the rapid development of neural network technology, a large number of edge computing devices are used in the field of intelligent computing for security reasons. Firstly, this paper designs the basic structure and components of a heterogeneous multi-chiplets neural network accelerator that can be applied to edge computing. Secondly, we precompute the computational load on the heterogeneous cores, divide the computational tasks on the neural network channels, continuously add new tasks, test and iterate chipet by chiplet, and select the combination of heterogeneous chiplets to build the neural network accelerator. Finally, the heterogeneous multi-chiplets neural network accelerator is constructed on the test neural network, MobileNet, and ShuffleNet with this coarse-grained optimization method, and its energy consumption and performance are tested respectively. The experimental results show that this heterogeneous design approach can achieve acceleration ratios of 7.43, 2.30, and 5.60, respectively, while controlling the energy consumption. © 2023 Institute of Computing Technology. All rights reserved.
引用
收藏
页码:811 / 818
页数:7
相关论文
共 19 条
  • [1] Redmon J, Divvala S, Girshick R, Et al., You only look once: unified, real-time object detection, Proceedings of the Computer Vision and Pattern Recognition, pp. 779-788, (2016)
  • [2] Hu X, Yang W J, Wen H, Et al., A lightweight 1-D convolution augmented transformer with metric learning for hyperspectral image classification, Sensors, 21, 5, (2021)
  • [3] Yurtsever E, Lambert J, Carballo A, Et al., A survey of autonomous driving: common practices and emerging technologies, IEEE Access, 8, pp. 58443-58469, (2020)
  • [4] Zhai Y Z, Ding B, Liu X, Et al., Decentralized multi-robot collision avoidance in complex scenarios with selective communication, IEEE Robotics and Automation Letters, 6, 4, pp. 8379-8386, (2021)
  • [5] Way T, Bertrand E., Machine learning on the edge: azure ML and azure loT edge integration, MSDN Magazine, 34, 2, pp. 26-31, (2019)
  • [6] Jouppi N, Young C, Patil N, Et al., Motivation for and evaluation of the first tensor processing unit, IEEE Miceo, 38, 3, pp. 10-19, (2018)
  • [7] Kurniawan A., IoT projects with NVIDIA jetson nano: AI-enabled internet of things projects for beginners, (2021)
  • [8] Yin S Y, Ouyang P, Tang S B, Et al., A 1.06-to-5.09 TOPS/W reconfigurable hybrid-neural-network processor for deep learning applications, Proceedings of the Symposium on VLSI Circuits, pp. C26-C27, (2017)
  • [9] Shao Y S, Cemons J, Venkatesan R, Et al., Simba: scaling deep-learning inference with chiplet-based architecture, Communications of the ACM, 64, 6, pp. 107-116, (2021)
  • [10] Mahajan R, Sankman R, Patel N, Et al., Embedded multi-die interconnect bridge (EMIB) - a high density, high bandwidth packaging interconnect, Proceedings of the 66th IEEE Electronic Components and Technology Conference, pp. 557-565, (2016)