Polymorphic Accelerators for Deep Neural Networks

Cited by: 10
Authors
Azizimazreah, Arash [1 ]
Chen, Lizhong [1 ]
Affiliation
[1] Oregon State Univ, Sch Elect Engn & Comp Sci, Corvallis, OR 97331 USA
Funding
U.S. National Science Foundation
Keywords
Arrays; System-on-chip; Buffer storage; Neural networks; Parallel processing; Internet; Hardware; Deep neural networks; accelerators; configurable processing element (PE) array; PE array utilization; data reuse
DOI
10.1109/TC.2020.3048624
Chinese Library Classification (CLC) Number
TP3 [Computing Technology, Computer Technology]
Discipline Code
0812
Abstract
Deep neural networks (DNNs) come in many forms, such as convolutional neural networks, multilayer perceptrons, and recurrent neural networks, to meet the diverse needs of machine learning applications. However, existing DNN accelerator designs, when used to execute multiple neural networks, suffer from underutilization of processing elements, heavy feature map traffic, and large area overhead. In this article, we propose a novel approach, Polymorphic Accelerators, to address the flexibility issue fundamentally. We introduce the abstraction of logical accelerators to decouple the fixed mapping from physical resources. Three procedures are proposed that work collaboratively to reconfigure the accelerator for the network currently being executed and to enable cross-layer data reuse among logical accelerators. Evaluation results show that the proposed approach achieves significant improvements in data reuse, inference latency, and performance, e.g., 1.52x and 1.63x higher throughput than the state-of-the-art flexible dataflow approach and resource partitioning approach, respectively. This demonstrates the effectiveness and promise of the polymorphic accelerator architecture.
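The abstract outlines the core mechanism: a fixed physical PE array is re-partitioned at run time into logical accelerators, one per executing network, and feature maps are handed between logical accelerators on chip rather than spilled off chip. As a rough illustration of that idea only, here is a minimal Python sketch; every name in it (PolymorphicArray, LogicalAccelerator, reconfigure, forward_reuse) is a hypothetical stand-in, not the paper's actual three procedures or hardware interfaces.

from dataclasses import dataclass, field

@dataclass
class LogicalAccelerator:
    """A virtual accelerator mapped onto a share of the physical PE pool."""
    network: str   # the network this slice currently serves
    num_pes: int   # PEs granted in the current configuration
    # Feature maps received on chip from other logical accelerators
    # (stands in for cross-layer data reuse).
    reuse_buffer: dict = field(default_factory=dict)

class PolymorphicArray:
    """A fixed pool of PEs that is re-partitioned per executed network."""

    def __init__(self, total_pes: int = 256) -> None:
        self.total_pes = total_pes
        self.logical: list[LogicalAccelerator] = []

    def reconfigure(self, demands: dict[str, int]) -> None:
        """Grant each network a PE share proportional to its demand, so the
        whole array stays busy regardless of which networks are running
        (the underutilization problem the abstract targets)."""
        total = sum(demands.values())
        self.logical = [
            LogicalAccelerator(net, self.total_pes * d // total)
            for net, d in demands.items()
        ]

    def forward_reuse(self, producer: str, consumer: str, fmap: str) -> None:
        """Record an on-chip handoff of a feature map from one logical
        accelerator to another, avoiding off-chip feature map traffic."""
        dst = next(a for a in self.logical if a.network == consumer)
        dst.reuse_buffer[fmap] = producer

if __name__ == "__main__":
    array = PolymorphicArray(total_pes=256)
    # Two networks executing concurrently with unequal PE demand.
    array.reconfigure({"cnn_branch": 3, "mlp_branch": 1})
    for acc in array.logical:
        print(f"{acc.network}: {acc.num_pes} PEs")  # 192 and 64 PEs
    array.forward_reuse("cnn_branch", "mlp_branch", "fmap_conv5")

In this toy version, reconfigure models PE allocation and forward_reuse models cross-layer reuse as pure bookkeeping; in the actual architecture these would be hardware reconfiguration and on-chip buffer management, which the abstract credits for the reported 1.52x and 1.63x throughput gains.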
Pages: 534-546
Page count: 13