PLACID: A Platform for FPGA-Based Accelerator Creation for DCNNs

被引：23

作者：

Motamedi, Mohammad ^{[1
]}

Gysel, Philipp ^{[1
]}

Ghiasi, Soheil ^{[1
]}

机构：

[1] Univ Calif Davis, Elect & Comp Engn Dept, One Shields Ave, Davis, CA 95616 USA

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2017年 / 13卷 / 04期

基金：

美国国家科学基金会;

关键词：

Convolutional neural networks; deep learning; accelerator design; design automation; COPROCESSOR;

D O I：

10.1145/3131289

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep Convolutional Neural Networks (DCNNs) exhibit remarkable performance in a number of pattern recognition and classification tasks. Modern DCNNs involve many millions of parameters and billions of operations. Inference using such DCNNs, if implemented as software running on an embedded processor, results in considerable execution time and energy consumption, which is prohibitive in many mobile applications. Field-programmable gate array (FPGA)-based acceleration of DCNN inference is a promising approach to improve both energy consumption and classification throughput. However, the engineering effort required for development and verification of an optimized FPGA-based architecture is significant. In this article, we present PLACID, an automated PLatform for Accelerator CreatIon for DCNNs. PLACID uses an analytical approach to characterization and exploration of the implementation space. PLACID enables generation of an accelerator with the highest throughput for a given DCNN on a specific target FPGA platform. Subsequently, it generates an RTL level architecture in Verilog, which can be passed onto commercial tools for FPGA implementation. PLACID is fully automated, and reduces the accelerator design time from a few months down to a few hours. Experimental results show that architectures synthesized by PLACID yield 2x higher throughput density than the best competing approach.

引用

页数：21

共 50 条

[21] FPGA-Based Accelerators of Deep Learning Networks for Learning and Classification: A Review [J].

Shawahna, Ahmad ;

Sait, Sadiq M. ;

El-Maleh, Aiman .

IEEE ACCESS, 2019, 7 :7823-7859

[22] MobileSP: An FPGA-Based Real-Time Keypoint Extraction Hardware Accelerator for Mobile VSLAM [J].

Liu, Ye ;

Li, Jingyuan ;

Huang, Kun ;

Li, Xiangting ;

Qi, Xiuyuan ;

Chang, Liang ;

Long, Yu ;

Zhou, Jun .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (12) :4919-4929

[23] Live Demonstration: FPGA-based CNN Accelerator with Filter-Wise-Optimized Bit Precision [J].

Nakata, Kengo ;

Maki, Asuka ;

Miyashita, Daisuke ;

Tachibana, Fumihiko ;

Suzuki, Tomoya ;

Deguchi, Jun .

2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,

[24] An FPGA-Based Transformer Accelerator Using Output Block Stationary Dataflow for Object Recognition Applications [J].

Zhao, Zhongyu ;

Cao, Rujian ;

Un, Ka-Fai ;

Yu, Wei-Han ;

Mak, Pui-In ;

Martins, Rui P. .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (01) :281-285

[25] An Efficient FPGA-Based Architecture for Convolutional Neural Networks [J].

Hwang, Wen-Jyi ;

Jhang, Yun-Jie ;

Tai, Tsung-Ming .

2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, :582-588

[26] Service-Oriented Architecture on FPGA-Based MPSoC [J].

Wang, Chao ;

Li, Xi ;

Chen, Yunji ;

Zhang, Youhui ;

Diessel, Oliver ;

Zhou, Xuehai .

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (10) :2993-3006

[27] High-Performance Sensing: Application of FPGA-based Winograd CNN Accelerator in Electronic Nose Systems [J].

Tan, Aolong ;

Duan, Shukai ;

Chen, Mingzhe ;

Ding, Ke ;

Li, Changqing ;

Wang, Lidan .

2024 5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING, ICAICE, 2024, :712-715

[28] An FPGA-based bit-level weight sparsity and mixed-bit accelerator for neural networks☆ [J].

Hu, Xianghong ;

Fu, Shansen ;

Lin, Yuanmiao ;

Li, Xueming ;

Yang, Chaoming ;

Li, Rongfeng ;

Huang, Hongmin ;

Cai, Shuting ;

Xiong, Xiaoming .

JOURNAL OF SYSTEMS ARCHITECTURE, 2025, 166

[29] FPGA-based Low-Batch Training Accelerator for Modern CNNs Featuring High Bandwidth Memory [J].

Venkataramanaiah, Shreyas K. ;

Suh, Han-Sok ;

Yin, Shihui ;

Nurvitadhi, Eriko ;

Dasu, Aravind ;

Cao, Yu ;

Seo, Jae-Sun .

2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,

[30] FPGA-Based Road Crack Detection Using Deep Learning [J].

Canese, Lorenzo ;

Cardarilli, Gian Carlo ;

Di Nunzio, Luca ;

Fazzolari, Rocco ;

Re, Marco ;

Spano, Sergio .

ADVANCES IN SYSTEM-INTEGRATED INTELLIGENCE, SYSINT 2022, 2023, 546 :65-73

← 1 2 3 4 5 →