PLACID: A Platform for FPGA-Based Accelerator Creation for DCNNs

被引:21
作者
Motamedi, Mohammad [1 ]
Gysel, Philipp [1 ]
Ghiasi, Soheil [1 ]
机构
[1] Univ Calif Davis, Elect & Comp Engn Dept, One Shields Ave, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
Convolutional neural networks; deep learning; accelerator design; design automation; COPROCESSOR;
D O I
10.1145/3131289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Convolutional Neural Networks (DCNNs) exhibit remarkable performance in a number of pattern recognition and classification tasks. Modern DCNNs involve many millions of parameters and billions of operations. Inference using such DCNNs, if implemented as software running on an embedded processor, results in considerable execution time and energy consumption, which is prohibitive in many mobile applications. Field-programmable gate array (FPGA)-based acceleration of DCNN inference is a promising approach to improve both energy consumption and classification throughput. However, the engineering effort required for development and verification of an optimized FPGA-based architecture is significant. In this article, we present PLACID, an automated PLatform for Accelerator CreatIon for DCNNs. PLACID uses an analytical approach to characterization and exploration of the implementation space. PLACID enables generation of an accelerator with the highest throughput for a given DCNN on a specific target FPGA platform. Subsequently, it generates an RTL level architecture in Verilog, which can be passed onto commercial tools for FPGA implementation. PLACID is fully automated, and reduces the accelerator design time from a few months down to a few hours. Experimental results show that architectures synthesized by PLACID yield 2x higher throughput density than the best competing approach.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] FPGA-Based Accelerators of Deep Learning Networks for Learning and Classification: A Review
    Shawahna, Ahmad
    Sait, Sadiq M.
    El-Maleh, Aiman
    IEEE ACCESS, 2019, 7 : 7823 - 7859
  • [22] MobileSP: An FPGA-Based Real-Time Keypoint Extraction Hardware Accelerator for Mobile VSLAM
    Liu, Ye
    Li, Jingyuan
    Huang, Kun
    Li, Xiangting
    Qi, Xiuyuan
    Chang, Liang
    Long, Yu
    Zhou, Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (12) : 4919 - 4929
  • [23] Live Demonstration: FPGA-based CNN Accelerator with Filter-Wise-Optimized Bit Precision
    Nakata, Kengo
    Maki, Asuka
    Miyashita, Daisuke
    Tachibana, Fumihiko
    Suzuki, Tomoya
    Deguchi, Jun
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [24] An FPGA-Based Transformer Accelerator Using Output Block Stationary Dataflow for Object Recognition Applications
    Zhao, Zhongyu
    Cao, Rujian
    Un, Ka-Fai
    Yu, Wei-Han
    Mak, Pui-In
    Martins, Rui P.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (01) : 281 - 285
  • [25] An Efficient FPGA-Based Architecture for Convolutional Neural Networks
    Hwang, Wen-Jyi
    Jhang, Yun-Jie
    Tai, Tsung-Ming
    2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 582 - 588
  • [26] Service-Oriented Architecture on FPGA-Based MPSoC
    Wang, Chao
    Li, Xi
    Chen, Yunji
    Zhang, Youhui
    Diessel, Oliver
    Zhou, Xuehai
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (10) : 2993 - 3006
  • [27] FPGA-based Low-Batch Training Accelerator for Modern CNNs Featuring High Bandwidth Memory
    Venkataramanaiah, Shreyas K.
    Suh, Han-Sok
    Yin, Shihui
    Nurvitadhi, Eriko
    Dasu, Aravind
    Cao, Yu
    Seo, Jae-Sun
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
  • [28] FPGA-Based Road Crack Detection Using Deep Learning
    Canese, Lorenzo
    Cardarilli, Gian Carlo
    Di Nunzio, Luca
    Fazzolari, Rocco
    Re, Marco
    Spano, Sergio
    ADVANCES IN SYSTEM-INTEGRATED INTELLIGENCE, SYSINT 2022, 2023, 546 : 65 - 73
  • [29] An Energy-Efficient FPGA-Based Deconvolutional Neural Networks Accelerator for Single Image Super-Resolution
    Chang, Jung-Woo
    Kang, Keon-Woo
    Kang, Suk-Ju
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (01) : 281 - 295
  • [30] An Accelerated FPGA-Based Parallel CNN-LSTM Computing Device
    Zhou, Xin
    Xie, Wei
    Zhou, Han
    Cheng, Yongjing
    Wang, Ximing
    Ren, Yun
    Yuan, Shandong
    Li, Liuwen
    IEEE ACCESS, 2024, 12 : 106579 - 106592