A Layer-based Structured Design of CNN on FPGA

被引:0
|
作者
Huang, Chao [1 ]
Ni, Siyu [1 ]
Chen, Gengsheng [1 ]
机构
[1] Fudan Univ, State Key Lab ASIC & Syst, 825 Zhangheng Rd, Shanghai 201203, Peoples R China
来源
2017 IEEE 12TH INTERNATIONAL CONFERENCE ON ASIC (ASICON) | 2017年
关键词
FPGA; CNNs; Data quantization; Pipeline; Structured design; SqueezeNet;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural networks (CNNs) are widely used in machine learning applications. Large in scale, most deep CNNs are however difficult to be implemented on a single hardware for acceleration. This paper presents a new design and implementation of a 23-layer SqueezeNet [1] on a Xilinx VC709 FPGA board. In this new design, a novel layer-based structured design method is proposed for full scalability in constructing CNNs, in which all the CNN layers are optimized and deployed separately and independently. Moreover, inherent parallelism in CNN's data channels and intra-kernel computations, together with the data structure in memory, are exploited and optimized for performance and efficiency enhancement. This new design and its architecture enables the whole CNN to have a flexible and scalable deployment, with all its layers working concurrently in a pipelined structure. Experimental result shows that, the newly implemented 23-layer SqueezeNet can reach its peak performance of 213.7G0P/s under 110MHz clock frequency with 79.05% top-5 accuracy, which is much faster and more efficient than similar works. Additionally, with the same CNN network built on, FPGA shows a much better performance than CPU, GPU and SoC in terms of power efficiency.
引用
收藏
页码:1037 / 1040
页数:4
相关论文
共 50 条
  • [21] Layer-based Buffer Aware Rate Adaptation Design for SHVC Video Streaming
    Gudumasu, Srinivas
    Hamza, Ahmed
    Asbun, Eduardo
    He, Yong
    Ye, Yan
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXIX, 2016, 9971
  • [22] Efficient FPGA design for Convolutions in CNN based on FFT-pruning
    He, Liulu
    Xie, Xiaoru
    Lin, Jun
    Wang, Zhongfeng
    APCCAS 2020: PROCEEDINGS OF THE 2020 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2020), 2020, : 27 - 30
  • [23] A Collaborative Framework for FPGA-based CNN Design Modeling and Optimization
    Mu, Jiandong
    Zhang, Wei
    Liang, Hao
    Sinha, Sharad
    2018 28TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2018, : 139 - 146
  • [24] Layer-based integration arithmetic of conformal PML
    School of Aeronautics, Northwestern Polytechnical University, Xi'an, 710072, China
    Appl Comput Electromagn Soc J, 2009, 5 (518-522):
  • [25] Layer-Based Approach for Image Pair Fusion
    Son, Chang-Hwan
    Zhang, Xiao-Ping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) : 2866 - 2881
  • [26] Plenoptic Layer-Based Modeling for Image Based Rendering
    Pearson, James
    Brookes, Mike
    Dragotti, Pier Luigi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (09) : 3405 - 3419
  • [27] A layer-based method for rapid software development
    Lin, Lendy
    Yang, Weipang
    Lin, Jyhjong
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2012, 64 (05) : 1364 - 1375
  • [28] Layer-Based Integration Arithmetic of Conformal PML
    Zhang, Y. J.
    Sun, Q.
    APPLIED COMPUTATIONAL ELECTROMAGNETICS SOCIETY JOURNAL, 2009, 24 (05): : 518 - 522
  • [29] Layer-based decomposition of solids and its applications
    Antonio J. Rueda
    Francisco R. Feito
    Lidia M. Ortega
    The Visual Computer, 2005, 21 : 406 - 417
  • [30] Design of A Layer-based Video Streaming System over Software-Defined Networks
    Kalan, Reza Shokri
    Cetinkaya, Cihat
    Sayit, Muge
    PROCEEDINGS OF THE 2017 8TH INTERNATIONAL CONFERENCE ON THE NETWORK OF THE FUTURE (NOF), 2017, : 8 - 13