A Multi-Level-Optimization Framework for FPGA-Based Cellular Neural Network Implementation

被引:14
|
作者
Liu, Zhongyang [1 ]
Luo, Shaoheng [1 ]
Xu, Xiaowei [2 ]
Shi, Yiyu [2 ]
Zhuo, Cheng [1 ]
机构
[1] Zhejiang Univ, 38 Zheda Rd, Hangzhou 310027, Zhejiang, Peoples R China
[2] Univ Notre Dame, Notre Dame, IN 46556 USA
关键词
Cellular neural network; FPGA; acceleration; IMAGE; CNN; CLASSIFICATION;
D O I
10.1145/3273957
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cellular Neural Network (CeNN) is considered as a powerful paradigm for embedded devices. Its analog and mix-signal hardware implementations are proved to be applicable to high-speed image processing, video analysis, and medical signal processing with its efficiency and popularity limited by smaller implementation size and lower precision. Recently, digital implementations of CeNNs on FPGA have attracted researchers from both academia and industry due to its high flexibility and short time-to-market. However, most existing implementations are not well optimized to fully utilize the advantages of FPGA platform with unnecessary design and computational redundancy that prevents speedup. We propose a multi-level-optimization framework for energy-efficient CeNN implementations on FPGAs. In particular, the optimization framework is featured with three level optimizations: system-, module-, and design-space-level, with focus on computational redundancy and attainable performance, respectively. Experimental results show that with various configurations our framework can achieve an energy-efficiency improvement of 3.54x and up to 3.88x speedup compared with existing implementations with similar accuracy.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] A Review of FPGA-Based Custom Computing Architecture for Convolutional Neural Network Inference
    Peng Xiyuan
    Yu Jinxiang
    Yao Bowen
    Liu Liansheng
    Peng Yu
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (01) : 1 - 17
  • [32] Using Data Compression for Optimizing FPGA-Based Convolutional Neural Network Accelerators
    Guan, Yijin
    Xu, Ningyi
    Zhang, Chen
    Yuan, Zhihang
    Cong, Jason
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES, 2017, 10561 : 14 - 26
  • [33] An FPGA-Based Microinstruction Sequence Driven Spaceborne Convolution Neural Network Accelerator
    Guo Z.-B.
    Liu K.
    Hu H.-T.
    Li Y.-D.
    Qu Z.-X.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (10): : 2047 - 2064
  • [34] FPGA BASED IMPLEMENTATION OF CONVOLUTIONAL NEURAL NETWORK FOR HYPERSPECTRAL CLASSIFICATION
    Chen, Xiaofeng
    Ji, Jingyu
    Mei, Shaohui
    Zhang, Yifan
    Han, Manli
    Du, Qian
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2451 - 2454
  • [35] An FPGA based Parallel Implementation for Point Cloud Neural Network
    Zheng, Xitao
    Zhu, Mingcheng
    Xu, Yuan
    Li, Yutong
    2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2019,
  • [36] FPGA-based system for artificial neural network arrhythmia classification
    Zairi, Hadjer
    Talha, Malika Kedir
    Meddah, Karim
    Slimane, Saliha Ould
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (08): : 4105 - 4120
  • [37] Modulation recognition using an FPGA-based convolutional neural network
    Liu, Xueyuan
    Shang, Jing
    Leong, Philip H. W.
    Liu, Cheng
    2019 22ND INTERNATIONAL CONFERENCE ON ELECTRICAL MACHINES AND SYSTEMS (ICEMS 2019), 2019, : 3165 - 3170
  • [38] Throughput optimizations for FPGA-based deep neural network inference
    Posewsky, Thorbjoern
    Ziener, Daniel
    MICROPROCESSORS AND MICROSYSTEMS, 2018, 60 : 151 - 161
  • [39] FPGA-based system for artificial neural network arrhythmia classification
    Hadjer Zairi
    Malika Kedir Talha
    Karim Meddah
    Saliha Ould Slimane
    Neural Computing and Applications, 2020, 32 : 4105 - 4120
  • [40] FPGA-Based Implementation of a Real-Time Object Recognition System Using Convolutional Neural Network
    Gilan, Ali Azarmi
    Emad, Mohammad
    Alizadeh, Bijan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (04) : 755 - 759