A Multi-Level-Optimization Framework for FPGA-Based Cellular Neural Network Implementation

被引：14

作者：

Liu, Zhongyang ^{[1
]}

Luo, Shaoheng ^{[1
]}

Xu, Xiaowei ^{[2
]}

Shi, Yiyu ^{[2
]}

Zhuo, Cheng ^{[1
]}

机构：

[1] Zhejiang Univ, 38 Zheda Rd, Hangzhou 310027, Zhejiang, Peoples R China

[2] Univ Notre Dame, Notre Dame, IN 46556 USA

来源：

ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS | 2018年 / 14卷 / 04期

关键词：

Cellular neural network; FPGA; acceleration; IMAGE; CNN; CLASSIFICATION;

D O I：

10.1145/3273957

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cellular Neural Network (CeNN) is considered as a powerful paradigm for embedded devices. Its analog and mix-signal hardware implementations are proved to be applicable to high-speed image processing, video analysis, and medical signal processing with its efficiency and popularity limited by smaller implementation size and lower precision. Recently, digital implementations of CeNNs on FPGA have attracted researchers from both academia and industry due to its high flexibility and short time-to-market. However, most existing implementations are not well optimized to fully utilize the advantages of FPGA platform with unnecessary design and computational redundancy that prevents speedup. We propose a multi-level-optimization framework for energy-efficient CeNN implementations on FPGAs. In particular, the optimization framework is featured with three level optimizations: system-, module-, and design-space-level, with focus on computational redundancy and attainable performance, respectively. Experimental results show that with various configurations our framework can achieve an energy-efficiency improvement of 3.54x and up to 3.88x speedup compared with existing implementations with similar accuracy.

引用

页数：17

共 50 条

[31] A Review of FPGA-Based Custom Computing Architecture for Convolutional Neural Network Inference
Peng Xiyuan
Yu Jinxiang
Yao Bowen
Liu Liansheng
Peng Yu
CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (01) : 1 - 17
[32] Using Data Compression for Optimizing FPGA-Based Convolutional Neural Network Accelerators
Guan, Yijin
Xu, Ningyi
Zhang, Chen
Yuan, Zhihang
Cong, Jason
ADVANCED PARALLEL PROCESSING TECHNOLOGIES, 2017, 10561 : 14 - 26
[33] An FPGA-Based Microinstruction Sequence Driven Spaceborne Convolution Neural Network Accelerator
Guo Z.-B.
Liu K.
Hu H.-T.
Li Y.-D.
Qu Z.-X.
Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (10): : 2047 - 2064
[34] FPGA BASED IMPLEMENTATION OF CONVOLUTIONAL NEURAL NETWORK FOR HYPERSPECTRAL CLASSIFICATION
Chen, Xiaofeng
Ji, Jingyu
Mei, Shaohui
Zhang, Yifan
Han, Manli
Du, Qian
IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2451 - 2454
[35] An FPGA based Parallel Implementation for Point Cloud Neural Network
Zheng, Xitao
Zhu, Mingcheng
Xu, Yuan
Li, Yutong
2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2019,
[36] FPGA-based system for artificial neural network arrhythmia classification
Zairi, Hadjer
Talha, Malika Kedir
Meddah, Karim
Slimane, Saliha Ould
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (08): : 4105 - 4120
[37] Modulation recognition using an FPGA-based convolutional neural network
Liu, Xueyuan
Shang, Jing
Leong, Philip H. W.
Liu, Cheng
2019 22ND INTERNATIONAL CONFERENCE ON ELECTRICAL MACHINES AND SYSTEMS (ICEMS 2019), 2019, : 3165 - 3170
[38] Throughput optimizations for FPGA-based deep neural network inference
Posewsky, Thorbjoern
Ziener, Daniel
MICROPROCESSORS AND MICROSYSTEMS, 2018, 60 : 151 - 161
[39] FPGA-based system for artificial neural network arrhythmia classification
Hadjer Zairi
Malika Kedir Talha
Karim Meddah
Saliha Ould Slimane
Neural Computing and Applications, 2020, 32 : 4105 - 4120
[40] FPGA-Based Implementation of a Real-Time Object Recognition System Using Convolutional Neural Network
Gilan, Ali Azarmi
Emad, Mohammad
Alizadeh, Bijan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (04) : 755 - 759

← 1 2 3 4 5 →