A Reconfigurable Coarse-to-Fine Approach for the Execution of CNN Inference Models in Low-Power Edge Devices

被引:1
作者
Rangsikunpum, Auangkun [1 ]
Amiri, Sam [1 ]
Ost, Luciano [1 ]
机构
[1] Loughborough Univ, Wolfson Sch Mech Elect & Mfg Engn, Loughborough, England
关键词
CONVOLUTIONAL NEURAL-NETWORKS; IMPLEMENTATION;
D O I
10.1049/cdt2/6214436
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural networks (CNNs) have evolved into essential components for a wide range of embedded applications due to their outstanding efficiency and performance. To efficiently deploy CNN inference models on resource-constrained edge devices, field programmable gate arrays (FPGAs) have become a viable processing solution because of their unique hardware characteristics, enabling flexibility, parallel computation and low-power consumption. In this regard, this work proposes an FPGA-based dynamic reconfigurable coarse-to-fine (C2F) inference of CNN models, aiming to increase power efficiency and flexibility. The proposed C2F approach first coarsely classifies related input images into superclasses and then selects the appropriate fine model(s) to recognise and classify the input images according to their bespoke categories. Furthermore, the proposed architecture can be reprogrammed to the original model using partial reconfiguration (PR) in case the typical classification is required. To efficiently utilise different fine models on low-cost FPGAs with area minimisation, ZyCAP-based PR is adopted. Results show that our approach significantly improves the classification process when object identification of only one coarse category of interest is needed. This approach can reduce energy consumption and inference time by up to 27.2% and 13.2%, respectively, which can greatly benefit resource-constrained applications.
引用
收藏
页数:16
相关论文
共 58 条
[1]  
AbdElMaksoud A. J., FPGA Design of High-Speed Convolutional Neural Network Hardware Accelerator, P376
[2]   Tactics to Directly Map CNN Graphs on Embedded FPGAs [J].
Abdelouahab, K. ;
Pelcat, M. ;
Serot, J. ;
Bourrasset, C. ;
Berry, F. .
IEEE EMBEDDED SYSTEMS LETTERS, 2017, 9 (04) :113-116
[3]  
AMD/Xilinx, 2023, VCK5000 Versal development card
[4]  
AMD Xilinx Inc, 2023, Vivado Design Suite User Guide: Dynamic Function
[5]  
AMD Xilinx Inc, 2023, Xilinx Vivado
[6]  
AMD Xilinx Inc, 2023, Xilinx Vitis AI
[7]  
ARM, 2023, AMBA Specifications
[8]   CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope [J].
Bhatt, Dulari ;
Patel, Chirag ;
Talsania, Hardik ;
Patel, Jigar ;
Vaghela, Rasmika ;
Pandya, Sharnil ;
Modi, Kirit ;
Ghayvat, Hemant .
ELECTRONICS, 2021, 10 (20)
[9]   FINN-R: An End-to-End Deep-Learning Framework for Fast Exploration of Quantized Neural Networks [J].
Blott, Michaela ;
Preusser, Thomas B. ;
Fraser, Nicholas J. ;
Gambardella, Giulio ;
O'Brien, Kenneth ;
Umuroglu, Yaman ;
Leeser, Miriam ;
Vissers, Kees .
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2018, 11 (03)
[10]  
Bojarski Mariusz, 2016, arXiv