Optimizing Deep Learning Acceleration on FPGA for Real-Time and Resource-Efficient Image Classification

Cited by: 2
Authors
Khaki, Ahmad Mouri Zadeh [1]
Choi, Ahyoung [1]
Affiliations
[1] Gachon Univ, Dept AI & Software, Seongnam Si 13120, South Korea
Source
APPLIED SCIENCES-BASEL | 2025 / Vol. 15 / Issue 01
Keywords
AI hardware acceleration; convolutional neural network (CNN); deep learning; field-programmable gate array (FPGA); transfer learning; TO-DIGITAL CONVERTER; DESIGN; IMPLEMENTATION; EYE; CNN
DOI
10.3390/app15010422
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Deep learning (DL) has revolutionized image classification, yet deploying convolutional neural networks (CNNs) on edge devices for real-time applications remains challenging because of constraints on computation, memory, and power. This work presents an optimized implementation of VGG16 and VGG19, two widely used CNN architectures, for classifying the CIFAR-10 dataset using transfer learning on field-programmable gate arrays (FPGAs). Using the Xilinx Vitis-AI and TensorFlow2 frameworks, we adapt VGG16 and VGG19 for FPGA deployment through quantization, compression, and hardware-specific optimizations. Our implementation achieves Top-1 accuracy of 89.54% for VGG16 and 87.47% for VGG19 while reducing inference latency by factors of 7.29 and 6.6, respectively, compared to CPU-based alternatives. These results highlight the suitability of our approach for resource-efficient, real-time edge applications. Key contributions include a detailed methodology for combining transfer learning with FPGA acceleration, an analysis of hardware resource utilization, and performance benchmarks. This work underscores the potential of FPGA-based solutions to enable scalable, low-latency DL deployments in domains such as autonomous systems, IoT, and mobile devices.
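The abstract names quantization as one of the steps used to fit the networks onto the FPGA, but this record does not describe the scheme itself. As a rough illustration of the general idea only, the sketch below applies symmetric per-tensor int8 quantization to a handful of made-up weights in plain Python. It is not the Vitis-AI quantizer and does not reproduce the authors' pipeline; the function names and the example values are hypothetical.

```python
# Illustrative sketch only: symmetric per-tensor int8 quantization.
# Not the Vitis-AI quantizer; all names and values here are hypothetical.

def quantize_int8(values):
    """Map floats to integer codes in [-127, 127] using one shared scale."""
    max_abs = max(abs(v) for v in values)
    # Guard against an all-zero tensor, where any positive scale works.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale

def dequantize(codes, scale):
    """Recover approximate float values from the integer codes."""
    return [c * scale for c in codes]

def max_abs_error(values):
    """Largest absolute difference between values and their round trip."""
    codes, scale = quantize_int8(values)
    restored = dequantize(codes, scale)
    return max(abs(a - b) for a, b in zip(values, restored))

weights = [0.42, -1.27, 0.05, 0.9, -0.33]  # made-up example weights
codes, scale = quantize_int8(weights)
```

Storing 8-bit codes plus a single scale in place of 32-bit floats cuts weight storage to roughly a quarter, at the cost of a rounding error of at most half a scale step per value. Production toolchains add calibration data and hardware-specific constraints that this sketch omits.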
Pages: 13