Design Space Exploration for CNN Offloading to FPGAs at the Edge

被引:0
|
作者
Korol, Guilherme [1 ]
Jordan, Michael Guilherme [1 ]
Rutzig, Mateus Beck [2 ]
Castrillon, Jeronimo [3 ,4 ]
Schneider Beck, Antonio Carlos [1 ]
机构
[1] Univ Fed Rio Grande do Sul UFRGS, Inst Informat, Porto Alegre, Brazil
[2] Univ Fed Santa Maria UFSM, Elect & Comp Dept, Santa Maria, Brazil
[3] Tech Univ Dresden, Ctr Adv Elect Dresden, Dresden, Germany
[4] Ctr Scalable Data Analyt & Artificial Intelligenc, Dresden, Germany
来源
2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI | 2023年
基金
巴西圣保罗研究基金会;
关键词
Edge Computing; IoT; Offloading; CNN; FPGA;
D O I
10.1109/ISVLSI59464.2023.10238644
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
AI-based IoT applications relying on heavy-load deep learning algorithms like CNNs challenge IoT devices that are restricted in energy or processing capabilities. Edge computing offers an alternative by allowing the data to get offloaded to so-called edge servers with hardware more powerful than IoT devices and physically closer than the cloud. However, the increasing complexity of data and algorithms and diverse conditions make even powerful devices, such as those equipped with FPGAs, insufficient to cope with the current demands. In this case, optimizations in the algorithms, like pruning and early-exit, are mandatory to reduce the CNNs computational burden and speed up inference processing. With that in mind, we propose ExpOL, which combines the pruning and early-exit CNN optimizations in a system-level FPGA-based IoT-Edge design space exploration. Based on a user-defined multi-target optimization, ExpOL delivers designs tailored to specific application environments and user needs. When evaluated against state-of-the-art FPGA-based accelerators (either local or offloaded), designs produced by ExpOL are more power-efficient (by up to 2x) and process inferences at higher user quality of experience (by up to 12.5%).
引用
收藏
页码:276 / 281
页数:6
相关论文
共 50 条
  • [21] Design of Driver Stress Prediction Model with CNN-LSTM: Exploration of Feature Space using Genetic Programming
    Yang, Tingting
    Xue, Chenhao
    Chen, Jun
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [22] Approximation Algorithm for the Offloading Problem in Edge Computing
    Han, Xinxin
    Gao, Guichen
    Ning, Li
    Wang, Yang
    Zhang, Yong
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PT I, 2020, 12384 : 134 - 144
  • [23] OpenMP Device Offloading to FPGAs Using the Nymble Infrastructure
    Huthmann, Jens
    Sommer, Lukas
    Podobas, Artur
    Koch, Andreas
    Sano, Kentaro
    OPENMP: PORTABLE MULTI-LEVEL PARALLELISM ON MODERN SYSTEMS, 2020, 12295 : 265 - 279
  • [24] Parameterized Deep Reinforcement Learning With Hybrid Action Space for Edge Task Offloading
    Wang, Ting
    Deng, Yuxiang
    Yang, Zhao
    Wang, Yang
    Cai, Haibin
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (06) : 10754 - 10767
  • [25] C-LSTM: CNN and LSTM Based Offloading Prediction Model in Mobile Edge Computing (MEC)
    Zhao, Ming
    Li, Yixiang
    Asif, Sohaib
    Zhu, Yusen
    Tang, Fengxiao
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (IEEE HPSR), 2022, : 245 - 251
  • [26] Design Framework for SRAM-Based Computing-In-Memory Edge CNN Accelerators
    Wang, Yimin
    Zou, Zhuo
    Zheng, Lirong
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [27] Toward Computation Offloading in Edge Computing: A Survey
    Jiang, Congfeng
    Cheng, Xiaolan
    Gao, Honghao
    Zhou, Xin
    Wan, Jian
    IEEE ACCESS, 2019, 7 : 131543 - 131558
  • [28] Intelligent Offloading Strategy Design for Relaying Mobile Edge Computing Networks
    Guo, Yinghao
    Zhao, Zichao
    Zhao, Rui
    Lai, Shiwei
    Dan, Zou
    Xia, Junjuan
    Fan, Liseng
    IEEE ACCESS, 2020, 8 : 35127 - 35135
  • [29] IEEE802.16-2004 OFDM FUNCTIONS IMPLEMENTATION ON FPGAS WITH DESIGN EXPLORATION
    Sghaier, Ahmad
    Areibi, Shawki
    Dony, Robert
    2008 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE AND LOGIC APPLICATIONS, VOLS 1 AND 2, 2008, : 518 - 521
  • [30] Late Breaking Results: Automated Hardware Generation of CNN Models on FPGAs
    Kwadjo, Danielle Tchuinkou
    Bobda, Christophe
    PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,