Design Space Exploration for CNN Offloading to FPGAs at the Edge

被引:0
|
作者
Korol, Guilherme [1 ]
Jordan, Michael Guilherme [1 ]
Rutzig, Mateus Beck [2 ]
Castrillon, Jeronimo [3 ,4 ]
Schneider Beck, Antonio Carlos [1 ]
机构
[1] Univ Fed Rio Grande do Sul UFRGS, Inst Informat, Porto Alegre, Brazil
[2] Univ Fed Santa Maria UFSM, Elect & Comp Dept, Santa Maria, Brazil
[3] Tech Univ Dresden, Ctr Adv Elect Dresden, Dresden, Germany
[4] Ctr Scalable Data Analyt & Artificial Intelligenc, Dresden, Germany
来源
2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI | 2023年
基金
巴西圣保罗研究基金会;
关键词
Edge Computing; IoT; Offloading; CNN; FPGA;
D O I
10.1109/ISVLSI59464.2023.10238644
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
AI-based IoT applications relying on heavy-load deep learning algorithms like CNNs challenge IoT devices that are restricted in energy or processing capabilities. Edge computing offers an alternative by allowing the data to get offloaded to so-called edge servers with hardware more powerful than IoT devices and physically closer than the cloud. However, the increasing complexity of data and algorithms and diverse conditions make even powerful devices, such as those equipped with FPGAs, insufficient to cope with the current demands. In this case, optimizations in the algorithms, like pruning and early-exit, are mandatory to reduce the CNNs computational burden and speed up inference processing. With that in mind, we propose ExpOL, which combines the pruning and early-exit CNN optimizations in a system-level FPGA-based IoT-Edge design space exploration. Based on a user-defined multi-target optimization, ExpOL delivers designs tailored to specific application environments and user needs. When evaluated against state-of-the-art FPGA-based accelerators (either local or offloaded), designs produced by ExpOL are more power-efficient (by up to 2x) and process inferences at higher user quality of experience (by up to 12.5%).
引用
收藏
页码:276 / 281
页数:6
相关论文
共 50 条
  • [31] UniWiG: Unified Winograd-GEMM Architecture for Accelerating CNN on FPGAs
    Kala, S.
    Mathew, Jimson
    Jose, Babita R.
    Nalesh, S.
    2019 32ND INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2019 18TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID), 2019, : 209 - 214
  • [32] Task offloading in edge computing for machine learning-based smart healthcare
    Aazam, Mohammad
    Zeadally, Sherali
    Flushing, Eduardo Feo
    COMPUTER NETWORKS, 2021, 191
  • [33] Efficient Edge-AI Application Deployment for FPGAs
    Kalapothas, Stavros
    Flamis, Georgios
    Kitsos, Paris
    INFORMATION, 2022, 13 (06)
  • [34] Exploration of Data from Smart Bands in the Cloud and on the Edge - The Impact on the Data Storage Space
    Golosz, Mateusz
    Mrozek, Dariusz
    COMPUTATIONAL SCIENCE - ICCS 2019, PT III, 2019, 11538 : 607 - 620
  • [35] Adaptive and Offloaded CNNs for IoT-Edge FPGAs
    Korol, Guilherme
    Rocha, Hiago Mayk G. de A.
    Schneider Beck, Antonio Carlos
    2024 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2024, : 69 - 74
  • [36] FADES: Fine-Grained Edge Offloading with Unikernels
    Cozzolino, Vittorio
    Ding, Aaron Yi
    Ott, Joerg
    PROCEEDINGS OF THE 2017 WORKSHOP ON HOT TOPICS IN CONTAINER NETWORKING AND NETWORKED SYSTEMS (HOTCONNET 17), 2017, : 36 - 41
  • [37] ADS-CNN: Adaptive Dataflow Scheduling for lightweight CNN accelerator on FPGAs
    Wan, Yi
    Xie, Xianzhong
    Chen, Junfan
    Xie, Kunpeng
    Yi, Dezhi
    Lu, Ye
    Gai, Keke
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 158 : 138 - 149
  • [38] A semidefinite relaxation approach for the offloading problem in edge computing
    Han, Xinxin
    Gao, Guichen
    Ning, Li
    Wang, Yang
    Zhang, Yong
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 98
  • [39] A Container Based Edge Offloading Framework for Autonomous Driving
    Tang, Jie
    Yu, Rao
    Liu, Shaoshan
    Gaudiot, Jean-Luc
    IEEE ACCESS, 2020, 8 : 33713 - 33726
  • [40] An Efficient Reconfigurable Framework for General Purpose CNN-RNN Models on FPGAs
    Zeng, Shulin
    Guo, Kaiyuan
    Fang, Shaoxia
    Kang, Junlong
    Xie, Dongliang
    Shan, Yi
    Wang, Yu
    Yang, Huazhong
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,