A Low-Cost Neural ODE with Depthwise Separable Convolution for Edge Domain Adaptation on FPGAs

被引:1
|
作者
Kawakami, Hiroki [1 ]
Watanabe, Hirohisa [1 ]
Sugiura, Keisuke [1 ]
Matsutani, Hiroki [1 ]
机构
[1] Keio Univ, Grad Sch Sci & Technol, Yokohama 2238522, Japan
基金
日本学术振兴会;
关键词
domain adaptation; neural ODE; distillation; FPGA; edge device;
D O I
10.1587/transinf.2022EDP7149
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-performance deep neural network (DNN)-based systems are in high demand in edge environments. Due to its high com-putational complexity, it is challenging to deploy DNNs on edge devices with strict limitations on computational resources. In this paper, we derive a compact while highly-accurate DNN model, termed dsODENet, by com-bining recently-proposed parameter reduction techniques: Neural ODE (Ordinary Differential Equation) and DSC (Depthwise Separable Convo-lution). Neural ODE exploits a similarity between ResNet and ODE, and shares most of weight parameters among multiple layers, which greatly reduces the memory consumption. We apply dsODENet to a domain adap-tation as a practical use case with image classification datasets. We also propose a resource-efficient FPGA-based design for dsODENet, where all the parameters and feature maps except for pre-and post-processing lay-ers can be mapped onto on-chip memories. It is implemented on Xilinx ZCU104 board and evaluated in terms of domain adaptation accuracy, in-ference speed, FPGA resource utilization, and speedup rate compared to a software counterpart. The results demonstrate that dsODENet achieves comparable or slightly better domain adaptation accuracy compared to our baseline Neural ODE implementation, while the total parameter size with-out pre-and post-processing layers is reduced by 54.2% to 79.8%. Our FPGA implementation accelerates the inference speed by 23.8 times.
引用
收藏
页码:1186 / 1197
页数:12
相关论文
共 30 条
  • [1] Low-Cost Domain Adaptation for Crop and Weed Segmentation
    Vasconcelos, Gustavo J. Q.
    Spina, Thiago V.
    Pedrini, Helio
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2021, 2021, 12702 : 141 - 150
  • [2] JPEG encoder for low-cost FPGAs
    Osman, Hossam
    Mahjoup, Waseim
    Nabih, Azza
    Aly, Gamal M.
    2007 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS: ICCES '07, 2007, : 406 - +
  • [3] Low-cost Technique for Measuring Clock Duty Cycle on FPGAs
    Lee, Seongkwan
    Kim, Taehwan
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [4] Low-cost ANS encoder for lossless data compression in FPGAs
    Pastula, Magdalena
    Russek, Pawel
    Wiatr, Kazimierz
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2024, 70 (01) : 219 - 228
  • [5] High-Speed Clock Recovery for Low-Cost FPGAs
    Haller, Istvan
    Baruch, Zoltan Francisc
    2010 DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2010), 2010, : 610 - 613
  • [6] Design and implementation of an efficient CNN accelerator for low-cost FPGAs
    Xu Y.
    Wang S.
    Li N.
    Xiao H.
    IEICE Electronics Express, 2022, 19 (19)
  • [7] A Case for Low-Cost Personal Electronic Laboratory Equipment Using FPGAs
    Adegbite, Timothy Olanrewaju
    Akinwale, Olawale Babatunde
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (10) : 4 - 19
  • [8] Domain Adaptation-Based Deep Calibration of Low-Cost PM2.5 Sensors
    Jha, Sonu Kumar
    Kumar, Mohit
    Arora, Vipul
    Tripathi, Sachchida Nand
    Motghare, Vidyanand Motiram
    Shingare, A. A.
    Rajput, Karansingh A.
    Kamble, Sneha
    IEEE SENSORS JOURNAL, 2021, 21 (22) : 25941 - 25949
  • [9] An Efficient CNN Accelerator for Low-Cost Edge Systems
    Choi, Kyubaik
    Sobelman, Gerald E.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (04)
  • [10] PASC: Physically Authenticated Stable-Clocked SoC Platform on Low-Cost FPGAs
    Aysu, Aydin
    Schaumont, Patrick
    2013 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2013,