A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge

被引:1
|
作者
Mahmud, Hasanul [1 ]
Kang, Peng [1 ]
Desai, Kevin [1 ]
Lama, Palden [1 ]
Prasad, Sushil K. [1 ]
机构
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
关键词
Energy-efficiency; Deep Neural Networks; Edge Computing; Early-exit DNNs; Converting Autoencoder;
D O I
10.1109/IPDPSW63119.2024.00117
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Reducing inference time and energy usage while maintaining prediction accuracy has become a significant concern for deep neural networks (DNN) inference on resourcecon-strained edge devices. To address this problem, we propose a novel approach based on "converting" autoencoder and lightweight DNNs. This improves upon recent work such as early-exiting framework and DNN partitioning. Early-exiting frameworks spend different amounts of computation power for different input data depending upon their complexity. However, they can be inefficient in real-world scenarios that deal with many hard image samples. On the other hand, DNN partitioning algorithms that utilize the computation power of both the cloud and edge devices can be affected by network delays and intermittent connections between the cloud and the edge. We present CBNet, a low-latency and energy-efficient DNN inference framework tailored for edge devices. It utilizes a "converting" autoencoder to efficiently transform hard images into easy ones, which are subsequently processed by a lightweight DNN for inference. To the best of our knowledge, such autoencoder has not been proposed earlier. Our experimental results using three popular image-classification datasets on a Raspberry Pi 4, a Google Cloud instance, and an instance with Nvidia Tesla K80 GPU show that CBNet achieves up to 4.8 x speedup in inference latency and 79% reduction in energy usage compared to competing techniques while maintaining similar or higher accuracy.
引用
收藏
页码:592 / 599
页数:8
相关论文
共 50 条
  • [31] Panacea: A Low-Latency, Energy-Efficient Neighbor Discovery Protocol for Wireless Sensor Networks
    Cao, Zhen
    Gu, Zhaoquan
    Wang, Yuexuan
    Cui, Heming
    2018 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2018,
  • [32] Energy-Efficient Low-Latency Signed Multiplier for FPGA-Based Hardware Accelerators
    Ullah, Salim
    Nguyen, Tuan Duy Anh
    Kumar, Akash
    IEEE EMBEDDED SYSTEMS LETTERS, 2021, 13 (02) : 41 - 44
  • [33] Low-Latency Smart Grid Asset Monitoring for Load Control of Energy-Efficient Buildings
    Al-Anbagi, Irfan
    Erol-Kantarci, Melike
    Mouftah, Hussein T.
    2012 IEEE International Conference on Smart Grid Engineering (SGE), 2012,
  • [34] Study on the Solutions to Heterogeneous ONU Propagation Delays for Energy-Efficient and Low-Latency EPONs
    Lv, Yunxin
    Bi, Meihua
    Zhai, Yanrong
    Chi, Hao
    Wang, Yuxi
    IEEE ACCESS, 2020, 8 : 193665 - 193680
  • [35] LEoNIDS: A Low-Latency and Energy-Efficient Network-Level Intrusion Detection System
    Tsikoudis, Nikos
    Papadogiannakis, Antonis
    Markatos, Evangelos P.
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2016, 4 (01) : 142 - 155
  • [36] On the design of an energy-efficient low-latency integrated protocol for distributed mobile sensor networks
    Ruzzelli, AG
    Evers, L
    Dulman, S
    van Hoesel, LFW
    Havinga, PJM
    2004 INTERNATIONAL WORKSHOP ON WIRELESS AD-HOC NETWORKS, 2005, : 35 - 44
  • [37] Energy-Efficient, Low-Latency Realization of Neural Networks through Boolean Logic Minimization
    Nazemi, Mahdi
    Pasandi, Ghasem
    Pedram, Massoud
    24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 274 - 279
  • [38] U-Connect: A Low-Latency Energy-Efficient Asynchronous Neighbor Discovery Protocol
    Kandhalu, Arvind
    Lakshmanan, Karthik
    Rajkumar, Ragunathan
    PROCEEDINGS OF THE 9TH ACM/IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS, 2010, : 350 - 361
  • [39] Low-Latency Distributed Inference at the Network Edge Using Rateless Codes
    Frigard, Anton
    Kumar, Siddhartha
    Rosnes, Eirik
    Graell i Amat, Alexandre
    2021 17TH INTERNATIONAL SYMPOSIUM ON WIRELESS COMMUNICATION SYSTEMS, ISWCS, 2021,
  • [40] EdgeDRNN: Enabling Low-latency Recurrent Neural Network Edge Inference
    Gao, Chang
    Rios-Navarro, Antonio
    Chen, Xi
    Delbruck, Tobi
    Liu, Shih-Chii
    2020 2ND IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2020), 2020, : 41 - 45