A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge

Cited by: 1

Authors:

Mahmud, Hasanul [1]

Kang, Peng [1]

Desai, Kevin [1]

Lama, Palden [1]

Prasad, Sushil K. [1]

Affiliations:

[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA

Source:

2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024 | 2024

Keywords:

Energy-efficiency; Deep Neural Networks; Edge Computing; Early-exit DNNs; Converting Autoencoder;

DOI:

10.1109/IPDPSW63119.2024.00117

Chinese Library Classification:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Reducing inference time and energy usage while maintaining prediction accuracy has become a significant concern for deep neural network (DNN) inference on resource-constrained edge devices. To address this problem, we propose a novel approach based on a "converting" autoencoder and lightweight DNNs. This improves upon recent work such as early-exiting frameworks and DNN partitioning. Early-exiting frameworks spend different amounts of computation power on different input data depending upon their complexity. However, they can be inefficient in real-world scenarios that deal with many hard image samples. On the other hand, DNN partitioning algorithms that utilize the computation power of both the cloud and edge devices can be affected by network delays and intermittent connections between the cloud and the edge. We present CBNet, a low-latency and energy-efficient DNN inference framework tailored for edge devices. It utilizes a "converting" autoencoder to efficiently transform hard images into easy ones, which are subsequently processed by a lightweight DNN for inference. To the best of our knowledge, such an autoencoder has not been proposed earlier. Our experimental results using three popular image-classification datasets on a Raspberry Pi 4, a Google Cloud instance, and an instance with an Nvidia Tesla K80 GPU show that CBNet achieves up to 4.8x speedup in inference latency and 79% reduction in energy usage compared to competing techniques while maintaining similar or higher accuracy.
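
The two-stage flow described in the abstract (a "converting" autoencoder followed by a lightweight DNN) can be illustrated with a minimal sketch. This is not the authors' implementation: the class names, layer sizes, random untrained weights, and the single-layer classifier are all hypothetical stand-ins chosen only to show how data moves through the two stages.

```python
import math
import random

random.seed(0)

def rand_matrix(rows, cols, scale=0.05):
    """Random weights; a real system would load trained parameters instead."""
    return [[random.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]

def matvec(vec, matrix):
    """Multiply a row vector of length R by an R x C matrix."""
    cols = len(matrix[0])
    return [sum(v * row[c] for v, row in zip(vec, matrix)) for c in range(cols)]

class ConvertingAutoencoder:
    """Hypothetical stand-in: maps an image to another image of the same size."""
    def __init__(self, n_pixels, n_hidden):
        self.w_enc = rand_matrix(n_pixels, n_hidden)
        self.w_dec = rand_matrix(n_hidden, n_pixels)

    def convert(self, image):
        code = [max(0.0, h) for h in matvec(image, self.w_enc)]  # ReLU encoder
        # Sigmoid decoder keeps output pixel values in (0, 1).
        return [1.0 / (1.0 + math.exp(-p)) for p in matvec(code, self.w_dec)]

class LightweightClassifier:
    """Hypothetical stand-in for the lightweight DNN: one linear layer."""
    def __init__(self, n_pixels, n_classes):
        self.w = rand_matrix(n_pixels, n_classes)

    def predict(self, image):
        scores = matvec(image, self.w)
        return max(range(len(scores)), key=scores.__getitem__)  # argmax

def two_stage_inference(image, autoencoder, classifier):
    """Stage 1 converts the image; stage 2 classifies the converted image."""
    return classifier.predict(autoencoder.convert(image))

# Assumed sizes for illustration only (an 8x8 image, 10 classes).
N_PIXELS, N_HIDDEN, N_CLASSES = 64, 8, 10
image = [random.random() for _ in range(N_PIXELS)]
autoencoder = ConvertingAutoencoder(N_PIXELS, N_HIDDEN)
classifier = LightweightClassifier(N_PIXELS, N_CLASSES)
label = two_stage_inference(image, autoencoder, classifier)
print("predicted class index:", label)
```

Every input follows the same fixed path in this sketch, in contrast to an early-exit network, where the amount of computation depends on the input.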


Pages: 592-599

Page count: 8
