CAE-Net: Enhanced Converting Autoencoder based Framework for Low-latency Energy-efficient DNN with SLO-constraints

被引:0
|
作者
Mahmud, Hasanul [1 ]
Kang, Peng [1 ]
Lama, Palden [1 ]
Desai, Kevin [1 ]
Prasad, Sushil K. [1 ]
机构
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
关键词
Converting Autoencoder; Low latency; Energy efficiency; Edge devices; DNN compression; INFERENCE;
D O I
10.1109/Cloud-Summit61220.2024.00028
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As deep neural networks (DNNs) continue to be used on resource-limited edge devices with low latency requirements for interactive applications, there is a growing need to reduce inference time and energy consumption while maintaining acceptable prediction accuracy. In response, we introduce a novel framework, CAE-Net, for designing and training lightweight and energyefficient deep neural networks (DNNs) for image classification on edge devices. The proposed framework consists of two parts: (1) a new Enhanced Converting Autoencoder that employs entropy-based intraclass clustering to learn the key image features by transforming the hard images into easy representative images, and (2) a composite lightweight CAE-Net classifier employing the pre-trained encoder of the Converting Autoencoder followed by a few classification layers from a baseline DNN trained using knowledge transfer. Unlike many state-of-the-art models, our experimental results using popular image-classification datasets, MNIST and CIFAR10 demonstrate that CAE-Net can satisfy the inference latency target of 10-20ms on Raspberry Pi and 5-10 ms on Nvidia Jetson Nano. Compared with the competing models meeting the SLO targets, CAE-Net achieves over 4-fold energy reduction and inferencing latency speedups on the CIFAR-10 dataset compared to AlexNet and its pruned/distilled variants and other DNNs on Raspberry Pi and about 6-fold on Jetson Nano while maintaining similar or higher accuracy.
引用
收藏
页码:128 / 134
页数:7
相关论文
共 18 条
  • [1] A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
    Mahmud, Hasanul
    Kang, Peng
    Desai, Kevin
    Lama, Palden
    Prasad, Sushil K.
    2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 592 - 599
  • [2] Low-latency and energy-efficient scheduling in fog-based IoT applications
    Rahbari, Dadmehr
    Nickray, Mohsen
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1406 - 1427
  • [3] Energy-Efficient Low-Latency Signed Multiplier for FPGA-Based Hardware Accelerators
    Ullah, Salim
    Nguyen, Tuan Duy Anh
    Kumar, Akash
    IEEE EMBEDDED SYSTEMS LETTERS, 2021, 13 (02) : 41 - 44
  • [4] A Low-Latency and Energy-Efficient Multimetric Routing Protocol Based on Network Connectivity in VANET Communication
    Wang, Xiaobo
    Weng, Yu
    Gao, Honghao
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2021, 5 (04): : 1761 - 1776
  • [5] Tree-Model Based Mapping for Energy-Efficient and Low-Latency Network-on-Chip
    Yang, Bo
    Xu, Thomas Canhao
    Santti, Tero
    Plosila, Juha
    PROCEEDINGS OF THE 13TH IEEE SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, 2010, : 189 - 192
  • [6] A Co-Design-Based Reliable Low-Latency and Energy-Efficient Transmission Protocol for UWSNs
    Wei, Xiaohui
    Guo, Hao
    Wang, Xingwang
    Wang, Xiaonan
    Wang, Chu
    Guizani, Mohsen
    Du, Xiaojiang
    SENSORS, 2020, 20 (21) : 1 - 22
  • [7] AdaBoost-powered cloud of things framework for low-latency, energy-efficient chronic kidney disease prediction
    Al-Kateeb, Zeena N.
    Abdullah, Dhuha Basheer
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2024, 35 (06):
  • [8] An adaptive energy-efficient and low-latency MAC for tree-based data gathering in sensor networks
    Lu, Gang
    Krishnamachari, Bhaskar
    Raghavendra, Cauligi S.
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2007, 7 (07): : 863 - 875
  • [9] A Low-Latency, Energy-Efficient L1 Cache Based on a Self-Timed Pipeline
    Trudeau, L. -C.
    Gagnon, G.
    Gagnon, F.
    Thibeault, C.
    Awad, T.
    Morrissey, D.
    21ST IEEE INTERNATIONAL SYMPOSIUM ON ASYNCHRONOUS CIRCUITS AND SYSTEMS (ASYNC 2015), 2015, : 17 - 18
  • [10] Pairwise-Combination-Based DBA Scheme for Energy-efficient Low-latency TDM-PONs
    Zhu, Min
    Li, Guixin
    Zhang, Shengyu
    Gu, Jiahua
    Chen, Bin
    Sun, Qin
    2018 10TH INTERNATIONAL CONFERENCE ON ADVANCED INFOCOMM TECHNOLOGY (ICAIT), 2018, : 128 - 132