Auto-Tiler: Variable-Dimension Autoencoder with Tiling for Compressing Intermediate Feature Space of Deep Neural Networks for Internet of Things

被引：3

作者：

Park, Jeongsoo ^{[1
]}

Kim, Jungrae ^{[2
]}

Ko, Jong Hwan ^{[1
]}

机构：

[1] Sungkyunkwan Univ, Dept Elect & Elect Engn, Suwon 16419, Gyeonggi Do, South Korea

[2] Sungkyunkwan Univ, Dept Semicond Syst Engn, Suwon 16419, Gyeonggi Do, South Korea

来源：

SENSORS | 2021年 / 21卷 / 03期

基金：

新加坡国家研究基金会;

关键词：

collaborative intelligence; deep feature compression; inference partitioning; autoencoder; convolutional neural network; deep learning; machine learning; distributed computation; Internet of Things;

D O I：

10.3390/s21030896

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Due to limited resources of the Internet of Things (IoT) edge devices, deep neural network (DNN) inference requires collaboration with cloud server platforms, where DNN inference is partitioned and offloaded to high-performance servers to reduce end-to-end latency. As data-intensive intermediate feature space at the partitioned layer should be transmitted to the servers, efficient compression of the feature space is imperative for high-throughput inference. However, the feature space at deeper layers has different characteristics than natural images, limiting the compression performance by conventional preprocessing and encoding techniques. To tackle this limitation, we introduce a new method for compressing DNN intermediate feature space using a specialized autoencoder, called auto-tiler. The proposed auto-tiler is designed to include the tiling process and provide multiple input/output dimensions to support various partitioned layers and compression ratios. The results show that auto-tiler achieves 18% to 67% higher percent point accuracy compared to the existing methods at the same bitrate while reducing the process latency by 73% to 81%. The dimension variability of an auto-tiler also reduces the storage overhead by 62% with negligible accuracy loss.

引用

页码：1 / 17

页数：17

共 31 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

Anh H.N., 2018, YOLO3 DETECTION TRAI

[3]

[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4

[4]

Chayapathy V, 2017, PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES FOR SMART NATION (SMARTTECHCON), P385, DOI 10.1109/SmartTechCon.2017.8358401

[5] Lossy Intermediate Deep Learning Feature Compression and Evaluation [J].

Chen, Zhuo ;

Fan, Kui ;

Wang, Shiqi ;

Duan, Ling-Yu ;

Lin, Weisi ;

Kot, Alex C. .

PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :2414-2422

[6]

Choi H.S., 2018, 2018 International Workshop on Advanced Image Technology, IWAIT 2018, V30, DOI DOI 10.1109/IWAIT.2018.8369653OF

[7]

Choi H, 2018, IEEE IMAGE PROC, P3743, DOI 10.1109/ICIP.2018.8451100

[8] BottleNet: A Deep Learning Architecture for Intelligent Mobile Cloud Computing Services [J].

Eshratifar, Amir Erfan ;

Esmaili, Amirhossein ;

Pedram, Massoud .

2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2019,

[9]

Everingham M., 2012, The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results

[10] The impact of image information on compressibility and degradation in medical image compression [J].

Fidler, Ales ;

Skaleric, Uros ;

Likar, Bostjan .

MEDICAL PHYSICS, 2006, 33 (08) :2832-2838

← 1 2 3 4 →