EdgeML: An AutoML Framework for Real-Time Deep Learning on the Edge

被引：33

作者：

Zhao, Zhihe ^{[1
]}

Wang, Kai ^{[2
]}

Ling, Neiwen ^{[1
]}

Xing, Guoliang ^{[1
]}

机构：

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[2] Duke Univ, Durham, NC USA

来源：

PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET-OF-THINGS DESIGN AND IMPLEMENTATION, IOTDI 2021 | 2021年

关键词：

Reinforcement Learning; Edge Computing; Deep Neural Network; INTERNET;

D O I：

10.1145/3450268.3453520

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent years, deep learning algorithms are increasingly adopted by a wide range of data-intensive and time-critical Internet of Things (IoT) applications. As a result, several new approaches, including model partition/offloading and progressive neural architecture, have been proposed to address the challenge of deploying the computation-intensive deep neural network (DNN) models on resource-constrained edge devices. However, the performance of existing approaches is highly affected by runtime dynamics. For example, offloading workload from edge to cloud suffers from communication delays and the efficiency of progressive neural architecture supporting early-exit DNN executions relies on input characteristics. In this paper, we introduce EdgeML, an AutoML framework that provides flexible and fine-grained DNN model execution control by combining workload offloading mechanism and dynamic progressive neural architecture. To achieve desirable latency-accuracy-energy system performance on edge platforms, EdgeML adopts reinforcement learning to automatically update model execution policy in response to runtime dynamics in real-time. We implement EdgeML for several widely used DNN models on the latest edge devices. Comparing to existing approaches, our experiments show that EdgeML achieves up to 8x performance improvement under dynamic environments.

引用

页码：133 / 144

页数：12

共 41 条

[1] Real-Time Video Analytics: The Killer App for Edge Computing [J].

Ananthanarayanan, Ganesh ;

Bahl, Paramvir ;

Bodik, Peter ;

Chintalapudi, Krishna ;

Philipose, Matthai ;

Ravindranath, Lenin ;

Sinha, Sudipta .

COMPUTER, 2017, 50 (10) :58-67

[2]

[Anonymous], Apple

[3]

Chen GB, 2017, ADV NEUR IN, V30

[4] An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints [J].

Deb, Kalyanmoy ;

Jain, Himanshu .

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2014, 18 (04) :577-601

[5]

Dubout C, 2012, LECT NOTES COMPUT SC, V7574, P301, DOI 10.1007/978-3-642-33712-3_22

[6] FlexDNN: Input-Adaptive On-Device Deep Learning for Efficient Mobile Vision [J].

Fang, Biyi ;

Zeng, Xiao ;

Zhang, Faen ;

Xu, Hui ;

Zhang, Mi .

2020 IEEE/ACM SYMPOSIUM ON EDGE COMPUTING (SEC 2020), 2020, :84-95

[7] NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision [J].

Fang, Biyi ;

Zeng, Xiao ;

Zhang, Mi .

MOBICOM'18: PROCEEDINGS OF THE 24TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING, 2018, :115-127

[8]

Feurer M, 2015, ADV NEUR IN, V28

[9]

Gupta S, 2015, PR MACH LEARN RES, V37, P1737

[10]

Han S, 2016, Harvard Yenching Ins, V101, P123

← 1 2 3 4 5 →