Reinforcement Learning Based Energy-Efficient Collaborative Inference for Mobile Edge Computing

Cited by: 38

Authors:

Xiao, Yilin [1,2,3]

Xiao, Liang [1]

Wan, Kunpeng [1]

Yang, Helin [1]

Zhang, Yi [1]

Wu, Yi [4]

Zhang, Yanyong [5]

Affiliations:

[1] Xiamen Univ, Dept Informat & Commun Engn, Xiamen 361005, Peoples R China

[2] Shenzhen Inst Artificial Intelligence, Shenzhen 518129, Peoples R China

[3] Robot Soc, Shenzhen 518129, Peoples R China

[4] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350117, Peoples R China

[5] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Peoples R China

Source:

IEEE TRANSACTIONS ON COMMUNICATIONS | 2023 / Vol. 71 / No. 02

Funding:

National Natural Science Foundation of China;

Keywords:

Mobile handsets; Collaboration; Servers; Energy consumption; Performance evaluation; Deep learning; Computational modeling; Collaborative inference; computation partition; mobile edge computing; multi-agent reinforcement learning; RESOURCE-ALLOCATION; COMPUTATION; IOT; NETWORKS; STORAGE; MEC;

DOI:

10.1109/TCOMM.2022.3229033

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Collaborative inference in mobile edge computing (MEC) enables mobile devices to offload the computation tasks for the computation-intensive perception services, and the inference policy determines the inference latency and energy consumption. The optimal inference policy depends on the inference performance model of deep learning, the data generation model and the network model that are rarely known by mobile devices in time. In this paper, we propose a multi-agent reinforcement learning (RL) based energy-efficient MEC collaborative inference scheme, which enables each mobile device to choose both the partition point of deep learning and the collaborative edge of each mobile device based on the image quantity, the channel conditions and the previous inference performance. A learning experience exchange mechanism exploits the Q-values of the neighboring mobile devices to accelerate the inference policy optimization with less energy consumption. We also provide a deep multi-agent RL based inference scheme to accelerate learning for large-scale MEC networks, in which an actor network yields the collaborative inference policy probability distribution and a critic network guides the weight update of the actor network to enhance sample efficiency. We provide the inference performance bound and analyze the computational complexity. Both simulation and experimental results show that our proposed schemes reduce the inference latency and save the MEC energy consumption.
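The decision loop outlined in the abstract can be illustrated with a minimal tabular sketch. This is not the authors' scheme: the state encoding, the utility function, the blending rule in `absorb_neighbors`, and every name below are assumptions made only for illustration. Each agent picks a (partition point, edge) pair with epsilon-greedy Q-learning and can mix in the Q-values of neighboring agents.

```python
import random
from collections import defaultdict
from itertools import product


class InferenceAgent:
    """Tabular Q-learning agent for one mobile device (illustrative only)."""

    def __init__(self, num_partition_points, num_edges,
                 alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
        # An action is a (partition point, edge index) pair.
        self.actions = list(product(range(num_partition_points),
                                    range(num_edges)))
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)  # (state, action) -> Q-value
        self.rng = random.Random(seed)

    def choose(self, state):
        """Epsilon-greedy choice over (partition point, edge) pairs."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, utility_value, next_state):
        """Standard one-step Q-learning update."""
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        target = utility_value + self.gamma * best_next
        old = self.q[(state, action)]
        self.q[(state, action)] = old + self.alpha * (target - old)

    def absorb_neighbors(self, state, neighbors, weight=0.2):
        """Hypothetical exchange rule: blend in the neighbors' mean Q-values."""
        if not neighbors:
            return
        for a in self.actions:
            mean_q = sum(n.q[(state, a)] for n in neighbors) / len(neighbors)
            self.q[(state, a)] = ((1 - weight) * self.q[(state, a)]
                                  + weight * mean_q)


def utility(latency_s, energy_j, energy_weight=0.5):
    """Assumed utility: negative weighted sum of latency and energy."""
    return -(latency_s + energy_weight * energy_j)
```

A state here could be any hashable summary of the observations named in the abstract, for example a tuple of a discretized image quantity and a channel-quality level. The deep actor-critic variant mentioned in the abstract would replace the table with neural networks and is not sketched here.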


Pages: 864-876

Page count: 13
