Multi-agent reinforcement learning for cost-aware collaborative task execution in energy-harvesting D2D networks

被引:29
作者
Huang, Binbin [1 ]
Liu, Xiao [2 ]
Wang, Shangguang [3 ]
Pan, Linxuan [4 ]
Chang, Victor [5 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing, Peoples R China
[4] Nanjing Univ, Software Inst, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[5] Teesside Univ, Sch Comp Engn & Digital Technol, Artificial Intelligence & Informat Syst Res Grp, Middlesbrough, Cleveland, England
基金
美国国家科学基金会;
关键词
D2D networks; collaborative task execution; cost-aware; partially observable Markov decision process; multi-agent deep deterministic policy gradient; DEVICE COMMUNICATION; RESOURCE-ALLOCATION; EDGE; ASSIGNMENT; RADIO; UAV;
D O I
10.1016/j.comnet.2021.108176
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In device-to-device (D2D) networks, multiple resource-limited mobile devices cooperate with one another to execute computation tasks. As the battery capacity of mobile devices is limited, the computation tasks running on the mobile devices will terminate once the battery is dead. In order to achieve sustainable computation, energyharvesting technology has been introduced into D2D networks. At present, how to make multiple energy harvesting mobile devices work collaboratively to minimize the long-term system cost for task execution under limited computing, network and battery capacity constraint is a challenging issue. To deal with such a challenge, in this paper, we design a multi-agent deep deterministic policy gradient (MADDPG) based cost-aware collaborative task-execution (CACTE) scheme in energy harvesting D2D (EH-D2D) networks. To validate the CACTE scheme's performance, we conducted extensive experiments to compare the CACTE scheme with four baseline algorithms, including Local, Random, ECLB (Energy Capacity Load Balance) and CCLB (Computing Capacity Load Balance). Experiments were accompanied by various system parameters, such as the mobile device's battery capacity, task workload, the bandwidth and so on. The experimental results show that the CACTE scheme can make multiple mobile devices cooperate effectively with one another to execute many more tasks and achieve a higher long-term reward, including lower task latency and fewer dropped tasks.
引用
收藏
页数:14
相关论文
共 36 条
[1]  
[Anonymous], ARXIV170306182
[2]  
[Anonymous], 2017, ARXIV170208887
[3]   EXPLOITING MASSIVE D2D COLLABORATION FOR ENERGY-EFFICIENT MOBILE EDGE COMPUTING [J].
Chen, Xu ;
Pu, Lingjun ;
Gao, Lin ;
Wu, Weigang ;
Wu, Di .
IEEE WIRELESS COMMUNICATIONS, 2017, 24 (04) :64-71
[4]   Dynamical Resource Allocation in Edge for Trustable Internet-of-Things Systems: A Reinforcement Learning Method [J].
Deng, Shuiguang ;
Xiang, Zhengzhe ;
Zhao, Peng ;
Taheri, Javid ;
Gao, Honghao ;
Yin, Jianwei ;
Zomaya, Albert Y. .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (09) :6103-6113
[5]  
Du J., 2018, 2018 IEEE INT C COMM, P1, DOI DOI 10.1109/ICC.2018.8422776
[6]   V2VR: Reliable Hybrid-Network-Oriented V2V Data Transmission and Routing Considering RSUs and Connectivity Probability [J].
Gao, Honghao ;
Liu, Can ;
Li, Youhuizi ;
Yang, Xiaoxian .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) :3533-3546
[7]   A Truthful Online Mechanism for Collaborative Computation Offloading in Mobile Edge Computing [J].
He, Junyi ;
Zhang, Di ;
Zhou, Yuezhi ;
Zhang, Yaoxue .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (07) :4832-4841
[8]   D2D Communications Meet Mobile Edge Computing for Enhanced Computation Capacity in Cellular Networks [J].
He, Yinghui ;
Ren, Jinke ;
Yu, Guanding ;
Cai, Yunlong .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (03) :1750-1763
[9]   Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks [J].
Huang, Liang ;
Bi, Suzhi ;
Zhang, Ying-Jun Angela .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2020, 19 (11) :2581-2593
[10]   NOMA-Aided Mobile Edge Computing via User Cooperation [J].
Huang, Yuwen ;
Liu, Yuan ;
Chen, Fangjiong .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (04) :2221-2235