Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things

Times cited: 60

Authors:

Gu, Bo [1]

Zhang, Xu [1]

Lin, Ziqi [1]

Alazab, Mamoun [2]

Affiliations:

[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510275, Peoples R China

[2] Charles Darwin Univ, Coll Engn IT & Environm, Darwin, NT 0810, Australia

Source:

IEEE INTERNET OF THINGS JOURNAL | 2021 / Volume 8 / Issue 05

Keywords:

DDQN; deep reinforcement learning (DRL); delay critical; device-to-device (D2D); Internet of Things (IoT); spectral efficiency; POWER ALLOCATION; D2D COMMUNICATION; JOINT SUBCARRIER; CHANNEL; INTERFERENCE; ASSIGNMENT; NETWORKS;

DOI:

10.1109/JIOT.2020.3023111

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Ultrareliable and low-latency communication (URLLC) is a prerequisite for the successful implementation of the Internet of Controllable Things. In this article, we investigate the potential of deep reinforcement learning (DRL) for joint subcarrier-power allocation to achieve low latency and high reliability in a general form of device-to-device (D2D) networks, where each subcarrier can be allocated to multiple D2D pairs and each D2D pair is permitted to utilize multiple subcarriers. We first formulate the above problem as a Markov decision process and then propose a double deep Q-network (DQN)-based resource allocation algorithm to learn the optimal policy in the absence of full instantaneous channel state information (CSI). Specifically, each D2D pair acts as a learning agent that adjusts its own subcarrier-power allocation strategy iteratively through interactions with the operating environment in a trial-and-error fashion. Simulation results demonstrate that the proposed algorithm achieves near-optimal performance in real time. It is worth mentioning that the proposed algorithm is especially suitable for cases where the environmental dynamics are not accurate and the CSI delay cannot be ignored.
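As a rough illustration of the double deep Q-network idea named in the abstract, the following Python sketch computes a double DQN learning target for a single agent. It is not the authors' implementation: the abstract does not specify the state, action encoding, reward, or network architecture, so the function name `double_dqn_target`, the discount factor, and the Q-value lists are hypothetical stand-ins. In this sketch, each index of the Q-value lists would correspond to one candidate subcarrier-power choice of a D2D pair.

```python
from typing import Sequence


def double_dqn_target(reward: float,
                      gamma: float,
                      q_online_next: Sequence[float],
                      q_target_next: Sequence[float],
                      done: bool = False) -> float:
    """Return the double DQN target for one transition.

    Assumption: q_online_next and q_target_next hold the Q-values of the
    next state under the online and target networks, one entry per action
    (here, one hypothetical subcarrier-power choice per index).
    """
    if done:
        # No bootstrapping from a terminal state.
        return reward
    # The online network selects the greedy action ...
    best_action = max(range(len(q_online_next)),
                      key=lambda a: q_online_next[a])
    # ... and the target network evaluates it, which is what separates
    # double DQN from the plain DQN target max(q_target_next).
    return reward + gamma * q_target_next[best_action]


# Hypothetical numbers, for illustration only.
example_target = double_dqn_target(
    reward=1.0,
    gamma=0.9,
    q_online_next=[0.2, 0.8, 0.5],
    q_target_next=[1.0, 0.4, 2.0],
)
```

With these made-up values the online network prefers action 1, so the target is approximately 1 + 0.9 × 0.4 = 1.36, whereas a plain DQN target would bootstrap from the larger value 2.0. Decoupling action selection from action evaluation in this way is the standard motivation for double DQN, since it reduces overestimation of Q-values.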


Pages: 3066-3074

Number of pages: 9

References: 31 in total

[1] TrustE-VC: Trustworthy Evaluation Framework for Industrial Connected Vehicles in the Cloud [J]. Aladwan, Mohammad N.; Awaysheh, Feras M.; Alawadi, Sadi; Alazab, Mamoun; Pena, Tomas F.; Cabaleiro, Jose C. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (09): 6203-6213

[2] Alazab Mamoun, 2014, Journal of Networks, V9, P2878, DOI 10.4304/jnw.9.11.2878-2891

[3] [Anonymous], 2018, CISC VIS NETW IND GL

[4] An Autonomous Learning-Based Algorithm for Joint Channel and Power Level Selection by D2D Pairs in Heterogeneous Cellular Networks [J]. Asheralieva, Alia; Miyanaga, Yoshikazu. IEEE TRANSACTIONS ON COMMUNICATIONS, 2016, 64 (09): 3996-4012

[5] Analytical Modeling of Resource Allocation in D2D Overlaying Multihop Multichannel Uplink Cellular Networks [J]. Dai, Jiahao; Liu, Jiajia; Shi, Yongpeng; Zhang, Shubin; Ma, Jianfeng. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (08): 6633-6644

[6] Message Passing Based Distributed Learning for Joint Resource Allocation in Millimeter Wave Heterogeneous Networks [J]. Fan, Yawen; Zhang, Zhiyang; Li, Husheng. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (05): 2872-2885

[7] Joint Power Allocation and Channel Assignment for NOMA With Deep Reinforcement Learning [J]. He, Chaofan; Hu, Yang; Chen, Yan; Zeng, Bing. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (10): 2200-2210

[8] Distributed Interference and Delay Aware Design for D2D Communication in Large Wireless Networks With Adaptive Interference Estimation [J]. Huang, Sheng; Liang, Ben; Li, Jiandong. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2017, 16 (06): 3924-3939

[9] Energy-Efficient Joint Resource Allocation and Power Control for D2D Communications [J]. Jiang, Yanxiang; Liu, Qiang; Zheng, Fuchun; Gao, Xiqi; You, Xiaohu. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (08): 6119-6127

[10] Joint Subcarrier Assignment With Power Allocation for Sum Rate Maximization of D2D Communications in Wireless Cellular Networks [J]. Kai, Caihong; Li, Hui; Xu, Lei; Li, Yuzhou; Jiang, Tao. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05): 4748-4759
