Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things

被引:56
作者
Gu, Bo [1 ]
Zhang, Xu [1 ]
Lin, Ziqi [1 ]
Alazab, Mamoun [2 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510275, Peoples R China
[2] Charles Darwin Univ, Coll Engn IT & Environm, Darwin, NT 0810, Australia
关键词
DDQN; deep reinforcement learning (DRL); delay critical; device-to-device (D2D); Internet of Things (IoT); spectral efficiency; POWER ALLOCATION; D2D COMMUNICATION; JOINT SUBCARRIER; CHANNEL; INTERFERENCE; ASSIGNMENT; NETWORKS;
D O I
10.1109/JIOT.2020.3023111
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ultrareliable and low-latency communication (URLLC) is a prerequisite for the successful implementation of the Internet of Controllable Things. In this article, we investigate the potential of deep reinforcement learning (DRL) for joint subcarrier-power allocation to achieve low latency and high reliability in a general form of device-to-device (D2D) networks, where each subcarrier can be allocated to multiple D2D pairs and each D2D pair is permitted to utilize multiple subcarriers. We first formulate the above problem as a Markov decision process and then propose a double deep Q-network (DQN)-based resource allocation algorithm to learn the optimal policy in the absence of full instantaneous channel state information (CSI). Specifically, each D2D pair acts as a learning agent that adjusts its own subcarrier-power allocation strategy iteratively through interactions with the operating environment in a trial-and-error fashion. Simulation results demonstrate that the proposed algorithm achieves near-optimal performance in real time. It is worth mentioning that the proposed algorithm is especially suitable for cases where the environmental dynamics are not accurate and the CSI delay cannot be ignored.
引用
收藏
页码:3066 / 3074
页数:9
相关论文
共 31 条
  • [1] TrustE-VC: Trustworthy Evaluation Framework for Industrial Connected Vehicles in the Cloud
    Aladwan, Mohammad N.
    Awaysheh, Feras M.
    Alawadi, Sadi
    Alazab, Mamoun
    Pena, Tomas F.
    Cabaleiro, Jose C.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (09) : 6203 - 6213
  • [2] Alazab Mamoun, 2014, Journal of Networks, V9, P2878, DOI 10.4304/jnw.9.11.2878-2891
  • [3] [Anonymous], 2018, CISC VIS NETW IND GL
  • [4] An Autonomous Learning-Based Algorithm for Joint Channel and Power Level Selection by D2D Pairs in Heterogeneous Cellular Networks
    Asheralieva, Alia
    Miyanaga, Yoshikazu
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2016, 64 (09) : 3996 - 4012
  • [5] Analytical Modeling of Resource Allocation in D2D Overlaying Multihop Multichannel Uplink Cellular Networks
    Dai, Jiahao
    Liu, Jiajia
    Shi, Yongpeng
    Zhang, Shubin
    Ma, Jianfeng
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (08) : 6633 - 6644
  • [6] Message Passing Based Distributed Learning for Joint Resource Allocation in Millimeter Wave Heterogeneous Networks
    Fan, Yawen
    Zhang, Zhiyang
    Li, Husheng
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (05) : 2872 - 2885
  • [7] Joint Power Allocation and Channel Assignment for NOMA With Deep Reinforcement Learning
    He, Chaofan
    Hu, Yang
    Chen, Yan
    Zeng, Bing
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (10) : 2200 - 2210
  • [8] Distributed Interference and Delay Aware Design for D2D Communication in Large Wireless Networks With Adaptive Interference Estimation
    Huang, Sheng
    Liang, Ben
    Li, Jiandong
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2017, 16 (06) : 3924 - 3939
  • [9] Energy-Efficient Joint Resource Allocation and Power Control for D2D Communications
    Jiang, Yanxiang
    Liu, Qiang
    Zheng, Fuchun
    Gao, Xiqi
    You, Xiaohu
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (08) : 6119 - 6127
  • [10] Joint Subcarrier Assignment With Power Allocation for Sum Rate Maximization of D2D Communications in Wireless Cellular Networks
    Kai, Caihong
    Li, Hui
    Xu, Lei
    Li, Yuzhou
    Jiang, Tao
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) : 4748 - 4759