Topology-Transparent Scheduling Based on Reinforcement Learning in Self-Organized Wireless Networks

被引:8
作者
Qiao, Mu [1 ]
Zhao, Haitao [1 ]
Zhou, Li [1 ]
Zhu, Chunsheng [2 ]
Huang, Shengchun [1 ]
机构
[1] Natl Univ Def Technol, Coll Elect Sci, Changsha 410073, Hunan, Peoples R China
[2] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Topology-transparent scheduling; reinforcement learning; collision avoidance; redundant slot utilization;
D O I
10.1109/ACCESS.2018.2823725
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Topology-transparent scheduling policies do not require the maintenance of accurate network topology information and therefore are suitable for highly dynamic scenarios in self-organized wireless networks. However, in topology-transparent scheduling, it is a very challenging problem to make individual nodes efficiently select their transmission slots in a distributed manner. It is desirable for individual nodes, through time slot selection, to avoid collision on the one hand and utilize as many time slots as possible (i.e., minimize the number of redundant slots) on the other. In this paper, learning-based approaches are employed to solve the time slot scheduling problem. Specifically, the proposed method uses a temporal difference learning approach to address the collision issue and use a stochastic gradient descent approach to reduce the number of redundant slots. Unlike previous works, this learning approach is trained through self-play reinforcement learning without incurring communication overhead for the exchange of reservation information, thereby improving the network throughput. Extensive simulation results validate that our proposal can achieve better efficiency than the existing approaches.
引用
收藏
页码:20221 / 20230
页数:10
相关论文
共 36 条
  • [1] A Novel Model for the Time Dependent Competitive Vehicle Routing Problem: Modified Random Topology Particle Swarm Optimization
    Alinaghian, M.
    Ghazanfari, M.
    Norouzi, N.
    Nouralizadeh, H.
    [J]. NETWORKS & SPATIAL ECONOMICS, 2017, 17 (04) : 1185 - 1211
  • [2] [Anonymous], 2015, ARXIV PREPRINT ARXIV
  • [3] [Anonymous], 1994, Adv. Neural Inf. Process. Syst
  • [4] [Anonymous], 2014, Advances in Neural Information Processing Systems
  • [5] Topology-transparent time division multiple access broadcast scheduling in multihop packet radio-networks
    Cai, ZJ
    Lu, M
    Georghiades, CN
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2003, 52 (04) : 970 - 984
  • [6] Chaonong Xu, 2010, Wireless Sensor Network, V2, P801, DOI 10.4236/wsn.2010.210096
  • [7] MAKING TRANSMISSION SCHEDULES IMMUNE TO TOPOLOGY CHANGES IN MULTIHOP PACKET RADIO NETWORKS
    CHLAMTAC, I
    FARAGO, A
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 1994, 2 (01) : 23 - 29
  • [8] Foerster J., 2017, P INT C MACH LEARN S, P1
  • [9] Understanding Application-Battery Interactions on Smartphones: A Large-Scale Empirical Study
    Guo, Yao
    Wang, Chengke
    Chen, Xiangqun
    [J]. IEEE ACCESS, 2017, 5 : 13387 - 13400
  • [10] A Survey on Mobile Social Networks: Applications, Platforms, System Architectures, and Future Research Directions
    Hu, Xiping
    Chu, Terry H. S.
    Leung, Victor C. M.
    Ngai, Edith C. -H.
    Kruchten, Philippe
    Chan, Henry C. B.
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (03) : 1557 - 1581