Topology-Transparent Scheduling Based on Reinforcement Learning in Self-Organized Wireless Networks

被引：8

作者：

Qiao, Mu ^{[1
]}

Zhao, Haitao ^{[1
]}

Zhou, Li ^{[1
]}

Zhu, Chunsheng ^{[2
]}

Huang, Shengchun ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Elect Sci, Changsha 410073, Hunan, Peoples R China

[2] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada

来源：

IEEE ACCESS | 2018年 / 6卷

基金：

中国国家自然科学基金;

关键词：

Topology-transparent scheduling; reinforcement learning; collision avoidance; redundant slot utilization;

D O I：

10.1109/ACCESS.2018.2823725

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Topology-transparent scheduling policies do not require the maintenance of accurate network topology information and therefore are suitable for highly dynamic scenarios in self-organized wireless networks. However, in topology-transparent scheduling, it is a very challenging problem to make individual nodes efficiently select their transmission slots in a distributed manner. It is desirable for individual nodes, through time slot selection, to avoid collision on the one hand and utilize as many time slots as possible (i.e., minimize the number of redundant slots) on the other. In this paper, learning-based approaches are employed to solve the time slot scheduling problem. Specifically, the proposed method uses a temporal difference learning approach to address the collision issue and use a stochastic gradient descent approach to reduce the number of redundant slots. Unlike previous works, this learning approach is trained through self-play reinforcement learning without incurring communication overhead for the exchange of reservation information, thereby improving the network throughput. Extensive simulation results validate that our proposal can achieve better efficiency than the existing approaches.

引用

页码：20221 / 20230

页数：10

共 50 条

[21] Self-organized task allocation between reinforcement learning robots and a human partner [J].

Yasuda, Toshiyuki ;

Nomura, Soichiro ;

Ohkura, Kazuhiro .

International Journal of Advancements in Computing Technology, 2012, 4 (22) :230-238

[22] Multi-agent reinforcement learning based dynamic self-coordinated topology optimization for wireless mesh networks [J].

Tang, Qingwei ;

Sun, Wei ;

Liu, Zhi ;

Li, Qiyue ;

Yuan, Xiaohui .

JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2025, 239

[23] Deep Reinforcement Learning for Resource Constrained Multiclass Scheduling in Wireless Networks [J].

Avranas, Apostolos ;

Ciblat, Philippe ;

Kountouris, Marios .

IEEE TRANSACTIONS ON MACHINE LEARNING IN COMMUNICATIONS AND NETWORKING, 2023, 1 :225-241

[24] Energy-aware Task Scheduling in Wireless Sensor Networks based on Cooperative Reinforcement Learning [J].

Khan, Muhidul Islam ;

Rinner, Bernhard .

2014 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC), 2014, :871-877

[25] A Self-Adaptive Routing Paradigm for Wireless Mesh Networks Based on Reinforcement Learning [J].

Nurchis, Maddalena ;

Bruno, Raffaele ;

Conti, Marco ;

Lenzini, Luciano .

MSWIM 11: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON MODELING, ANALYSIS, AND SIMULATION OF WIRELESS AND MOBILE SYSTEMS, 2011, :197-204

[26] Reinforcement learning based routing in wireless mesh networks [J].

Boushaba, Mustapha ;

Hafid, Abdelhakim ;

Belbekkouche, Abdeltouab ;

Gendreau, Michel .

WIRELESS NETWORKS, 2013, 19 (08) :2079-2091

[27] Differentiated Service Based on Reinforcement Learning in Wireless Networks [J].

Bourenane, Malika .

INTELLIGENT INFORMATICS, 2013, 182 :417-424

[28] A Reinforcement Learning Based Data Caching in Wireless Networks [J].

Sheraz, Muhammad ;

Shafique, Shahryar ;

Imran, Sohail ;

Asif, Muhammad ;

Ullah, Rizwan ;

Ibrar, Muhammad ;

Khan, Jahanzeb ;

Wuttisittikulkij, Lunchakorn .

APPLIED SCIENCES-BASEL, 2022, 12 (11)

[29] Reinforcement learning based routing in wireless mesh networks [J].

Mustapha Boushaba ;

Abdelhakim Hafid ;

Abdeltouab Belbekkouche ;

Michel Gendreau .

Wireless Networks, 2013, 19 :2079-2091

[30] Decentralised reinforcement learning for energy-efficient scheduling in wireless sensor networks [J].

Mihaylov, Mihail ;

Le Borgne, Yann-Ael .

INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2012, 9 (3-4) :207-224

← 1 2 3 4 5 →