Reinforcement learning models for scheduling in wireless networks

Cited by: 10

Authors:

Yau, Kok-Lim Alvin [1]

Kwong, Kae Hsiang [2]

Shen, Chong [3]

Affiliations:

[1] Sunway Univ, Fac Sci & Technol, Selangor 46150, Malaysia

[2] Recovision, R&D Dept, Selangor 47650, Malaysia

[3] Hainan Univ, Coll Informat Sci & Technol, Haikou 570228, Peoples R China

Source:

FRONTIERS OF COMPUTER SCIENCE | 2013 / Vol. 7 / Issue 05

Keywords:

reinforcement learning; scheduling; wireless networks; DELAY;

DOI:

10.1007/s11704-013-2291-3

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

The dynamicity of available resources and network conditions, such as channel capacity and traffic characteristics, has posed major challenges to scheduling in wireless networks. Reinforcement learning (RL) enables wireless nodes to observe their respective operating environment, learn, and make optimal or near-optimal scheduling decisions. Learning, which is the main intrinsic characteristic of RL, enables wireless nodes to adapt to most forms of dynamicity in the operating environment as time goes by. This paper presents an extensive review of the application of the traditional and enhanced RL approaches to various types of scheduling schemes, namely packet, sleep-wake and task schedulers, in wireless networks, as well as the advantages and performance enhancements brought about by RL. Additionally, it presents how various challenges associated with scheduling schemes have been approached using RL. Finally, we discuss various open issues related to RL-based scheduling schemes in wireless networks in order to explore new research directions in this area. Discussions in this paper are presented in a tutorial manner in order to establish a foundation for further research in this field.

Pages: 754-766

Number of pages: 13
