Cooperative Q-Learning Based on Learning Automata

被引：0

作者：

Yang, Mao ^{[1
]}

Tian, Yantao ^{[1
]}

Qi, Xinyue ^{[1
]}

机构：

[1] Jilin Univ, Sch Commun Engn, Changchun 130025, Peoples R China

来源：

2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3 | 2009年

关键词：

multi-robot reinforcement learning; learning automata; Q-learning;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The theory of learning automata has already been applied in reinforcement learning which is characterized by single-agent and single-stage. This paper proposed a multi-robot cooperative Q-learning algorithm based on learning automata. Each robot updates probability for action selection through the learning automata constantly, and then converts the probability to special experience. Robots can accelerate the learning process by means of sharing experiences among each other. Simulation experiments verify the effectiveness of this algorithm.

引用

页码：1972 / 1977

页数：6

共 50 条

[1] Learning Automata Based Q-Learning for Content Placement in Cooperative Caching
Yang, Zhong
Liu, Yuanwei
Chen, Yue
Jiao, Lei
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (06) : 3667 - 3680
[2] Expertness based cooperative Q-learning
Ahmadabadi, MN
Asadpour, M
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01): : 66 - 76
[3] Cooperative Q-Learning Based on Maturity of the Policy
Yang, Mao
Tian, Yantao
Liu, Xiaomei
2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 1352 - 1356
[4] EFFECTS OF COMMUNICATION IN COOPERATIVE Q-LEARNING
Darbyshire, Paul
Wang, Dianhui
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (05): : 2113 - 2126
[5] Multi-criteria expertness based cooperative Q-learning
Esmat Pakizeh
Maziar Palhang
Mir Mohsen Pedram
Applied Intelligence, 2013, 39 : 28 - 40
[6] Multi-criteria expertness based cooperative Q-learning
Pakizeh, Esmat
Palhang, Maziar
Pedram, Mir Mohsen
APPLIED INTELLIGENCE, 2013, 39 (01) : 28 - 40
[7] Cooperative Q-learning: the knowledge sharing issue
Ahmadabadi, MN
Asadpour, M
Nakano, E
ADVANCED ROBOTICS, 2001, 15 (08) : 815 - 832
[8] Cooperative Q-learning based channel selection for cognitive radio networks
Feten Slimeni
Zied Chtourou
Bart Scheers
Vincent Le Nir
Rabah Attia
Wireless Networks, 2019, 25 : 4161 - 4171
[9] Cooperative pursuit with multiple pursuers based on Deep Minimax Q-learning
Ji, Mengda
Xu, Genjiu
Duan, Zekun
Wang, Liying
Li, Zesheng
Ge, Jianjun
Li, Mingqiang
AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 146
[10] Cooperative strategy based on adaptive Q-learning for robot soccer systems
Hwang, KS
Tan, SW
Chen, CC
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (04) : 569 - 576

← 1 2 3 4 5 →