Cooperative Q-Learning Based on Learning Automata

Cited by: 0
Authors
Yang, Mao [1]
Tian, Yantao [1]
Qi, Xinyue [1]
Affiliation
[1] Jilin Univ, Sch Commun Engn, Changchun 130025, Peoples R China
Source
2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS (ICAL 2009), VOLS 1-3 | 2009
Keywords
multi-robot reinforcement learning; learning automata; Q-learning
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The theory of learning automata has already been applied to reinforcement learning in single-agent, single-stage settings. This paper proposes a multi-robot cooperative Q-learning algorithm based on learning automata. Each robot continually updates its action-selection probabilities through its learning automaton and then converts these probabilities into a special form of experience. By sharing this experience with one another, the robots accelerate the learning process. Simulation experiments verify the effectiveness of the algorithm.
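The abstract does not give the update rules, so the following is only a minimal sketch of the three ingredients it names, under stated assumptions: a standard linear reward-inaction automaton update for the action probabilities, the standard tabular Q-learning update, and a weighted average of Q-tables as a stand-in for experience sharing. The function names, the parameters (`rate`, `alpha`, `gamma`), and the choice of weights are hypothetical and are not taken from the paper.

```python
def lri_update(probs, action, reward, rate=0.1):
    """Linear reward-inaction update (assumed scheme, not from the paper).

    `reward` is in [0, 1]. The chosen action gains probability in
    proportion to the reward; all other actions shrink so the sum stays 1.
    """
    return [
        p + rate * reward * (1.0 - p) if a == action else p - rate * reward * p
        for a, p in enumerate(probs)
    ]


def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Standard tabular Q-learning update, applied in place to q[state][action]."""
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])


def share(q_tables, weights):
    """Weighted average of the robots' Q-tables.

    The weights are a hypothetical stand-in for the experience each robot
    derives from its automaton probabilities.
    """
    total = sum(weights)
    n_states = len(q_tables[0])
    n_actions = len(q_tables[0][0])
    return [
        [
            sum(w * q[s][a] for q, w in zip(q_tables, weights)) / total
            for a in range(n_actions)
        ]
        for s in range(n_states)
    ]


if __name__ == "__main__":
    # One robot, two actions: rewarding action 0 raises its probability.
    probs = lri_update([0.5, 0.5], action=0, reward=1.0, rate=0.2)
    print(probs)

    # Two robots with one-state Q-tables, merged with unequal weights.
    merged = share([[[1.0, 0.0]], [[3.0, 0.0]]], weights=[1.0, 3.0])
    print(merged)
```

One possible reading of "converting probability to experience" is to use each robot's largest action probability as its weight in `share`, so that more confident robots contribute more; the paper may define this differently.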
Pages: 1972-1977
Page count: 6