Reinforcement learning and aggregation

被引:0
|
作者
Jiang, J [1 ]
Kamel, M [1 ]
Chen, L [1 ]
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
来源
2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7 | 2004年
关键词
reinforcement learning; multiagent systems; aggregation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) is a learning technique that provides a means for learning an optimal control policy when the dynamics of the environment under consideration is unavailable [7, 13]. While RL has been successfully applied in many, single or multiple agents systems [1, 3, 14, 10], the learning quality is greatly influenced by learning algorithms and their parameters. Setting of the parameters of RL algorithms is something of a black art, and small differences in these parameters can lead to large differences in learning qualities. Determining the best algorithm. and the optimal parameters can be costly in terms of time and computation. Even if the cost is acceptable, the robustness of learning is still. a question. In order to address the difficulty, an Aggregated Multiagent Reinforcement Learning System. (AMRLS) is proposed to deal with the RL environment as a multiagent environment. A maze world environment is used to validate the AMRLS. Experimental results illustrate that compared with normal Q(lambda)-learning and SARSA(lambda) algorithms, the AMRLS increases both the learning speed and the rate of reaching the shortest path.
引用
收藏
页码:1303 / 1308
页数:6
相关论文
共 50 条
  • [41] Generalized learning automata for multi-agent reinforcement learning
    De Hauwere, Yann-Michael
    Vrancx, Peter
    Nowe, Ann
    AI COMMUNICATIONS, 2010, 23 (04) : 311 - 324
  • [42] Reinforcement learning
    Yatawatta, S.
    ASTRONOMY AND COMPUTING, 2024, 48
  • [43] Data-Driven Flight Control of Internet-of-Drones for Sensor Data Aggregation Using Multi-Agent Deep Reinforcement Learning
    Li, Kai
    Ni, Wei
    Emami, Yousef
    Dressler, Falko
    IEEE WIRELESS COMMUNICATIONS, 2022, 29 (04) : 18 - 23
  • [44] A Survey on Reinforcement Learning and Deep Reinforcement Learning for Recommender Systems
    Rezaei, Mehrdad
    Tabrizi, Nasseh
    DEEP LEARNING THEORY AND APPLICATIONS, DELTA 2023, 2023, 1875 : 385 - 402
  • [45] Observational Learning by Reinforcement Learning
    Borsa, Diana
    Heess, Nicolas
    Piot, Bilal
    Liu, Siqi
    Hasenclever, Leonard
    Munos, Remi
    Pietquin, Olivier
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1117 - 1124
  • [46] Curriculum Learning in Reinforcement Learning
    Narvekar, Sanmit
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1528 - 1529
  • [47] A Novel Data Aggregation Mechanism using Reinforcement Learning for Cluster Heads in Wireless Multimedia Sensor Networks
    Uddin J.
    Annals of Emerging Technologies in Computing, 2022, 6 (03) : 69 - 78
  • [48] Efficient Communication via Self-Supervised Information Aggregation for Online and Offline Multiagent Reinforcement Learning
    Guan, Cong
    Chen, Feng
    Yuan, Lei
    Zhang, Zongzhang
    Yu, Yang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [49] Scheduling in Multiagent Systems Using Reinforcement Learning
    I. K. Minashina
    R. A. Gorbachev
    E. M. Zakharova
    Doklady Mathematics, 2022, 106 : S70 - S78
  • [50] Reinforcement Learning in the Multi-Robot Domain
    Maja J. Matarić
    Autonomous Robots, 1997, 4 : 73 - 83