Reinforcement learning and aggregation

被引:0
|
作者
Jiang, J [1 ]
Kamel, M [1 ]
Chen, L [1 ]
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
来源
2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7 | 2004年
关键词
reinforcement learning; multiagent systems; aggregation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) is a learning technique that provides a means for learning an optimal control policy when the dynamics of the environment under consideration is unavailable [7, 13]. While RL has been successfully applied in many, single or multiple agents systems [1, 3, 14, 10], the learning quality is greatly influenced by learning algorithms and their parameters. Setting of the parameters of RL algorithms is something of a black art, and small differences in these parameters can lead to large differences in learning qualities. Determining the best algorithm. and the optimal parameters can be costly in terms of time and computation. Even if the cost is acceptable, the robustness of learning is still. a question. In order to address the difficulty, an Aggregated Multiagent Reinforcement Learning System. (AMRLS) is proposed to deal with the RL environment as a multiagent environment. A maze world environment is used to validate the AMRLS. Experimental results illustrate that compared with normal Q(lambda)-learning and SARSA(lambda) algorithms, the AMRLS increases both the learning speed and the rate of reaching the shortest path.
引用
收藏
页码:1303 / 1308
页数:6
相关论文
共 50 条
  • [21] Reinforcement learning in a continuum of agents
    Adrian Šošić
    Abdelhak M. Zoubir
    Heinz Koeppl
    Swarm Intelligence, 2018, 12 : 23 - 51
  • [22] Reinforcement learning in a continuum of agents
    Sosic, Adrian
    Zoubir, Abdelhak M.
    Koeppl, Heinz
    SWARM INTELLIGENCE, 2018, 12 (01) : 23 - 51
  • [23] Feature Aggregation With Reinforcement Learning for Video-Based Person Re-Identification
    Zhang, Wei
    He, Xuanyu
    Lu, Weizhi
    Qiao, Hong
    Li, Yibin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (12) : 3847 - 3852
  • [24] A New Method For Personnel Selection Based On Ranking Aggregation Using A Reinforcement Learning Approach
    Filiberto, Yaima
    Bello, Rafael
    Nowe, Ann
    COMPUTACION Y SISTEMAS, 2018, 22 (02): : 537 - 546
  • [25] The Advance of Reinforcement Learning and Deep Reinforcement Learning
    Lyu, Le
    Shen, Yang
    Zhang, Sicheng
    2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 644 - 648
  • [26] From Reinforcement Learning to Deep Reinforcement Learning: An Overview
    Agostinelli, Forest
    Hocquet, Guillaume
    Singh, Sameer
    Baldi, Pierre
    BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 : 298 - 328
  • [27] Deep Reinforcement Learning-based SOH-aware Battery Management for DER Aggregation
    Nonaka, Shotaro
    Watari, Daichi
    Taniguchi, Ittetsu
    Onoye, Takao
    PROCEEDINGS OF THE 2022 THE 9TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2022, 2022, : 471 - 474
  • [28] A REINFORCEMENT LEARNING APPROACH FOR MULTIAGENT NAVIGATION
    Martinez-Gil, Francisco
    Barber, Fernando
    Lozano, Miguel
    Grimaldo, Francisco
    Fernandez, Fernando
    ICAART 2010: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1: ARTIFICIAL INTELLIGENCE, 2010, : 607 - 610
  • [29] Deep Reinforcement Learning for Cyber Security
    Thanh Thi Nguyen
    Reddi, Vijay Janapa
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 3779 - 3795
  • [30] A comprehensive survey of multiagent reinforcement learning
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): : 156 - 172