Reinforcement learning and aggregation

Citations: 0

Authors:

Jiang, J [1]

Kamel, M [1]

Chen, L [1]

Affiliation:

[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada

Source:

2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7 | 2004

Keywords:

reinforcement learning; multiagent systems; aggregation;

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Reinforcement learning (RL) is a learning technique that provides a means of learning an optimal control policy when the dynamics of the environment under consideration are unavailable [7, 13]. While RL has been successfully applied in many single-agent and multiagent systems [1, 3, 14, 10], the learning quality is greatly influenced by the learning algorithm and its parameters. Setting the parameters of RL algorithms is something of a black art, and small differences in these parameters can lead to large differences in learning quality. Determining the best algorithm and the optimal parameters can be costly in terms of time and computation. Even if the cost is acceptable, the robustness of learning is still in question. To address this difficulty, an Aggregated Multiagent Reinforcement Learning System (AMRLS) is proposed, which treats the RL environment as a multiagent environment. A maze world environment is used to validate the AMRLS. Experimental results illustrate that, compared with normal Q(lambda)-learning and SARSA(lambda) algorithms, the AMRLS increases both the learning speed and the rate of reaching the shortest path.
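
The abstract does not say how the AMRLS combines its learners, so the following is only a minimal sketch of one possible aggregation step, assuming a plurality vote over the greedy actions of several tabular learners. The names greedy_action, aggregate_action and q_tables, the state label "s0", and the action values are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def greedy_action(q_row):
    """Return the index of the highest-valued action (lowest index wins ties)."""
    return max(range(len(q_row)), key=lambda a: q_row[a])


def aggregate_action(q_tables, state):
    """Plurality vote over the greedy actions of several learners.

    q_tables: list of dicts mapping a state to a list of action values.
    Ties between actions are broken in favour of the lowest action index.
    This voting rule is an illustrative assumption, not the paper's method.
    """
    votes = Counter(greedy_action(q[state]) for q in q_tables)
    best_count = max(votes.values())
    return min(a for a, c in votes.items() if c == best_count)


# Three hypothetical learners (e.g. trained with different parameters)
# holding made-up action values for one maze cell with four moves.
q_tables = [
    {"s0": [0.1, 0.9, 0.0, 0.2]},
    {"s0": [0.3, 0.8, 0.1, 0.0]},
    {"s0": [0.7, 0.2, 0.1, 0.0]},
]
chosen = aggregate_action(q_tables, "s0")
```

In this sketch two of the three learners prefer action 1, so the aggregated choice follows the majority even though one learner disagrees. That is the kind of robustness to a single poorly tuned learner that the abstract argues for.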

Pages: 1303 - 1308

Number of pages: 6

50 items in total

[21] Reinforcement learning in a continuum of agents
Adrian Šošić
Abdelhak M. Zoubir
Heinz Koeppl
Swarm Intelligence, 2018, 12 : 23 - 51
[22] Reinforcement learning in a continuum of agents
Sosic, Adrian
Zoubir, Abdelhak M.
Koeppl, Heinz
SWARM INTELLIGENCE, 2018, 12 (01) : 23 - 51
[23] Feature Aggregation With Reinforcement Learning for Video-Based Person Re-Identification
Zhang, Wei
He, Xuanyu
Lu, Weizhi
Qiao, Hong
Li, Yibin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (12) : 3847 - 3852
[24] A New Method For Personnel Selection Based On Ranking Aggregation Using A Reinforcement Learning Approach
Filiberto, Yaima
Bello, Rafael
Nowe, Ann
COMPUTACION Y SISTEMAS, 2018, 22 (02) : 537 - 546
[25] The Advance of Reinforcement Learning and Deep Reinforcement Learning
Lyu, Le
Shen, Yang
Zhang, Sicheng
2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 644 - 648
[26] From Reinforcement Learning to Deep Reinforcement Learning: An Overview
Agostinelli, Forest
Hocquet, Guillaume
Singh, Sameer
Baldi, Pierre
BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 : 298 - 328
[27] Deep Reinforcement Learning-based SOH-aware Battery Management for DER Aggregation
Nonaka, Shotaro
Watari, Daichi
Taniguchi, Ittetsu
Onoye, Takao
PROCEEDINGS OF THE 2022 THE 9TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2022, 2022, : 471 - 474
[28] A REINFORCEMENT LEARNING APPROACH FOR MULTIAGENT NAVIGATION
Martinez-Gil, Francisco
Barber, Fernando
Lozano, Miguel
Grimaldo, Francisco
Fernandez, Fernando
ICAART 2010: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1: ARTIFICIAL INTELLIGENCE, 2010, : 607 - 610
[29] Deep Reinforcement Learning for Cyber Security
Thanh Thi Nguyen
Reddi, Vijay Janapa
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 3779 - 3795
[30] A comprehensive survey of multiagent reinforcement learning
Busoniu, Lucian
Babuska, Robert
De Schutter, Bart
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) : 156 - 172
