Reinforcement learning and aggregation

Cited by: 0

Authors:

Jiang, J [1]

Kamel, M [1]

Chen, L [1]

Affiliation:

[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada

Source:

2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7 | 2004

Keywords:

reinforcement learning; multiagent systems; aggregation;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Reinforcement learning (RL) is a learning technique that provides a means of learning an optimal control policy when the dynamics of the environment under consideration are unavailable [7, 13]. While RL has been successfully applied in many single-agent and multiagent systems [1, 3, 14, 10], the learning quality is greatly influenced by the learning algorithm and its parameters. Setting the parameters of RL algorithms is something of a black art, and small differences in these parameters can lead to large differences in learning quality. Determining the best algorithm and the optimal parameters can be costly in terms of time and computation. Even if the cost is acceptable, the robustness of learning is still in question. To address this difficulty, an Aggregated Multiagent Reinforcement Learning System (AMRLS) is proposed, which treats the RL environment as a multiagent environment. A maze world environment is used to validate the AMRLS. Experimental results illustrate that, compared with normal Q(λ)-learning and SARSA(λ) algorithms, the AMRLS increases both the learning speed and the rate of reaching the shortest path.
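The abstract does not say how the AMRLS combines its learners, so the following is only a minimal sketch of the general idea, not the authors' system. It assumes a hypothetical 4x4 maze, plain one-step tabular Q-learning (not the Q(λ) or SARSA(λ) variants named above), hypothetical parameter settings, and a simple majority vote as the aggregation rule. All names and values in it are illustrative assumptions.

```python
import random

# Hypothetical 4x4 maze (layout is an assumption): 'S' start, 'G' goal, '#' wall.
MAZE = ["S..#",
        ".#..",
        "...#",
        "#..G"]
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
START, GOAL = (0, 0), (3, 3)


def step(state, action):
    """Move one cell; hitting a wall or the border leaves the state unchanged."""
    r, c = state[0] + action[0], state[1] + action[1]
    if not (0 <= r < 4 and 0 <= c < 4) or MAZE[r][c] == "#":
        r, c = state
    reward = 1.0 if (r, c) == GOAL else -0.01
    return (r, c), reward


def train_q_learner(alpha, epsilon, gamma=0.95, episodes=300, seed=0):
    """Plain one-step tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * 4 for r in range(4) for c in range(4)}
    for _ in range(episodes):
        s = START
        for _ in range(100):  # cap on episode length
            if rng.random() < epsilon:
                a = rng.randrange(4)
            else:
                a = max(range(4), key=lambda i: q[s][i])
            s2, r = step(s, ACTIONS[a])
            # No bootstrapping from the terminal (goal) state.
            target = r if s2 == GOAL else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s = s2
            if s == GOAL:
                break
    return q


def aggregate_action(q_tables, state):
    """Assumed aggregation rule: majority vote over each learner's greedy action.
    Ties go to the lowest action index."""
    votes = [0] * 4
    for q in q_tables:
        votes[max(range(4), key=lambda i: q[state][i])] += 1
    return max(range(4), key=lambda i: votes[i])


def greedy_path_length(q_tables, max_steps=50):
    """Follow the aggregated policy from START.
    Returns the number of steps taken, or None if the goal is not reached."""
    s = START
    for n in range(1, max_steps + 1):
        s, _ = step(s, ACTIONS[aggregate_action(q_tables, s)])
        if s == GOAL:
            return n
    return None


# Three learners with different (alpha, epsilon) settings; values are hypothetical.
settings = [(0.1, 0.1), (0.3, 0.2), (0.5, 0.3)]
tables = [train_q_learner(a, e, seed=i) for i, (a, e) in enumerate(settings)]
print(greedy_path_length(tables))
```

In this hypothetical maze the shortest route from start to goal takes 6 moves, so the printed path length can be compared against that value. Voting over several differently tuned learners is one simple way to reduce dependence on any single parameter choice, which is the difficulty the abstract raises.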


Pages: 1303-1308

Page count: 6

50 items in total

[41] Generalized learning automata for multi-agent reinforcement learning
De Hauwere, Yann-Michael
Vrancx, Peter
Nowe, Ann
AI COMMUNICATIONS, 2010, 23 (04) : 311 - 324
[42] Reinforcement learning
Yatawatta, S.
ASTRONOMY AND COMPUTING, 2024, 48
[43] Data-Driven Flight Control of Internet-of-Drones for Sensor Data Aggregation Using Multi-Agent Deep Reinforcement Learning
Li, Kai
Ni, Wei
Emami, Yousef
Dressler, Falko
IEEE WIRELESS COMMUNICATIONS, 2022, 29 (04) : 18 - 23
[44] A Survey on Reinforcement Learning and Deep Reinforcement Learning for Recommender Systems
Rezaei, Mehrdad
Tabrizi, Nasseh
DEEP LEARNING THEORY AND APPLICATIONS, DELTA 2023, 2023, 1875 : 385 - 402
[45] Observational Learning by Reinforcement Learning
Borsa, Diana
Heess, Nicolas
Piot, Bilal
Liu, Siqi
Hasenclever, Leonard
Munos, Remi
Pietquin, Olivier
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1117 - 1124
[46] Curriculum Learning in Reinforcement Learning
Narvekar, Sanmit
AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1528 - 1529
[47] A Novel Data Aggregation Mechanism using Reinforcement Learning for Cluster Heads in Wireless Multimedia Sensor Networks
Uddin J.
Annals of Emerging Technologies in Computing, 2022, 6 (03) : 69 - 78
[48] Efficient Communication via Self-Supervised Information Aggregation for Online and Offline Multiagent Reinforcement Learning
Guan, Cong
Chen, Feng
Yuan, Lei
Zhang, Zongzhang
Yu, Yang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[49] Scheduling in Multiagent Systems Using Reinforcement Learning
I. K. Minashina
R. A. Gorbachev
E. M. Zakharova
Doklady Mathematics, 2022, 106 : S70 - S78
[50] Reinforcement Learning in the Multi-Robot Domain
Maja J. Matarić
Autonomous Robots, 1997, 4 : 73 - 83
