Factored Particle Swarm Optimization for Policy Co-training in Reinforcement Learning

Cited by: 1

Authors:

France, Kordel K. [1]

Sheppard, John W. [2]

Affiliations:

[1] Johns Hopkins Univ, Baltimore, MD 21218 USA

[2] Montana State Univ, Bozeman, MT 59717 USA

Source:

PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023 | 2023

Keywords:

factored evolutionary algorithms; co-training; reinforcement learning

DOI:

10.1145/3583131.3590376

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Uncertainty of the environment limits the circumstances with which any optimization problem can provide meaningful information. Multiple optimizers can combat this problem by communicating different information through cooperative coevolution. In reinforcement learning (RL), uncertainty can be reduced by applying learned policies collaboratively with another agent. Here, we propose policy Co-training with Factored Evolutionary Algorithms (CoFEA) to evolve an optimal policy for such scenarios. We hypothesize that self-paced co-training can allow factored particle swarms with imperfect knowledge to consolidate knowledge from each of their imperfect policies in order to approximate a single optimal policy. Additionally, we show how the performance of co-training swarms of RL agents can be maximized through the specific use of Expected SARSA as the policy learner. We evaluate CoFEA against comparable RL algorithms and attempt to establish limits for which our procedure does and does not provide benefit. Our results indicate that Particle Swarm Optimization (PSO) is effective in training multiple agents under uncertainty and that FEA reduces swarm and policy updates. This paper contributes to the field of cooperative co-evolutionary algorithms by proposing a method by which factored evolutionary techniques can significantly improve how multiple RL agents collaborate under extreme uncertainty to solve complex tasks faster than a single agent can under identical conditions.
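
The abstract names Expected SARSA as the policy learner. For reference only, here is a minimal tabular sketch of the standard Expected SARSA update under an epsilon-greedy policy. It is not the authors' CoFEA implementation; the function names, the list-of-lists Q-table layout, and the hyperparameter defaults are assumptions made for illustration.

```python
def epsilon_greedy_probs(q_values, epsilon):
    """Action probabilities of an epsilon-greedy policy for one state.

    Ties for the greedy action are broken by lowest index (an assumption).
    """
    n = len(q_values)
    greedy = max(range(n), key=lambda i: q_values[i])
    probs = [epsilon / n] * n
    probs[greedy] += 1.0 - epsilon
    return probs


def expected_sarsa_update(Q, s, a, r, s_next,
                          alpha=0.1, gamma=0.99, epsilon=0.1, done=False):
    """Apply one Expected SARSA update to Q[s][a] in place and return Q.

    Q is a hypothetical table: Q[state][action] -> estimated return.
    The target uses the expectation of Q over the policy's action
    probabilities in s_next, rather than a single sampled next action.
    """
    if done:
        # Terminal transition: no bootstrapping from the next state.
        target = r
    else:
        probs = epsilon_greedy_probs(Q[s_next], epsilon)
        expected_q = sum(p * q for p, q in zip(probs, Q[s_next]))
        target = r + gamma * expected_q
    Q[s][a] += alpha * (target - Q[s][a])
    return Q
```

Averaging over next-state action probabilities removes the variance that plain SARSA incurs from sampling the next action; whether this is the reason the paper favors Expected SARSA is not stated in the abstract.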

Citation

Pages: 30-38

Page count: 9

