Factored Particle Swarm Optimization for Policy Co-training in Reinforcement Learning

被引:1
|
作者
France, Kordel K. [1 ]
Sheppard, John W. [2 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Montana State Univ, Bozeman, MT 59717 USA
来源
PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023 | 2023年
关键词
factored evolutionary algorithms; co-training; reinforcement learning;
D O I
10.1145/3583131.3590376
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Uncertainty of the environment limits the circumstances with which any optimization problem can provide meaningful information. Multiple optimizers can combat this problem by communicating di.erent information through cooperative coevolution. In reinforcement learning (RL), uncertainty can be reduced by applying learned policies collaboratively with another agent. Here, we propose policy Co-training with Factored Evolutionary Algorithms (CoFEA) to evolve an optimal policy for such scenarios. We hypothesize that self-paced co-training can allow factored particle swarms with imperfect knowledge to consolidate knowledge from each of their imperfect policies in order to approximate a single optimal policy. Additionally, we show how the performance of co-training swarms of RL agents can be maximized through the speci.c use of Expected SARSA as the policy learner. We evaluate CoFEA against comparable RL algorithms and attempt to establish limits for which our procedure does and does not provide bene.t. Our results indicate that Particle Swarm Optimization (PSO) is e.ective in training multiple agents under uncertainty and that FEA reduces swarm and policy updates. This paper contributes to the.eld of cooperative co-evolutionary algorithms by proposing a method by which factored evolutionary techniques can signi.cantly improve how multiple RL agents collaborate under extreme uncertainty to solve complex tasks faster than a single agent can under identical conditions.
引用
收藏
页码:30 / 38
页数:9
相关论文
共 50 条
  • [1] Co-training for Policy Learning
    Song, Jialin
    Lanka, Ravi
    Yue, Yisong
    Ono, Masahiro
    35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 1191 - 1201
  • [2] Swarm Reinforcement Learning Algorithms Based on Particle Swarm Optimization
    Iima, Hitoshi
    Kuroe, Yasuaki
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1109 - 1114
  • [3] Supervised learning and Co-training
    Darnstaedt, Malte
    Simon, Hans Ulrich
    Szoerenyi, Balazs
    THEORETICAL COMPUTER SCIENCE, 2014, 519 : 68 - 87
  • [4] Supervised Learning and Co-training
    Darnstaedt, Malte
    Simon, Hans Ulrich
    Balazs Szoerenyi
    ALGORITHMIC LEARNING THEORY, 2011, 6925 : 425 - +
  • [5] Employing reinforcement learning to enhance particle swarm optimization methods
    Wu, Di
    Wang, G. Gary
    ENGINEERING OPTIMIZATION, 2022, 54 (02) : 329 - 348
  • [6] Operon Prediction using Particle Swarm Optimization and Reinforcement Learning
    Chuang, Li-Yeh
    Tsai, Jui-Hung
    Yang, Cheng-Hong
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 366 - 372
  • [7] Integrating Particle Swarm Optimization with Reinforcement Learning in Noisy Problems
    Piperagkas, Grigoris S.
    Georgoulas, George
    Parsopoulos, Kostas E.
    Stylios, Chrysostomos D.
    Likas, Aristidis C.
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2012, : 65 - 72
  • [8] A reinforcement learning approach for single redundant view co-training text classification
    Paiva, Bruno B. M.
    Nascimento, Erickson R.
    Goncalves, Marcos Andre
    Belem, Fabiano
    INFORMATION SCIENCES, 2022, 615 : 24 - 38
  • [9] Reinforcement Learning with Particle Swarm Optimization Policy (PSO-P) in Continuous State and Action Spaces
    Hein, Daniel
    Hentschel, Alexander
    Runkler, Thomas A.
    Udluft, Steffen
    INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH, 2016, 7 (03) : 23 - 42
  • [10] A reinforcement learning-based communication topology in particle swarm optimization
    Xu, Yue
    Pi, Dechang
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (14): : 10007 - 10032