POET: Open-Ended Coevolution of Environments and their Optimized Solutions

被引:32
作者
Wang, Rui [1 ]
Lehman, Joel [1 ]
Clune, Jeff [1 ]
Stanley, Kenneth O. [1 ]
机构
[1] Uber AI, San Francisco, CA 94103 USA
来源
PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19) | 2019年
关键词
Open-ended evolution; coevolution; evolution strategies; novelty search; artificial life; EVOLUTION;
D O I
10.1145/3321707.3321799
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
How can progress in machine learning and reinforcement learning be automated to generate its own never-ending curriculum of challenges without human intervention? The recent emergence of quality diversity (QD) algorithms offers a glimpse of the potential for such continual open-ended invention. For example, novelty search showcases the benefits of explicit novelty pressure, MAP-Elites and Innovation Engines highlight the advantage of explicit elitism within niches in an otherwise divergent process, and minimal criterion coevolution (MCC) reveals that problems and solutions can coevolve divergently. The Paired Open-Ended Trailblazer (POET) algorithm introduced in this paper combines these principles to produce a practical approach to generating an endless progression of diverse and increasingly challenging environments while at the same time explicitly optimizing their solutions. An intriguing implication is the opportunity to transfer solutions among environments, reflecting the view that innovation is a circuitous and unpredictable process. POET is tested in a 2-D obstacles course domain, where it generates diverse and sophisticated behaviors that create and solve a wide range of environmental challenges, many of which cannot be solved by direct optimization, or by a direct-path curriculum-building control algorithm. We hope that POET will inspire a new push towards open-ended discovery across many domains.
引用
收藏
页码:142 / 151
页数:10
相关论文
共 66 条
  • [1] Anderson C. W., 1989, IEEE Control Systems Magazine, V9, P31, DOI 10.1109/37.24809
  • [2] [Anonymous], P 11 INT C ART LIF A
  • [3] [Anonymous], ARXIV18100227
  • [4] [Anonymous], 2016, PROC INT C MACH LEAR
  • [5] [Anonymous], 2017, ARXIV171206564
  • [6] [Anonymous], 2018, ARXIV181003779
  • [7] [Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
  • [8] [Anonymous], 2018, ARXIV180604640
  • [9] [Anonymous], 2016, BIPEDALWALKERHARDCOR
  • [10] [Anonymous], 2018, ARXIV180703392