GRIMGEP: Learning Progress for Robust Goal Sampling in Visual Deep Reinforcement Learning

被引:2
|
作者
Kovac, Grgur [1 ]
Laversanne-Finot, Adrien [1 ]
Oudeyer, Pierre-Yves [1 ]
机构
[1] INRIA Bordeaux, Flowers Lab, F-33400 Talence, France
关键词
Goal exploration; learning progress; reinforcement learning (RL); INTRINSIC MOTIVATION; EXPLORATION; SYSTEMS;
D O I
10.1109/TCDS.2022.3216911
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autotelic reinforcement learning (RL) agents sample their own goals, and try to reach them. They often prioritize goal sampling according to some intrinsic reward, ex. novelty or absolute learning progress (ALPs). Novelty-based approaches work robustly in unsupervised image-based environments when there are no distractors. However, they construct simple curricula that do not take the agent's performance into account: in complex environments, they often get attracted by impossible tasks. ALP-based approaches, which are often combined with a clustering mechanism, construct complex curricula tuned to the agent's current capabilities. Such curricula sample goals on which the agent is currently learning the most, and do not get attracted by impossible tasks. However, ALP approaches have not so far been applied to DRL agents perceiving complex environments directly in the image space. Goal regions guided intrinsically motivated goal exploration process (GRIMGEP), without using any expert knowledge, combines the ALP clustering approaches with novelty-based approaches and extends them to those complex scenarios. We experiment on a rich 3-D image-based environment with distractors using novelty-based exploration approaches: Skewfit and CountBased. We show that wrapping them with GRIMGEP-using them only in the cluster sampled by ALP-creates a better curriculum. The wrapped approaches are attracted less by the distractors, and achieve drastically better performances.
引用
收藏
页码:1396 / 1407
页数:12
相关论文
共 50 条
  • [41] Adaptive DAG Tasks Scheduling with Deep Reinforcement Learning
    Wu, Qing
    Wu, Zhiwei
    Zhuang, Yuehui
    Cheng, Yuxia
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT II, 2018, 11335 : 477 - 490
  • [42] Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
    Haydari, Ammar
    Yilmaz, Yasin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (01) : 11 - 32
  • [43] Reinforcement learning for control: Performance, stability, and deep approximators
    Busoniu, Lucian
    de Bruin, Tim
    Tolic, Domagoj
    Kober, Jens
    Palunko, Ivana
    ANNUAL REVIEWS IN CONTROL, 2018, 46 : 8 - 28
  • [44] Deep Reinforcement Learning with Risk-Seeking Exploration
    Dilokthanakul, Nat
    Shanahan, Murray
    FROM ANIMALS TO ANIMATS 15, 2018, 10994 : 201 - 211
  • [45] Structure in Deep Reinforcement Learning: A Survey and Open Problems
    Mohan, Aditya
    Zhang, Amy
    Lindauer, Marius
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 79 : 1167 - 1236
  • [46] Deep Reinforcement Learning for Sequence-to-Sequence Models
    Keneshloo, Yaser
    Shi, Tian
    Ramakrishnan, Naren
    Reddy, Chandan K.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2469 - 2489
  • [47] Deep Reinforcement Learning for Solving AGVs Routing Problem
    Lu, Chengxuan
    Long, Jinjun
    Xing, Zichao
    Wu, Weimin
    Gu, Yong
    Luo, Jiliang
    Huang, Yisheng
    VERIFICATION AND EVALUATION OF COMPUTER AND COMMUNICATION SYSTEMS, VECOS 2020, 2020, 12519 : 222 - 236
  • [48] Deep reinforcement learning with combinatorial actions spaces: An to maintenance
    Goby, Niklas
    Brandt, Tobias
    Neumann, Dirk
    COMPUTERS & INDUSTRIAL ENGINEERING, 2023, 179
  • [49] Applications of Deep Reinforcement Learning in Communications and Networking: A Survey
    Luong, Nguyen Cong
    Hoang, Dinh Thai
    Gong, Shimin
    Niyato, Dusit
    Wang, Ping
    Liang, Ying-Chang
    Kim, Dong In
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2019, 21 (04): : 3133 - 3174
  • [50] Quadrotor navigation in dynamic environments with deep reinforcement learning
    Fang, Jinbao
    Sun, Qiyu
    Chen, Yukun
    Tang, Yang
    ASSEMBLY AUTOMATION, 2021, 41 (03) : 254 - 262