Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards

被引:0
|
作者
Fu, Zhao-Yang [1 ]
Zhan, De-Chuan [1 ]
Li, Xin-Chun [1 ]
Lu, Yi-Xing [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning has played an important role in decision making related applications, e.g., robotics motion, self-driving, recommendation, etc. The reward function, as a crucial component, affects the efficiency and effectiveness of reinforcement learning to a large extent. In this paper, we focus on the investigation of reinforcement learning with more than one auxiliary reward. It is found that different auxiliary rewards can boost up the learning rate and effectiveness in different stages, and consequently we propose the Automatic Successive Reinforcement Learning (AsR) for auxiliary rewards grading selection for efficient reinforcement learning by stages. Experiments and simulations have shown the superiority of our proposed AsR on a range of environments, including OpenAI classical control domains and video games; Freeway and Catcher.
引用
收藏
页码:2336 / 2342
页数:7
相关论文
共 50 条
  • [1] Reinforcement Learning with Multiple Shared Rewards
    Guisi, Douglas M.
    Ribeiro, Richardson
    Teixeira, Marcelo
    Borges, Andre P.
    Enembreck, Fabricio
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 855 - 864
  • [2] Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
    Li, Siyuan
    Wang, Rui
    Tang, Minxue
    Zhang, Chongjie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [3] Reinforcement Learning for Joint Optimization of Multiple Rewards
    Agarwal, Mridul
    Aggarwal, Vaneet
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [4] Reinforcement Learning Can Be More Efficient with Multiple Rewards
    Dann, Christoph
    Mansour, Yishay
    Mohri, Mehryar
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [5] Multiple rewards fuzzy reinforcement learning algorithm in RoboCup environment
    Li, S
    Yao, JY
    Ye, Z
    Sun, ZQ
    PROCEEDINGS OF THE 2001 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS (CCA'01), 2001, : 317 - 322
  • [6] Reinforcement Learning with Perturbed Rewards
    Wang, Jingkang
    Liu, Yang
    Li, Bo
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6202 - 6209
  • [7] Deep Reinforcement Learning With Multiple Unrelated Rewards for AGV Mapless Navigation
    Cai, Boliang
    Wei, Changyun
    Ji, Ze
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 18
  • [8] Learning Intrinsic Symbolic Rewards in Reinforcement Learning
    Sheikh, Hassam Ullah
    Khadka, Shauharda
    Miret, Santiago
    Majumdar, Somdeb
    Phielipp, Mariano
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [9] Online learning of shaping rewards in reinforcement learning
    Grzes, Marek
    Kudenko, Daniel
    NEURAL NETWORKS, 2010, 23 (04) : 541 - 550
  • [10] Reinforcement Learning With Temporal Logic Rewards
    Li, Xiao
    Vasile, Cristian-Ioan
    Belta, Calin
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3834 - 3839