Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards

被引:0
|
作者
Fu, Zhao-Yang [1 ]
Zhan, De-Chuan [1 ]
Li, Xin-Chun [1 ]
Lu, Yi-Xing [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning has played an important role in decision making related applications, e.g., robotics motion, self-driving, recommendation, etc. The reward function, as a crucial component, affects the efficiency and effectiveness of reinforcement learning to a large extent. In this paper, we focus on the investigation of reinforcement learning with more than one auxiliary reward. It is found that different auxiliary rewards can boost up the learning rate and effectiveness in different stages, and consequently we propose the Automatic Successive Reinforcement Learning (AsR) for auxiliary rewards grading selection for efficient reinforcement learning by stages. Experiments and simulations have shown the superiority of our proposed AsR on a range of environments, including OpenAI classical control domains and video games; Freeway and Catcher.
引用
收藏
页码:2336 / 2342
页数:7
相关论文
共 50 条
  • [21] Reinforcement learning with immediate rewards and linear hypotheses
    Abe, N
    Biermann, AW
    Long, PM
    ALGORITHMICA, 2003, 37 (04) : 263 - 293
  • [22] Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics
    Herman, Michael
    Gindele, Tobias
    Wagner, Joerg
    Schmitt, Felix
    Burgard, Wolfram
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 102 - 110
  • [23] Constrained reinforcement learning from intrinsic and extrinsic rewards
    Uchibe, Eiji
    Doya, Kenji
    2007 IEEE 6TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, 2007, : 45 - +
  • [24] Accelerating Lifelong Reinforcement Learning via Reshaping Rewards
    Chu, Kun
    Zhu, Xianchao
    Zhu, William
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 619 - 624
  • [25] Reinforcement Learning and Additional Rewards for the Traveling Salesman Problem
    Mele, Umberto Junior
    Chou, Xiaochen
    Gambardella, Luca Maria
    Montemanni, Roberto
    2021 THE 8TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND APPLICATIONS-EUROPE, ICIEA 2021-EUROPE, 2021, : 198 - 204
  • [26] Orientation-Preserving Rewards' Balancing in Reinforcement Learning
    Ren, Jinsheng
    Guo, Shangqi
    Chen, Feng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6458 - 6472
  • [27] Deep Reinforcement Learning For SPORADIC Rewards With HUMAN Experience
    Sinha, Harshit
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,
  • [28] Tentative Exploration on Reinforcement Learning Algorithms for Stochastic Rewards
    Pena, Luis
    LaTorre, Antonio
    Pena, Jose-Maria
    Ossowski, Sascha
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2009, 5572 : 336 - 343
  • [29] Reinforcement Learning with Automated Auxiliary Loss Search
    He, Tairan
    Zhang, Yuge
    Ren, Kan
    Liu, Minghuan
    Wang, Che
    Zhang, Weinan
    Yang, Yuqing
    Li, Dongsheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [30] Reinforcement Learning via Auxiliary Task Distillation
    Harish, Abhinav Narayan
    Heck, Larry
    Hanna, Josiah P.
    Zsolt
    Szot, Andrew
    COMPUTER VISION - ECCV 2024, PT LXXXI, 2025, 15139 : 214 - 230