Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards

被引:0
|
作者
Fu, Zhao-Yang [1 ]
Zhan, De-Chuan [1 ]
Li, Xin-Chun [1 ]
Lu, Yi-Xing [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning has played an important role in decision making related applications, e.g., robotics motion, self-driving, recommendation, etc. The reward function, as a crucial component, affects the efficiency and effectiveness of reinforcement learning to a large extent. In this paper, we focus on the investigation of reinforcement learning with more than one auxiliary reward. It is found that different auxiliary rewards can boost up the learning rate and effectiveness in different stages, and consequently we propose the Automatic Successive Reinforcement Learning (AsR) for auxiliary rewards grading selection for efficient reinforcement learning by stages. Experiments and simulations have shown the superiority of our proposed AsR on a range of environments, including OpenAI classical control domains and video games; Freeway and Catcher.
引用
收藏
页码:2336 / 2342
页数:7
相关论文
共 50 条
  • [41] Demonstration and offset augmented meta reinforcement learning with sparse rewards
    Li, Haorui
    Liang, Jiaqi
    Wang, Xiaoxuan
    Jiang, Chengzhi
    Li, Linjing
    Zeng, Daniel
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (04)
  • [42] Robust Offline Reinforcement Learning with Heavy-Tailed Rewards
    Zhu, Jin
    Wan, Runzhe
    Qi, Zhengling
    Luo, Shikai
    Shi, Chengchun
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [43] Individual Versus Difference Rewards on Reinforcement Learning for Route Choice
    Grunitzki, Ricardo
    Ramos, Gabriel de O.
    Bazzan, Ana L. C.
    2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 253 - 258
  • [44] Reinforcement Learning With Composite Rewards for Production Scheduling in a Smart Factory
    Zhou, Tong
    Tang, Dunbing
    Zhu, Haihua
    Wang, Liping
    IEEE ACCESS, 2021, 9 : 752 - 766
  • [45] Contrastive Visual Explanations for Reinforcement Learning via Counterfactual Rewards
    Liu, Xiaowei
    McAreavey, Kevin
    Liu, Weiru
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT II, 2023, 1902 : 72 - 87
  • [46] INFLUENCE OF REINFORCEMENT TECHNIQUE ON EFFECTS OF MATERIAL REWARDS IN CHILDRENS LEARNING
    MCCULLER.JC
    STAAT, J
    PSYCHONOMIC SCIENCE, 1972, 29 (4B): : 267 - &
  • [47] Exploring selfish reinforcement learning in repeated games with stochastic rewards
    Verbeeck, Katja
    Nowe, Ann
    Parent, Johan
    Tuyls, Karl
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2007, 14 (03) : 239 - 269
  • [48] MoleGuLAR: Molecule Generation Using Reinforcement Learning with Alternating Rewards
    Goel, Manan
    Raghunathan, Shampa
    Laghuvarapu, Siddhartha
    Priyakumar, U. Deva
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (12) : 5815 - 5826
  • [49] Discovering and Removing Exogenous State Variables and Rewards for Reinforcement Learning
    Dietterich, Thomas
    Trimponias, George
    Chen, Zhitang
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [50] Finding intrinsic rewards by embodied evolution and constrained reinforcement learning
    Uchibe, Eiji
    Doya, Kenji
    NEURAL NETWORKS, 2008, 21 (10) : 1447 - 1455