Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards

被引:0
|
作者
Fu, Zhao-Yang [1 ]
Zhan, De-Chuan [1 ]
Li, Xin-Chun [1 ]
Lu, Yi-Xing [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning has played an important role in decision making related applications, e.g., robotics motion, self-driving, recommendation, etc. The reward function, as a crucial component, affects the efficiency and effectiveness of reinforcement learning to a large extent. In this paper, we focus on the investigation of reinforcement learning with more than one auxiliary reward. It is found that different auxiliary rewards can boost up the learning rate and effectiveness in different stages, and consequently we propose the Automatic Successive Reinforcement Learning (AsR) for auxiliary rewards grading selection for efficient reinforcement learning by stages. Experiments and simulations have shown the superiority of our proposed AsR on a range of environments, including OpenAI classical control domains and video games; Freeway and Catcher.
引用
收藏
页码:2336 / 2342
页数:7
相关论文
共 50 条
  • [31] Off-Policy Reinforcement Learning with Delayed Rewards
    Han, Beining
    Ren, Zhizhou
    Wu, Zuofan
    Zhou, Yuan
    Peng, Jian
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [32] Adaptive Auxiliary Task Weighting for Reinforcement Learning
    Lin, Xingyu
    Baweja, Harjatin Singh
    Kantor, George
    Held, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [33] Learning Circuit Placement Techniques through Reinforcement Learning with Adaptive Rewards
    Vassallo, Luke
    Bajada, Josef
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [34] Rewards Prediction-Based Credit Assignment for Reinforcement Learning With Sparse Binary Rewards
    Seo, Minah
    Vecchietti, Luiz Felipe
    Lee, Sangkeum
    Har, Dongsoo
    IEEE ACCESS, 2019, 7 : 118776 - 118791
  • [35] Split Q Learning: Reinforcement Learning with Two-Stream Rewards
    Lin, Baihan
    Bouneffouf, Djallel
    Cecchi, Guillermo
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6448 - 6449
  • [36] State Augmented Constrained Reinforcement Learning: Overcoming the Limitations of Learning With Rewards
    Calvo-Fullana, Miguel
    Paternain, Santiago
    Chamon, Luiz F. O.
    Ribeiro, Alejandro
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (07) : 4275 - 4290
  • [37] Exploring selfish reinforcement learning in repeated games with stochastic rewards
    Katja Verbeeck
    Ann Nowé
    Johan Parent
    Karl Tuyls
    Autonomous Agents and Multi-Agent Systems, 2007, 14 : 239 - 269
  • [38] REVERSAL LEARNING IN A SUCCESSIVE DISCRIMINATION USING INTERMITTENT REINFORCEMENT
    MELLGREN, RL
    OST, JWP
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1970, 84 (01): : 181 - &
  • [39] No-Regret Reinforcement Learning with Heavy-Tailed Rewards
    Zhuang, Vincent
    Sui, Yanan
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [40] Potential-Based Difference Rewards for Multiagent Reinforcement Learning
    Devlin, Sam
    Yliniemi, Logan
    Kudenko, Daniel
    Tumer, Kagan
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 165 - 172