Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards

被引：0

作者：

Fu, Zhao-Yang ^{[1
]}

Zhan, De-Chuan ^{[1
]}

Li, Xin-Chun ^{[1
]}

Lu, Yi-Xing ^{[1
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China

来源：

PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年

基金：

国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning has played an important role in decision making related applications, e.g., robotics motion, self-driving, recommendation, etc. The reward function, as a crucial component, affects the efficiency and effectiveness of reinforcement learning to a large extent. In this paper, we focus on the investigation of reinforcement learning with more than one auxiliary reward. It is found that different auxiliary rewards can boost up the learning rate and effectiveness in different stages, and consequently we propose the Automatic Successive Reinforcement Learning (AsR) for auxiliary rewards grading selection for efficient reinforcement learning by stages. Experiments and simulations have shown the superiority of our proposed AsR on a range of environments, including OpenAI classical control domains and video games; Freeway and Catcher.

引用

页码：2336 / 2342

页数：7

共 50 条

[1] Reinforcement Learning with Multiple Shared Rewards
Guisi, Douglas M.
Ribeiro, Richardson
Teixeira, Marcelo
Borges, Andre P.
Enembreck, Fabricio
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 855 - 864
[2] Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
Li, Siyuan
Wang, Rui
Tang, Minxue
Zhang, Chongjie
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[3] Reinforcement Learning for Joint Optimization of Multiple Rewards
Agarwal, Mridul
Aggarwal, Vaneet
JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[4] Reinforcement Learning Can Be More Efficient with Multiple Rewards
Dann, Christoph
Mansour, Yishay
Mohri, Mehryar
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[5] Multiple rewards fuzzy reinforcement learning algorithm in RoboCup environment
Li, S
Yao, JY
Ye, Z
Sun, ZQ
PROCEEDINGS OF THE 2001 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS (CCA'01), 2001, : 317 - 322
[6] Reinforcement Learning with Perturbed Rewards
Wang, Jingkang
Liu, Yang
Li, Bo
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6202 - 6209
[7] Deep Reinforcement Learning With Multiple Unrelated Rewards for AGV Mapless Navigation
Cai, Boliang
Wei, Changyun
Ji, Ze
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 18
[8] Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Sheikh, Hassam Ullah
Khadka, Shauharda
Miret, Santiago
Majumdar, Somdeb
Phielipp, Mariano
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[9] Online learning of shaping rewards in reinforcement learning
Grzes, Marek
Kudenko, Daniel
NEURAL NETWORKS, 2010, 23 (04) : 541 - 550
[10] Reinforcement Learning With Temporal Logic Rewards
Li, Xiao
Vasile, Cristian-Ioan
Belta, Calin
2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3834 - 3839

← 1 2 3 4 5 →