Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards

被引：0

作者：

Fu, Zhao-Yang ^{[1
]}

Zhan, De-Chuan ^{[1
]}

Li, Xin-Chun ^{[1
]}

Lu, Yi-Xing ^{[1
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China

来源：

PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年

基金：

国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning has played an important role in decision making related applications, e.g., robotics motion, self-driving, recommendation, etc. The reward function, as a crucial component, affects the efficiency and effectiveness of reinforcement learning to a large extent. In this paper, we focus on the investigation of reinforcement learning with more than one auxiliary reward. It is found that different auxiliary rewards can boost up the learning rate and effectiveness in different stages, and consequently we propose the Automatic Successive Reinforcement Learning (AsR) for auxiliary rewards grading selection for efficient reinforcement learning by stages. Experiments and simulations have shown the superiority of our proposed AsR on a range of environments, including OpenAI classical control domains and video games; Freeway and Catcher.

引用

页码：2336 / 2342

页数：7

共 50 条

[21] Reinforcement learning with immediate rewards and linear hypotheses
Abe, N
Biermann, AW
Long, PM
ALGORITHMICA, 2003, 37 (04) : 263 - 293
[22] Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics
Herman, Michael
Gindele, Tobias
Wagner, Joerg
Schmitt, Felix
Burgard, Wolfram
ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 102 - 110
[23] Constrained reinforcement learning from intrinsic and extrinsic rewards
Uchibe, Eiji
Doya, Kenji
2007 IEEE 6TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, 2007, : 45 - +
[24] Accelerating Lifelong Reinforcement Learning via Reshaping Rewards
Chu, Kun
Zhu, Xianchao
Zhu, William
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 619 - 624
[25] Reinforcement Learning and Additional Rewards for the Traveling Salesman Problem
Mele, Umberto Junior
Chou, Xiaochen
Gambardella, Luca Maria
Montemanni, Roberto
2021 THE 8TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND APPLICATIONS-EUROPE, ICIEA 2021-EUROPE, 2021, : 198 - 204
[26] Orientation-Preserving Rewards' Balancing in Reinforcement Learning
Ren, Jinsheng
Guo, Shangqi
Chen, Feng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6458 - 6472
[27] Deep Reinforcement Learning For SPORADIC Rewards With HUMAN Experience
Sinha, Harshit
PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,
[28] Tentative Exploration on Reinforcement Learning Algorithms for Stochastic Rewards
Pena, Luis
LaTorre, Antonio
Pena, Jose-Maria
Ossowski, Sascha
HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2009, 5572 : 336 - 343
[29] Reinforcement Learning with Automated Auxiliary Loss Search
He, Tairan
Zhang, Yuge
Ren, Kan
Liu, Minghuan
Wang, Che
Zhang, Weinan
Yang, Yuqing
Li, Dongsheng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[30] Reinforcement Learning via Auxiliary Task Distillation
Harish, Abhinav Narayan
Heck, Larry
Hanna, Josiah P.
Zsolt
Szot, Andrew
COMPUTER VISION - ECCV 2024, PT LXXXI, 2025, 15139 : 214 - 230

← 1 2 3 4 5 →