Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards

被引：0

作者：

Fu, Zhao-Yang ^{[1
]}

Zhan, De-Chuan ^{[1
]}

Li, Xin-Chun ^{[1
]}

Lu, Yi-Xing ^{[1
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China

来源：

PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年

基金：

国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning has played an important role in decision making related applications, e.g., robotics motion, self-driving, recommendation, etc. The reward function, as a crucial component, affects the efficiency and effectiveness of reinforcement learning to a large extent. In this paper, we focus on the investigation of reinforcement learning with more than one auxiliary reward. It is found that different auxiliary rewards can boost up the learning rate and effectiveness in different stages, and consequently we propose the Automatic Successive Reinforcement Learning (AsR) for auxiliary rewards grading selection for efficient reinforcement learning by stages. Experiments and simulations have shown the superiority of our proposed AsR on a range of environments, including OpenAI classical control domains and video games; Freeway and Catcher.

引用

页码：2336 / 2342

页数：7

共 50 条

[31] Off-Policy Reinforcement Learning with Delayed Rewards
Han, Beining
Ren, Zhizhou
Wu, Zuofan
Zhou, Yuan
Peng, Jian
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[32] Adaptive Auxiliary Task Weighting for Reinforcement Learning
Lin, Xingyu
Baweja, Harjatin Singh
Kantor, George
Held, David
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[33] Learning Circuit Placement Techniques through Reinforcement Learning with Adaptive Rewards
Vassallo, Luke
Bajada, Josef
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
[34] Rewards Prediction-Based Credit Assignment for Reinforcement Learning With Sparse Binary Rewards
Seo, Minah
Vecchietti, Luiz Felipe
Lee, Sangkeum
Har, Dongsoo
IEEE ACCESS, 2019, 7 : 118776 - 118791
[35] Split Q Learning: Reinforcement Learning with Two-Stream Rewards
Lin, Baihan
Bouneffouf, Djallel
Cecchi, Guillermo
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6448 - 6449
[36] State Augmented Constrained Reinforcement Learning: Overcoming the Limitations of Learning With Rewards
Calvo-Fullana, Miguel
Paternain, Santiago
Chamon, Luiz F. O.
Ribeiro, Alejandro
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (07) : 4275 - 4290
[37] Exploring selfish reinforcement learning in repeated games with stochastic rewards
Katja Verbeeck
Ann Nowé
Johan Parent
Karl Tuyls
Autonomous Agents and Multi-Agent Systems, 2007, 14 : 239 - 269
[38] REVERSAL LEARNING IN A SUCCESSIVE DISCRIMINATION USING INTERMITTENT REINFORCEMENT
MELLGREN, RL
OST, JWP
JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1970, 84 (01): : 181 - &
[39] No-Regret Reinforcement Learning with Heavy-Tailed Rewards
Zhuang, Vincent
Sui, Yanan
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[40] Potential-Based Difference Rewards for Multiagent Reinforcement Learning
Devlin, Sam
Yliniemi, Logan
Kudenko, Daniel
Tumer, Kagan
AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 165 - 172

← 1 2 3 4 5 →