Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards

被引：0

作者：

Fu, Zhao-Yang ^{[1
]}

Zhan, De-Chuan ^{[1
]}

Li, Xin-Chun ^{[1
]}

Lu, Yi-Xing ^{[1
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China

来源：

PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年

基金：

国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning has played an important role in decision making related applications, e.g., robotics motion, self-driving, recommendation, etc. The reward function, as a crucial component, affects the efficiency and effectiveness of reinforcement learning to a large extent. In this paper, we focus on the investigation of reinforcement learning with more than one auxiliary reward. It is found that different auxiliary rewards can boost up the learning rate and effectiveness in different stages, and consequently we propose the Automatic Successive Reinforcement Learning (AsR) for auxiliary rewards grading selection for efficient reinforcement learning by stages. Experiments and simulations have shown the superiority of our proposed AsR on a range of environments, including OpenAI classical control domains and video games; Freeway and Catcher.

引用

页码：2336 / 2342

页数：7

共 50 条

[41] Demonstration and offset augmented meta reinforcement learning with sparse rewards
Li, Haorui
Liang, Jiaqi
Wang, Xiaoxuan
Jiang, Chengzhi
Li, Linjing
Zeng, Daniel
COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (04)
[42] Robust Offline Reinforcement Learning with Heavy-Tailed Rewards
Zhu, Jin
Wan, Runzhe
Qi, Zhengling
Luo, Shikai
Shi, Chengchun
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
[43] Individual Versus Difference Rewards on Reinforcement Learning for Route Choice
Grunitzki, Ricardo
Ramos, Gabriel de O.
Bazzan, Ana L. C.
2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 253 - 258
[44] Reinforcement Learning With Composite Rewards for Production Scheduling in a Smart Factory
Zhou, Tong
Tang, Dunbing
Zhu, Haihua
Wang, Liping
IEEE ACCESS, 2021, 9 : 752 - 766
[45] Contrastive Visual Explanations for Reinforcement Learning via Counterfactual Rewards
Liu, Xiaowei
McAreavey, Kevin
Liu, Weiru
EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT II, 2023, 1902 : 72 - 87
[46] INFLUENCE OF REINFORCEMENT TECHNIQUE ON EFFECTS OF MATERIAL REWARDS IN CHILDRENS LEARNING
MCCULLER.JC
STAAT, J
PSYCHONOMIC SCIENCE, 1972, 29 (4B): : 267 - &
[47] Exploring selfish reinforcement learning in repeated games with stochastic rewards
Verbeeck, Katja
Nowe, Ann
Parent, Johan
Tuyls, Karl
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2007, 14 (03) : 239 - 269
[48] MoleGuLAR: Molecule Generation Using Reinforcement Learning with Alternating Rewards
Goel, Manan
Raghunathan, Shampa
Laghuvarapu, Siddhartha
Priyakumar, U. Deva
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (12) : 5815 - 5826
[49] Discovering and Removing Exogenous State Variables and Rewards for Reinforcement Learning
Dietterich, Thomas
Trimponias, George
Chen, Zhitang
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[50] Finding intrinsic rewards by embodied evolution and constrained reinforcement learning
Uchibe, Eiji
Doya, Kenji
NEURAL NETWORKS, 2008, 21 (10) : 1447 - 1455

← 1 2 3 4 5 →