Modified reward function on abstract features in inverse reinforcement learning

被引：2

作者：

Shenyi CHENHui QIANJia FANZhuojun JINMiaoliang ZHUSchool of Computer Science and TechnologyZhejiang UniversityHangzhou China ^{[310027
]}

机构：

来源：

Journal of Zhejiang University-Science C(Computer & Electronics) | 2010年 / 11卷 / 09期

关键词：

D O I：

暂无

中图分类号：

TP181 [自动推理、机器学习];

学科分类号：

摘要：

We improve inverse reinforcement learning(IRL) by applying dimension reduction methods to automatically extract Abstract features from human-demonstrated policies,to deal with the cases where features are either unknown or numerous.The importance rating of each abstract feature is incorporated into the reward function.Simulation is performed on a task of driving in a five-lane highway,where the controlled car has the largest fixed speed among all the cars.Performance is almost 10.6% better on average with than without importance ratings.

引用

页码：718 / 723

页数：6

共 50 条

[31] Reward Reports for Reinforcement Learning
Gilbert, Thomas Krendl
Lambert, Nathan
Dean, Sarah
Zick, Tom
Snoswell, Aaron
Mehta, Soham
PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 84 - 130
[32] Reward, motivation, and reinforcement learning
Dayan, P
Balleine, BW
NEURON, 2002, 36 (02) : 285 - 298
[33] Reinforcement learning model with a reward function based on human driving characteristics
Pan, Feng
Bao, Hong
2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 225 - 229
[34] Reinforcement Learning With Constrained Uncertain Reward Function Through Particle Filtering
Dogru, Oguzhan
Chiplunkar, Ranjith
Huang, Biao
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (07) : 7491 - 7499
[35] Fear based Intrinsic Reward as a Barrier Function for Continuous Reinforcement Learning
Sanchez, Rodney
Sahin, Ferat
Heard, Jamison
2024 19TH ANNUAL SYSTEM OF SYSTEMS ENGINEERING CONFERENCE, SOSE 2024, 2024, : 140 - 146
[36] LTL and Beyond: Formal Languages for Reward Function Specification in Reinforcement Learning
Camacho, Alberto
Icarte, Rodrigo Toro
Klassen, Toryn Q.
Valenzano, Richard
McIlraith, Sheila A.
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6065 - 6073
[37] Reinforcement-Learning-Based Path Planning: A Reward Function Strategy
Jaramillo-Martinez, Ramon
Chavero-Navarrete, Ernesto
Ibarra-Perez, Teodoro
APPLIED SCIENCES-BASEL, 2024, 14 (17):
[38] Learning Reward Models for Cooperative Trajectory Planning with Inverse Reinforcement Learning and Monte Carlo Tree Search
Kurzer, Karl
Bitzer, Matthias
Zoellner, J. Marius
2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 22 - 28
[39] OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning
Henderson, Peter
Chang, Wei-Di
Bacon, Pierre-Luc
Meger, David
Pineau, Joelle
Precup, Doina
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3199 - 3206
[40] A state-based inverse reinforcement learning approach to model activity-travel choices behavior with reward function recovery
Song, Yuchen
Li, Dawei
Ma, Zhenliang
Liu, Dongjie
Zhang, Tong
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 158

← 1 2 3 4 5 →