Modified reward function on abstract features in inverse reinforcement learning

被引:2
|
作者
Shenyi CHENHui QIANJia FANZhuojun JINMiaoliang ZHUSchool of Computer Science and TechnologyZhejiang UniversityHangzhou China [310027 ]
机构
关键词
D O I
暂无
中图分类号
TP181 [自动推理、机器学习];
学科分类号
摘要
We improve inverse reinforcement learning(IRL) by applying dimension reduction methods to automatically extract Abstract features from human-demonstrated policies,to deal with the cases where features are either unknown or numerous.The importance rating of each abstract feature is incorporated into the reward function.Simulation is performed on a task of driving in a five-lane highway,where the controlled car has the largest fixed speed among all the cars.Performance is almost 10.6% better on average with than without importance ratings.
引用
收藏
页码:718 / 723
页数:6
相关论文
共 50 条
  • [31] Reward Reports for Reinforcement Learning
    Gilbert, Thomas Krendl
    Lambert, Nathan
    Dean, Sarah
    Zick, Tom
    Snoswell, Aaron
    Mehta, Soham
    PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 84 - 130
  • [32] Reward, motivation, and reinforcement learning
    Dayan, P
    Balleine, BW
    NEURON, 2002, 36 (02) : 285 - 298
  • [33] Reinforcement learning model with a reward function based on human driving characteristics
    Pan, Feng
    Bao, Hong
    2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 225 - 229
  • [34] Reinforcement Learning With Constrained Uncertain Reward Function Through Particle Filtering
    Dogru, Oguzhan
    Chiplunkar, Ranjith
    Huang, Biao
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (07) : 7491 - 7499
  • [35] Fear based Intrinsic Reward as a Barrier Function for Continuous Reinforcement Learning
    Sanchez, Rodney
    Sahin, Ferat
    Heard, Jamison
    2024 19TH ANNUAL SYSTEM OF SYSTEMS ENGINEERING CONFERENCE, SOSE 2024, 2024, : 140 - 146
  • [36] LTL and Beyond: Formal Languages for Reward Function Specification in Reinforcement Learning
    Camacho, Alberto
    Icarte, Rodrigo Toro
    Klassen, Toryn Q.
    Valenzano, Richard
    McIlraith, Sheila A.
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6065 - 6073
  • [37] Reinforcement-Learning-Based Path Planning: A Reward Function Strategy
    Jaramillo-Martinez, Ramon
    Chavero-Navarrete, Ernesto
    Ibarra-Perez, Teodoro
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [38] Learning Reward Models for Cooperative Trajectory Planning with Inverse Reinforcement Learning and Monte Carlo Tree Search
    Kurzer, Karl
    Bitzer, Matthias
    Zoellner, J. Marius
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 22 - 28
  • [39] OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning
    Henderson, Peter
    Chang, Wei-Di
    Bacon, Pierre-Luc
    Meger, David
    Pineau, Joelle
    Precup, Doina
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3199 - 3206
  • [40] A state-based inverse reinforcement learning approach to model activity-travel choices behavior with reward function recovery
    Song, Yuchen
    Li, Dawei
    Ma, Zhenliang
    Liu, Dongjie
    Zhang, Tong
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 158