Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning With Application to Autonomous Driving

被引:72
作者
Wu, Zheng [1 ]
Sun, Liting [1 ]
Zhan, Wei [1 ]
Yang, Chenyu [2 ]
Tomizuka, Masayoshi [1 ]
机构
[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94709 USA
[2] Shanghai Jiao Tong Univ, Dept Comp Sci, Shanghai 200240, Peoples R China
关键词
Learning from demonstration; intelligent transportation systems; inverse reinforcement learning; autonomous driving; social human-robot interaction; ALGORITHMS;
D O I
10.1109/LRA.2020.3005126
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the past decades, we have witnessed significant progress in the domain of autonomous driving. Advanced techniques based on optimization and reinforcement learning become increasingly powerful when solving the forward problem: given designed reward/cost functions, how we should optimize them and obtain driving policies that interact with the environment safely and efficiently. Such progress has raised another equally important question: what should we optimize? Instead of manually specifying the reward functions, it is desired that we can extract what human drivers try to optimize from real traffic data and assign that to autonomous vehicles to enable more naturalistic and transparent interaction between humans and intelligent agents. To address this issue, we present an efficient sampling-based maximum-entropy inverse reinforcement learning (IRL) algorithm in this letter. Different from existing IRL algorithms, by introducing an efficient continuous-domain trajectory sampler, the proposed algorithm can directly learn the reward functions in the continuous domain while considering the uncertainties in demonstrated trajectories from human drivers. We evaluate the proposed algorithm via real-world driving data, including both non-interactive and interactive scenarios. The experimental results show that the proposed algorithm achieves more accurate prediction performance with faster convergence speed and better generalization compared to other baseline IRL algorithms.
引用
收藏
页码:5355 / 5362
页数:8
相关论文
共 50 条
  • [21] A Comprehensive Survey on the Application of Deep and Reinforcement Learning Approaches in Autonomous Driving
    Ben Elallid, Badr
    Benamar, Nabil
    Hafid, Abdelhakim Senhaji
    Rachidi, Tajjeeddine
    Mrani, Nabil
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (09) : 7366 - 7390
  • [22] A Behavior Decision Method Based on Reinforcement Learning for Autonomous Driving
    Zheng, Kan
    Yang, Haojun
    Liu, Shiwen
    Zhang, Kuan
    Lei, Lei
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 25386 - 25394
  • [23] Car-Following Behavior Modeling With Maximum Entropy Deep Inverse Reinforcement Learning
    Nan, Jiangfeng
    Deng, Weiwen
    Zhang, Ruzheng
    Zhao, Rui
    Wang, Ying
    Ding, Juan
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (02): : 3998 - 4010
  • [24] Uncertainty-Aware Model-Based Reinforcement Learning: Methodology and Application in Autonomous Driving
    Wu, Jingda
    Huang, Zhiyu
    Lv, Chen
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 194 - 203
  • [25] Adaptive sampling-based motion planning with a non-conservatively defensive strategy for autonomous driving
    Li, Zhaoting
    Zhan, Wei
    Sun, Liting
    Chan, Ching-Yao
    Tomizuka, Masayoshi
    IFAC PAPERSONLINE, 2020, 53 (02): : 15632 - 15638
  • [26] Incorporating Multi-Context Into the Traversability Map for Urban Autonomous Driving Using Deep Inverse Reinforcement Learning
    Jung, Chanyoung
    Shim, David Hyunchul
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 1662 - 1669
  • [27] Reinforcement-Learning-Based Trajectory Learning in Frenet Frame for Autonomous Driving
    Yoon, Sangho
    Kwon, Youngjoon
    Ryu, Jaesung
    Kim, Sungkwan
    Choi, Sungwoo
    Lee, Kyungjae
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [28] Path tracking control based on Deep reinforcement learning in Autonomous driving
    Jiang, Le
    Wang, Yafei
    Wang, Lin
    Wu, Jingkai
    2019 3RD CONFERENCE ON VEHICLE CONTROL AND INTELLIGENCE (CVCI), 2019, : 414 - 419
  • [29] Vision-Based Autonomous Driving: A Hierarchical Reinforcement Learning Approach
    Wang, Jiao
    Sun, Haoyi
    Zhu, Can
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11213 - 11226
  • [30] Deep Reinforcement Learning for Autonomous Driving Based on Safety Experience Replay
    Huang, Xiaohan
    Cheng, Yuhu
    Yu, Qiang
    Wang, Xuesong
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (06) : 2070 - 2084