Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning With Application to Autonomous Driving

被引：72

作者：

Wu, Zheng ^{[1
]}

Sun, Liting ^{[1
]}

Zhan, Wei ^{[1
]}

Yang, Chenyu ^{[2
]}

Tomizuka, Masayoshi ^{[1
]}

机构：

[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94709 USA

[2] Shanghai Jiao Tong Univ, Dept Comp Sci, Shanghai 200240, Peoples R China

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2020年 / 5卷 / 04期

关键词：

Learning from demonstration; intelligent transportation systems; inverse reinforcement learning; autonomous driving; social human-robot interaction; ALGORITHMS;

D O I：

10.1109/LRA.2020.3005126

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

In the past decades, we have witnessed significant progress in the domain of autonomous driving. Advanced techniques based on optimization and reinforcement learning become increasingly powerful when solving the forward problem: given designed reward/cost functions, how we should optimize them and obtain driving policies that interact with the environment safely and efficiently. Such progress has raised another equally important question: what should we optimize? Instead of manually specifying the reward functions, it is desired that we can extract what human drivers try to optimize from real traffic data and assign that to autonomous vehicles to enable more naturalistic and transparent interaction between humans and intelligent agents. To address this issue, we present an efficient sampling-based maximum-entropy inverse reinforcement learning (IRL) algorithm in this letter. Different from existing IRL algorithms, by introducing an efficient continuous-domain trajectory sampler, the proposed algorithm can directly learn the reward functions in the continuous domain while considering the uncertainties in demonstrated trajectories from human drivers. We evaluate the proposed algorithm via real-world driving data, including both non-interactive and interactive scenarios. The experimental results show that the proposed algorithm achieves more accurate prediction performance with faster convergence speed and better generalization compared to other baseline IRL algorithms.

引用

页码：5355 / 5362

页数：8

共 50 条

[31] Deep Reinforcement Learning for Autonomous Driving Based on Safety Experience Replay [J].

Huang, Xiaohan ;

Cheng, Yuhu ;

Yu, Qiang ;

Wang, Xuesong .

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (06) :2070-2084

[32] Safe and Interpretable Human-Like Planning With Transformer-Based Deep Inverse Reinforcement Learning for Autonomous Driving [J].

Nan, Jiangfeng ;

Zhang, Ruzheng ;

Yin, Guodong ;

Zhuang, Weichao ;

Zhang, Yilong ;

Deng, Weiwen .

IEEE Transactions on Automation Science and Engineering, 2025, 22 :12134-12146

[33] Sparse online maximum entropy inverse reinforcement learning via proximal optimization and truncated gradient [J].

Song L. ;

Li D. ;

Xu X. .

Knowledge-Based Systems, 2022, 252

[34] Reinforcement learning-based autonomous driving control for efficient road utilization in lane-less environments [J].

Mao Tobisawa ;

Kenji Matsuda ;

Tenta Suzuki ;

Tomohiro Harada ;

Junya Hoshino ;

Yuki Itoh ;

Kaito Kumagae ;

Johei Matsuoka ;

Kiyohiko Hattori .

Artificial Life and Robotics, 2025, 30 (2) :276-288

[35] A Safe and Efficient Lane Change Decision-Making Strategy of Autonomous Driving Based on Deep Reinforcement Learning [J].

Lv, Kexuan ;

Pei, Xiaofei ;

Chen, Ci ;

Xu, Jie .

MATHEMATICS, 2022, 10 (09)

[36] Reinforcement Learning Driving Strategy based on Auxiliary Task for Multi-Scenarios Autonomous Driving [J].

Sun, Jingbo ;

Fang, Xing ;

Zhang, Qichao .

2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, :1337-1342

[37] Applications and Challenges of Reinforcement Learning in Autonomous Driving Technology [J].

He Y. ;

Lin H. ;

Liu Y. ;

Yang L. ;

Qu X. .

Tongji Daxue Xuebao/Journal of Tongji University, 2024, 52 (04) :520-531

[38] End-to-End Autonomous Driving Decision Based on Deep Reinforcement Learning [J].

Huang, Zhiqing ;

Zhang, Ji ;

Tian, Rui ;

Zhang, Yanxin .

CONFERENCE PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2019, :658-662

[39] A Hierarchical Framework for Multi-Lane Autonomous Driving Based on Reinforcement Learning [J].

Zhang, Xiaohui ;

Sun, Jie ;

Wang, Yunpeng ;

Sun, Jian .

IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 4 :626-638

[40] Deep Reinforcement Learning with Noisy Exploration for Autonomous Driving [J].

Li, Ruyang ;

Zhang, Yaqiang ;

Zhao, Yaqian ;

Wei, Hui ;

Xu, Zhe ;

Zhao, Kun .

PROCEEDINGS OF 2022 THE 6TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, ICMLSC 20222, 2022, :8-14

← 1 2 3 4 5 →