Deep Inverse Reinforcement Learning for Behavior Prediction in Autonomous Driving: Accurate Forecasts of Vehicle Motion

被引:59
作者
Fernando, Tharindu [1 ]
Denman, Simon [2 ]
Sridharan, Sridha [3 ,4 ]
Fookes, Clinton [5 ]
机构
[1] QUT, Sch Elect Engn & Comp Sci, Speech Audio Image & Video Technol Res Program, Brisbane, Qld, Australia
[2] QUT, Sch Elect Engn & Comp Sci, Brisbane, Qld, Australia
[3] Queensland Univ Technol QUT, Sch Elect Engn & Comp Sci, Brisbane, Qld, Australia
[4] QUT, Speech Audio Image & Video Technol Res Program, Brisbane, Qld, Australia
[5] QUT, Discipline Vis & Signal Proc, Sci & Engn Fac, Brisbane, Qld, Australia
关键词
Deep learning; Navigation; Supervised learning; Decision making; Focusing; Reinforcement learning; Autonomous vehicles; FRAMEWORK;
D O I
10.1109/MSP.2020.2988287
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurate behavior anticipation is essential for autonomous vehicles when navigating in close proximity to other vehicles, pedestrians, and cyclists. Thanks to the recent advances in deep learning and inverse reinforcement learning (IRL), we observe a tremendous opportunity to address this need, which was once believed impossible given the complex nature of human decision making. In this article, we summarize the importance of accurate behavior modeling in autonomous driving and analyze the key approaches and major progress that researchers have made, focusing on the potential of deep IRL (D-IRL) to overcome the limitations of previous techniques. We provide quantitative and qualitative evaluations substantiating these observations. Although the field of D-IRL has seen recent successes, its application to model behavior in autonomous driving is largely unexplored. As such, we conclude this article by summarizing the exciting pathways for future breakthroughs.
引用
收藏
页码:87 / 96
页数:10
相关论文
共 36 条
[1]  
[Anonymous], 2018, O. Medium
[2]  
Anthony S., 2017, QUARTZ
[3]   A MARKOVIAN DECISION PROCESS [J].
BELLMAN, R .
JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05) :679-684
[4]  
Caesar H., 2019, NUSCENES MULTIMODAL
[5]  
Colyar J., 2007, Tech. Rep. FHWA-HRT-07-030
[6]  
Costa AD, 2017, IEEE INT NEW CIRC, P225, DOI 10.1109/NEWCAS.2017.8010146
[7]  
Deo N, 2018, IEEE INT VEH SYM, P1179, DOI 10.1109/IVS.2018.8500493
[8]   Convolutional Social Pooling for Vehicle Trajectory Prediction [J].
Deo, Nachiket ;
Trivedi, Mohan M. .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :1549-1557
[9]  
Fernando T., 2018, P JOINT EUR C MACH L, P241
[10]   Neighbourhood Context Embeddings in Deep Inverse Reinforcement Learning for Predicting Pedestrian Motion Over Long Time Horizons [J].
Fernando, Tharindu ;
Denman, Simon ;
Sridharan, Sridha ;
Fookes, Clinton .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :1179-1187