Example-guided learning of stochastic human driving policies using deep reinforcement learning

被引:0
作者
Ran Emuna
Rotem Duffney
Avinoam Borowsky
Armin Biess
机构
[1] Ben-Gurion University of the Negev,Department of Industrial Engineering and Management
来源
Neural Computing and Applications | 2023年 / 35卷
关键词
Deep reinforcement learning; Imitation learning; Human driving policies; Gaussian processes;
D O I
暂无
中图分类号
学科分类号
摘要
Deep reinforcement learning has been successfully applied to the generation of goal-directed behavior in artificial agents. However, existing algorithms are often not designed to reproduce human-like behavior, which may be desired in many environments, such as human–robot collaborations, social robotics and autonomous vehicles. Here we introduce a model-free and easy-to-implement deep reinforcement learning approach to mimic the stochastic behavior of a human expert by learning distributions of task variables from examples. As tractable use-cases, we study static and dynamic obstacle avoidance tasks for an autonomous vehicle on a highway road in simulation (Unity). Our control algorithm receives a feedback signal from two sources: a deterministic (handcrafted) part encoding basic task goals and a stochastic (data-driven) part that incorporates human expert knowledge. Gaussian processes are used to model human state distributions and to assess the similarity between machine and human behavior. Using this generic approach, we demonstrate that the learning agent acquires human-like driving skills and can generalize to new roads and obstacle distributions unseen during training.
引用
收藏
页码:16791 / 16804
页数:13
相关论文
共 48 条
[1]  
François-Lavet V(2018)An introduction to deep reinforcement learning Found Trends Mach Learn 11 219-354
[2]  
Henderson P(2019)Social behavior for autonomous vehicles Proc Natl Acad Sci 116 24972-24978
[3]  
Islam R(1994)Models of driving behavior: a review of their evolution Accid Anal Prev 26 733-750
[4]  
Bellemare MG(2005)Towards a general theory of driver behaviour Accid Anal Prev 37 461-472
[5]  
Pineau J(2007)Driver models in automobile dynamics application Veh Syst Dyn 45 699-741
[6]  
Schwarting W(2019)MIT Advanced vehicle technology study: large-scale naturalistic driving study of driver behavior and interaction with automation IEEE Access 7 102021-102038
[7]  
Pierson A(2018)Deepmimic: example-guided deep reinforcement learning of physics-based character skills ACM Graph (TOG) 37 143-6802
[8]  
Alonso-Mora J(2018)Learning driver-specific behavior for overtaking: a combined learning framework IEEE Trans Veh Technol 67 6788-368
[9]  
Karaman S(2018)Human-like autonomous car-following model with deep reinforcement learning Transport Res Part C 97 348-179
[10]  
Rus D(2018)An algorithmic perspective on imitation learning Founda Trends Robot 7 1-291