Optimal combination of imitation and reinforcement learning for self-driving cars

被引:2
|
作者
Youssef F. [1 ]
Houda B. [1 ]
机构
[1] National School of Computer Science and Systems Analysis (ENSIAS), Mohammed V University, Rabat
来源
Revue d'Intelligence Artificielle | 2019年 / 33卷 / 04期
关键词
Behavioral cloning; Deep reinforcement learning; Expert's trust margin; Prioritized experience replay; Simulation environment; Supervised imitation learning;
D O I
10.18280/ria.330402
中图分类号
学科分类号
摘要
The two steps in human intelligence development, namely, mimicking and tentative application of expertise, are reflected by imitation learning (IL) and reinforcement learning (RL) in artificial intelligence (AI). However, the RL process does not always improve the skills learned from expert demonstrations and enhance the algorithm performance. To solve the problem, this paper puts forward a novel algorithm called optimal combination of imitation and reinforcement learning (OCIRL). First, the concept of deep q-learning from demonstrations (DQfD) was introduced to the actor-critic (A2C) model, creating the A2CfD model. Then, a threshold was estimated from a trained IL model with the same inputs and reward function with the DOfD, and applied to the A2CfD model. The threshold represents the minimum reward that conserves the learned expertise. The resulting A2CfDoC model was trained and tested on self-driving cars in both discrete and continuous environments. The results show that the model outperformed several existing algorithms in terms of speed and accuracy. © 2019 Lavoisier. All rights reserved.
引用
收藏
页码:265 / 273
页数:8
相关论文
共 50 条
  • [31] Controlled Parking for Self-Driving Cars
    Tariq, Shahroz
    Choi, Hyunsoo
    Wasiq, C. M.
    Park, Heemin
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 1861 - 1865
  • [32] THE TRUTH ABOUT "SELF-DRIVING" CARS
    Shladover, Steven E.
    SCIENTIFIC AMERICAN, 2016, 314 (06) : 53 - 57
  • [33] SELF-DRIVING CARS’ INTRODUCTION AND INFLUENCE
    唐静媛
    何少芳
    科技经济导刊, 2017, (15) : 204 - 206
  • [34] Merging self-driving cars with the law
    De Bruyne, Jan
    Werbrouck, Jarich
    COMPUTER LAW & SECURITY REVIEW, 2018, 34 (05) : 1150 - 1153
  • [35] Self-driving self-managing cars
    Tagg, Brian
    NEW SCIENTIST, 2016, 229 (3066) : 60 - 60
  • [36] LENS MATERIAL FOR SELF-DRIVING CARS
    不详
    ADVANCED MATERIALS & PROCESSES, 2023, 181 (06): : 11 - 11
  • [37] Carpooling and the Economics of Self-Driving Cars
    Ostrovsky, Michael
    Schwarz, Michael
    ACM EC '19: PROCEEDINGS OF THE 2019 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2019, : 581 - 582
  • [38] Self-driving cars will change cities
    Zakharenko, Roman
    REGIONAL SCIENCE AND URBAN ECONOMICS, 2016, 61 : 26 - 37
  • [39] Fundamentals and development of self-driving cars
    Yoganandhan, A.
    Subhash, S. D.
    Jothi, J. Hebinson
    Mohanavel, V
    MATERIALS TODAY-PROCEEDINGS, 2020, 33 : 3303 - 3310
  • [40] Self-driving cars: A city perspective
    Duarte, Fabio
    SCIENCE ROBOTICS, 2019, 4 (28)