Optimal combination of imitation and reinforcement learning for self-driving cars

被引:2
|
作者
Youssef F. [1 ]
Houda B. [1 ]
机构
[1] National School of Computer Science and Systems Analysis (ENSIAS), Mohammed V University, Rabat
来源
Revue d'Intelligence Artificielle | 2019年 / 33卷 / 04期
关键词
Behavioral cloning; Deep reinforcement learning; Expert's trust margin; Prioritized experience replay; Simulation environment; Supervised imitation learning;
D O I
10.18280/ria.330402
中图分类号
学科分类号
摘要
The two steps in human intelligence development, namely, mimicking and tentative application of expertise, are reflected by imitation learning (IL) and reinforcement learning (RL) in artificial intelligence (AI). However, the RL process does not always improve the skills learned from expert demonstrations and enhance the algorithm performance. To solve the problem, this paper puts forward a novel algorithm called optimal combination of imitation and reinforcement learning (OCIRL). First, the concept of deep q-learning from demonstrations (DQfD) was introduced to the actor-critic (A2C) model, creating the A2CfD model. Then, a threshold was estimated from a trained IL model with the same inputs and reward function with the DOfD, and applied to the A2CfD model. The threshold represents the minimum reward that conserves the learned expertise. The resulting A2CfDoC model was trained and tested on self-driving cars in both discrete and continuous environments. The results show that the model outperformed several existing algorithms in terms of speed and accuracy. © 2019 Lavoisier. All rights reserved.
引用
收藏
页码:265 / 273
页数:8
相关论文
共 50 条
  • [21] Continuous improvement of self-driving cars using dynamic confidence-aware reinforcement learning
    Cao, Zhong
    Jiang, Kun
    Zhou, Weitao
    Xu, Shaobing
    Peng, Huei
    Yang, Diange
    NATURE MACHINE INTELLIGENCE, 2023, 5 (02) : 145 - 158
  • [22] Self-driving Cars Using CNN and Q-learning
    Chishti, Syed Owais Ali
    Riaz, Sana
    Zaib, Muhammad Bilal
    Nauman, Mohammad
    2018 IEEE 21ST INTERNATIONAL MULTI-TOPIC CONFERENCE (INMIC), 2018,
  • [23] A Review of Machine Learning Techniques Utilised in Self-Driving Cars
    Dhaif Z.S.
    El Abbadi N.K.
    Iraqi Journal for Computer Science and Mathematics, 2024, 5 (01): : 205 - 219
  • [24] Self-Driving Cars and Trucks Are on the Move
    Hassler, Susan
    IEEE SPECTRUM, 2017, 54 (01) : 5 - 5
  • [25] Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention
    Kim, Jinkyu
    Canny, John
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2961 - 2969
  • [26] Working from self-driving cars
    Hirte, Georg
    Laes, Renee
    Gerike, Regine
    TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2023, 176
  • [27] Self-driving cars and the urban challenge
    Urmson, Chris
    Whittaker, William Red
    IEEE INTELLIGENT SYSTEMS, 2008, 23 (02) : 66 - 68
  • [28] Self-driving Cars and the Right to Drive
    Ratoff W.
    Philosophy & Technology, 2022, 35 (3)
  • [29] Still Waiting for Self-Driving Cars
    Kirkpatrick, Keith
    COMMUNICATIONS OF THE ACM, 2022, 65 (04) : 12 - 14
  • [30] Merging self-driving cars with the law
    De Bruyne, Jan
    Werbrouck, Jarich
    Computer Law and Security Review, 2018, 34 (05): : 1150 - 1153