Robustness and performance of Deep Reinforcement Learning

被引:16
作者
Al-Nima, Raid Rafi Omar [1 ]
Han, Tingting [2 ]
Al-Sumaidaee, Saadoon Awad Mohammed [3 ]
Chen, Taolue [2 ]
Woo, Wai Lok [4 ]
机构
[1] Northern Tech Univ, Coll Mosul, Tech Engn, Mosul, Iraq
[2] Birkbeck Univ London, Dept Comp Sci & Informat Syst, London, England
[3] Al Mustansiriyah Univ, Coll Engn, Baghdad, Iraq
[4] Northumbria Univ, Comp & Informat Sci, Newcastle Upon Tyne, Tyne & Wear, England
基金
英国工程与自然科学研究理事会;
关键词
Deep Reinforcement Learning; Genetic Algorithm; Neuron Coverage; Road tracking;
D O I
10.1016/j.asoc.2021.107295
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Reinforcement Learning (DRL) has recently obtained considerable attentions. It empowers Reinforcement Learning (RL) with Deep Learning (DL) techniques to address various difficult tasks. In this paper, a novel approach called the Genetic Algorithm of Neuron Coverage (GANC) is proposed. It is motivated for improving the robustness and performance of a DRL network. The GANC uses Genetic Algorithm (GA) to maximise the Neuron Coverage (NC) of a DRL network by producing augmented inputs. We apply this method in the self-driving car applications, where it is crucial to accurately provide a correct decision for different road tracking views. We evaluate our method on the SYNTHIA-SEQS-05 databases in four different driving environments. Our outcomes are very promising - the best driving accuracy reached 97.75% - and are superior to the state-of-the-art results. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 47 条
  • [1] Al-Hatab M., 2020, TEST ENG MANAG, V11, P3035
  • [2] AL-Hatab M.M., 2020, J CRIT REV, V7, P2598, DOI [DOI 10.31838/JCR.07.05.430, 10.31838/jcr.07.05.430]
  • [3] Al-Nima R.R. O., 2019, International Conference on Computer Recognition Systems, P106
  • [4] Al-Nima R.R.O., 2017, SIGNAL PROCESSING MA
  • [5] Anaz A.S., 2020, Solid State Technology, V63, P3632
  • [6] Andriyanov N., 2020, J PHYS C SERIES, V1661
  • [7] [Anonymous], 2001, PRACTICAL TUTORIAL M
  • [8] [Anonymous], 2014, WORLD ACAD SCI ENG T
  • [9] Deep Reinforcement Learning A brief survey
    Arulkumaran, Kai
    Deisenroth, Marc Peter
    Brundage, Miles
    Bharath, Anil Anthony
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
  • [10] Neural Codes for Image Retrieval
    Babenko, Artem
    Slesarev, Anton
    Chigorin, Alexandr
    Lempitsky, Victor
    [J]. COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 584 - 599