Deep reinforcement learning with predictive auxiliary task for autonomous train collision avoidance

被引:0
|
作者
Plissonneau, Antoine [1 ,2 ]
Jourdan, Luca [1 ]
Trentesaux, Damien [2 ]
Abdi, Lotfi [1 ]
Sallak, Mohamed [1 ,3 ]
Bekrar, Abdelghani [2 ]
Quost, Benjamin [1 ,3 ]
Schoen, Walter [1 ,3 ]
机构
[1] Railenium, Valenciennes, France
[2] Univ Polytech Hauts De France, CNRS, LAMIH, UMR 8201, F-59313 Valenciennes, France
[3] Univ Technol Compiegne, CNRS, Heudiasyc Heurist & Diagnost Syst Complexes, CS 60 319, F-60203 Compiegne, France
关键词
Autonomous train; Collision avoidance; Deep reinforcement learning; Auxiliary task; Interpretability; NEURAL-NETWORKS; NAVIGATION; LEVEL;
D O I
10.1016/j.jrtpm.2024.100453
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
The contribution of this paper consists of a deep reinforcement learning (DRL) based method for autonomous train collision avoidance. While DRL applied to autonomous vehicles' collision avoidance has shown interesting results compared to traditional methods, train -like vehicles are not currently covered. In addition, DRL applied to collision avoidance suffers from sparse rewards, which can lead to poor convergence and long training time. To overcome these limitations, this paper proposes a method for training a reinforcement learning (RL) agent for collision avoidance using local obstacle information mapped into occupancy grids. This method also integrates a network architecture containing a predictive auxiliary task consisting in future state prediction and encouraging the intermediate representation to be predictive of obstacle trajectories. A comparison study conducted on multiple simulated scenarios demonstrates that the trained policy outperforms other deep-learning-based policies as well as human driving in terms of both safety and efficiency. As a first step toward the certification of a DRL based method, this paper proposes to approximate the policy learned by the RL agent with an interpretable decision tree. Although this approximation results in a loss of performance, it enables a safety analysis of the learned function and thus paves the way to use the strengths of RL in certifiable algorithms. As this work is pioneering the use of RL for collision avoidance of rail-guided vehicles, and to facilitate future work by other engineers and researchers, a RL-ready simulator is provided with this paper.
引用
收藏
页数:20
相关论文
共 50 条
  • [11] Taming an Autonomous Surface Vehicle for Path Following and Collision Avoidance Using Deep Reinforcement Learning
    Meyer, Eivind
    Robinson, Haakon
    Rasheed, Adil
    San, Omer
    IEEE ACCESS, 2020, 8 : 41466 - 41481
  • [12] Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments
    Fei WANG
    Xiaoping ZHU
    Zhou ZHOU
    Yang TANG
    Chinese Journal of Aeronautics, 2024, 37 (03) : 237 - 257
  • [13] CONTROL METHOD FOR PATH FOLLOWING AND COLLISION AVOIDANCE OF AUTONOMOUS SHIP BASED ON DEEP REINFORCEMENT LEARNING
    Zhao, Luman
    Roh, Myung-Il
    Lee, Sung-Jun
    JOURNAL OF MARINE SCIENCE AND TECHNOLOGY-TAIWAN, 2019, 27 (04): : 293 - 310
  • [14] Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments
    Wang, Fei
    Zhu, Xiaoping
    Zhou, Zhou
    Tang, Yang
    CHINESE JOURNAL OF AERONAUTICS, 2024, 37 (03) : 237 - 257
  • [15] Deep Reinforcement Learning for Autonomous Driving with an Auxiliary Actor Discriminator
    Gao, Qiming
    Chang, Fangle
    Yang, Jiahong
    Tao, Yu
    Ma, Longhua
    Su, Hongye
    SENSORS, 2024, 24 (02)
  • [16] A learning method for AUV collision avoidance through deep reinforcement learning
    Xu, Jian
    Huang, Fei
    Wu, Di
    Cui, Yunfei
    Yan, Zheping
    Du, Xue
    OCEAN ENGINEERING, 2022, 260
  • [17] Modeling Interactions of Autonomous Vehicles and Pedestrians with Deep Multi-Agent Reinforcement Learning for Collision Avoidance
    Trumpp, Raphael
    Bayerlein, Harald
    Gesbert, David
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 331 - 336
  • [18] A Novel Dynamically Adjusted Entropy Algorithm for Collision Avoidance in Autonomous Ships Based on Deep Reinforcement Learning
    Chen, Guoquan
    Huang, Zike
    Wang, Weijun
    Yang, Shenhua
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (09)
  • [19] Static and Dynamic Collision Avoidance for Autonomous Robot Navigation in Diverse Scenarios based on Deep Reinforcement Learning
    Pico, Nabih
    Lee, Beomjoon
    Montero, Estrella
    Tadese, Meseret
    Auh, Eugene
    Doh, Myeongyun
    Moon, Hyungpil
    2023 20TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS, UR, 2023, : 281 - 286
  • [20] High-Speed Collision Avoidance using Deep Reinforcement Learning and Domain Randomization for Autonomous Vehicles
    Kontes, Georgios D.
    Scherer, Daniel D.
    Nisslbeck, Tim
    Fischer, Janina
    Mutschler, Christopher
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,