Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning

被引:110
|
作者
Everett, Michael [1 ]
Chen, Yu Fan [2 ]
How, Jonathan P. [3 ]
机构
[1] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
[2] Facebook Real Labs, Redmond, WA 98052 USA
[3] MIT, Aeronaut & Astronaut, Cambridge, MA 02139 USA
关键词
Collision avoidance; Robots; Reinforcement learning; Vehicle dynamics; Robot sensing systems; Heuristic algorithms; Dynamics; deep reinforcement learning; motion planning; multiagent systems; decentralized execution;
D O I
10.1109/ACCESS.2021.3050338
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Collision avoidance algorithms are essential for safe and efficient robot operation among pedestrians. This work proposes using deep reinforcement (RL) learning as a framework to model the complex interactions and cooperation with nearby, decision-making agents, such as pedestrians and other robots. Existing RL-based works assume homogeneity of agent properties, use specific motion models over short timescales, or lack a principled method to handle a large, possibly varying number of agents. Therefore, this work develops an algorithm that learns collision avoidance among a variety of heterogeneous, non-communicating, dynamic agents without assuming they follow any particular behavior rules. It extends our previous work by introducing a strategy using Long Short-Term Memory (LSTM) that enables the algorithm to use observations of an arbitrary number of other agents, instead of a small, fixed number of neighbors. The proposed algorithm is shown to outperform a classical collision avoidance algorithm, another deep RL-based algorithm, and scales with the number of agents better (fewer collisions, shorter time to goal) than our previously published learning-based approach. Analysis of the LSTM provides insights into how observations of nearby agents affect the hidden state and quantifies the performance impact of various agent ordering heuristics. The learned policy generalizes to several applications beyond the training scenarios: formation control (arrangement into letters), demonstrations on a fleet of four multirotors and on a fully autonomous robotic vehicle capable of traveling at human walking speed among pedestrians.
引用
收藏
页码:10357 / 10377
页数:21
相关论文
共 50 条
  • [41] Automatic ship collision avoidance using deep reinforcement learning with LSTM in continuous action spaces
    Sawada, Ryohei
    Sato, Keiji
    Majima, Takahiro
    JOURNAL OF MARINE SCIENCE AND TECHNOLOGY, 2021, 26 (02) : 509 - 524
  • [42] Collision avoidance for a small drone with a monocular camera using deep reinforcement learning in an indoor environment
    Kim M.
    Kim J.
    Jung M.
    Oh H.
    Journal of Institute of Control, Robotics and Systems, 2020, 26 (06) : 399 - 411
  • [43] Robot Mapless Navigation in VUCA Environments via Deep Reinforcement Learning
    Xue, Bingxin
    Zhou, Fengyu
    Wang, Chaoqun
    Gao, Ming
    Yin, Lei
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2025, 72 (01) : 639 - 649
  • [44] Motion Planning for Mobile Robots-Focusing on Deep Reinforcement Learning: A Systematic Review
    Sun, Huihui
    Zhang, Weijie
    Yu, Runxiang
    Zhang, Yujie
    IEEE ACCESS, 2021, 9 : 69061 - 69081
  • [45] A Multi-Agent Deep Reinforcement Learning Approach for Practical Decentralized UAV Collision Avoidance
    Thumiger, Nicholas
    Deghat, Mohammad
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2174 - 2179
  • [46] COLREGs-compliant multiship collision avoidance based on deep reinforcement learning
    Zhao, Luman
    Roh, Myung-Il
    OCEAN ENGINEERING, 2019, 191
  • [47] Research on Method of Collision Avoidance Planning for UUV Based on Deep Reinforcement Learning
    Gao, Wei
    Han, Mengxue
    Wang, Zhao
    Deng, Lihui
    Wang, Hongjian
    Ren, Jingfei
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (12)
  • [48] Collision avoidance for AGV based on deep reinforcement learning in complex dynamic environment
    Cai Z.
    Hu Y.
    Wen J.
    Zhang L.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (01): : 236 - 245
  • [49] Collision Detection and Avoidance for Multi-UAV based on Deep Reinforcement Learning
    Wang, Guanzheng
    Liu, Zhihong
    Xiao, Kun
    Xu, Yinbo
    Yang, Lingjie
    Wang, Xiangke
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7783 - 7789
  • [50] Research on MASS Collision Avoidance in Complex Waters Based on Deep Reinforcement Learning
    Liu, Jiao
    Shi, Guoyou
    Zhu, Kaige
    Shi, Jiahui
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (04)