Cooperative Collision Avoidance for Multi-Vehicle Systems Using Reinforcement Learning

Citations: 0
Authors
Wang, Qichen [1]
Phillips, Chris [1]
Affiliations
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England
Source
2013 18TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR) | 2013
Keywords
Cooperative Path Planning; Collision Avoidance; Reinforcement Learning; Adaptive Dynamic Programming;
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Collision avoidance is a fundamental problem in navigation. In this paper, we present a novel method of cooperative movement planning to examine how two vehicles can orchestrate their movements so as to avoid collisions and subsequently return to their intended paths. Movement planning in this research is solved by regarding it as a decision process. When the vehicles are at risk of a collision, the system determines appropriate steering motions for both vehicles at each time step, so that they can cooperatively change course to avoid collisions and return to their original course when the risk is averted. Reinforcement learning is applied to solve this decision-making task. States of the system are described in terms of the vehicles' position and orientation and actions are defined considering the kinematic constraints of the vehicles. In reinforcement learning, an approximate value function is iteratively developed according to certain rules to evaluate state-action combinations of the system. Appropriate motions are selected by the system after calculating the approximate value of possible target states, which also satisfy the requirement of the smoothness of paths, as well as the distances between, and velocities of, both vehicles. The method of least squares is applied in the iterative mechanism to update the approximate value function given a scoring technique for a collection of state samples featuring continuous state space and action space. This paper summarizes the concept and methodologies used to implement an online cooperative collision avoidance system. Different scenarios are tested to assess the performance of the proposed algorithm.
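The least-squares value-function update described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the one-dimensional "lateral offset" state, the three steering increments, the quadratic features, the reward and the discount factor are all assumptions chosen only to keep the example small. It shows the general pattern of scoring a collection of sampled states with a Bellman backup, refitting a linear value approximation by least squares, and then picking the action whose target state has the best approximate value.

```python
import numpy as np

GAMMA = 0.9                              # assumed discount factor
ACTIONS = np.array([-0.2, 0.0, 0.2])     # hypothetical steering increments

def features(s):
    """Quadratic features [1, s, s^2] of a scalar state (assumed basis)."""
    s = np.asarray(s, dtype=float)
    return np.stack([np.ones_like(s), s, s ** 2], axis=-1)

def step(s, a):
    """Toy dynamics: the offset from the intended path shifts by the action."""
    return np.clip(s + a, -2.0, 2.0)

def reward(s, a):
    """Assumed score: penalise offset from the path (s = 0) and steering effort."""
    return -(s ** 2) - 0.1 * a ** 2

def fitted_value_iteration(samples, n_iters=50):
    """Iteratively refit the weights w of V(s) ~ features(s) @ w by least squares."""
    w = np.zeros(3)
    phi = features(samples)                               # (N, 3)
    for _ in range(n_iters):
        nxt = step(samples[:, None], ACTIONS[None, :])    # (N, A) target states
        q = reward(samples[:, None], ACTIONS[None, :]) + GAMMA * (features(nxt) @ w)
        targets = q.max(axis=1)                           # Bellman backup per sample
        w, *_ = np.linalg.lstsq(phi, targets, rcond=None)
    return w

def greedy_action(s, w):
    """Pick the action whose target state has the highest approximate value."""
    q = reward(s, ACTIONS) + GAMMA * (features(step(s, ACTIONS)) @ w)
    return ACTIONS[np.argmax(q)]

if __name__ == "__main__":
    samples = np.linspace(-2.0, 2.0, 41)   # collection of sampled states
    w = fitted_value_iteration(samples)
    print("weights:", w)
    print("action at s = 1.0:", greedy_action(1.0, w))
```

In the paper's setting the state would instead cover the positions and orientations of both vehicles and the actions would respect their kinematic constraints; the sketch only shows the shape of the iteration.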
Pages: 98-102
Page count: 5