Multirobot Cooperative Learning for Predator Avoidance

被引:109
作者
Hung Manh La [1 ]
Lim, Ronny [1 ]
Sheng, Weihua [2 ]
机构
[1] Rutgers State Univ, Ctr Adv Infrastruct & Transportat, Piscataway, NJ 08854 USA
[2] Oklahoma State Univ, Sch Elect & Comp Engn, Stillwater, OK 74078 USA
基金
美国国家科学基金会;
关键词
Flocking control; hybrid system; multirobot systems; reinforcement learning; FLOCKING; SYSTEMS; TRANSITION; CONSENSUS; AGENTS; SIZE;
D O I
10.1109/TCST.2014.2312392
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multirobot collaboration has great potentials in tasks, such as reconnaissance and surveillance. In this paper, we propose a multirobot system that integrates reinforcement learning and flocking control to allow robots to learn collaboratively to avoid predator/enemy. Our system can conduct concurrent learning in a distributed fashion as well as generate efficient combination of high-level behaviors (discrete states and actions) and low-level behaviors (continuous states and actions) for multirobot cooperation. In addition, the combination of reinforcement learning and flocking control enables multirobot networks to learn how to avoid predators while maintaining network topology and connectivity. The convergence and scalability of the proposed system are investigated. Simulations and experiments are performed to demonstrate the effectiveness of the proposed system.
引用
收藏
页码:52 / 63
页数:12
相关论文
共 40 条
[1]  
[Anonymous], 1998, Reinforcement Learning: An Introduction
[2]   Reinforcement learning in continuous time and space: Interference and not ill conditioning is the main problem when using distributed function approximators [J].
Baddeley, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :950-956
[3]   Recent Advances in Hierarchical Reinforcement Learning [J].
Andrew G. Barto ;
Sridhar Mahadevan .
Discrete Event Dynamic Systems, 2003, 13 (4) :341-379
[4]  
Bullo F., 2018, LECT NETWORK SYSTEMS
[5]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172
[6]   Distributed learning and cooperative control for multi-agent systems [J].
Choi, Jongeun ;
Oh, Songhwai ;
Horowitz, Roberto .
AUTOMATICA, 2009, 45 (12) :2802-2814
[7]  
Coggan M., 2001, P 4 INT C COMP INT M, P1
[8]   Collective memory and spatial sorting in animal groups [J].
Couzin, ID ;
Krause, J ;
James, R ;
Ruxton, GD ;
Franks, NR .
JOURNAL OF THEORETICAL BIOLOGY, 2002, 218 (01) :1-11
[9]   Hybrid Systems in Robotics Toward Reachability-Based Controller Design [J].
Ding, Jerry ;
Gillula, Jeremy H. ;
Huang, Haomiao ;
Vitus, Michael P. ;
Zhang, Wei ;
Tomlin, Claire J. .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2011, 18 (03) :33-43
[10]   Effective Robot Team Control Methodologies for Battlefield Applications [J].
Fields, MaryAnne ;
Haas, Ellen ;
Hill, Susan ;
Stachowiak, Christopher ;
Barnes, Laura .
2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, :5862-+