Multi-Drone Collaborative Shepherding Through Multi-Task Reinforcement Learning

被引：1

作者：

Wang, Guanghui ^{[1
]}

Peng, Junkun ^{[2
]}

Guan, Chenyang ^{[1
]}

Chen, Jinhua ^{[2
]}

Guo, Bing ^{[1
]}

机构：

[1] Qinghai Univ, Xining 810016, Qinghai, Peoples R China

[2] Tsinghua Univ, Beijing 100084, Peoples R China

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 11期

关键词：

Drones; Collaboration; Adaptation models; Biological system modeling; Multitasking; Deep reinforcement learning; Heuristic algorithms; Path planning for multiple mobile robots or agents; reinforcement learning; collaboration; shepherding;

D O I：

10.1109/LRA.2024.3468155

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Robotic shepherding has become indispensable in animal husbandry and crowd management, offering a modern solution to traditional challenges. Drone Automated Shepherding leverages advanced maneuverability and an extensive field of view to improve the efficiency of these tasks, which are typically labor-intensive and time-consuming. Existing methods for managing large herds face significant challenges due to insufficient coordination among multiple drones and the complexities involved in simultaneously executing diverse shepherding tasks. This paper aims to enhance the execution of multiple shepherding tasks by optimizing drone coordination, designing optimal flight paths, and reducing flight time. To harness the potential of reinforcement learning, we develop a multi-drone collaborative shepherding environment that facilitates efficient drone training using a dense reward system. Additionally, we employ a multi-task deep reinforcement learning algorithm that enhances the sample efficiency and reward performance by leveraging shared information across tasks within this environment. Two specific tasks, driving and collecting, are used to assess the performance of our methodology. The effectiveness of our approach is measured against a classical solution named CTRL, examining metrics such as success rate, completion time, and flight path length. Results indicate that our approach significantly outperforms the CTRL in all measured metrics. Visualization of drone trajectories provides further evidence of our enhanced collaboration and efficiency in shepherding operations. Real-world experiments are conducted on a square of 300 m(3) , where two drones utilize our method to guide four small autonomous vehicles from the starting area to the goal area within 30 seconds.

引用

页码：10311 / 10318

页数：8

共 34 条

[1]

[Anonymous], 2020, Titan RTX

[2]

[Anonymous], 2024, NVIDIA Jetson Xavier

[3]

[Anonymous], 2020, Intel Xeon Silver 4210 CPU

[4]

[Anonymous], 2022, PyTorch 1.13 release, including beta versions of functorch and improved support for Apple's new M1 chips

[5] Shepherding algorithm of Multi-mobile robot system [J].

Bat-Erdene, Byambasuren ;

Mandakh, Oyun-Erdene .

2017 FIRST IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC), 2017, :358-361

[6]

Baumann M., 2016, Learning shepherding behavior

[7] The Arcade Learning Environment: An Evaluation Platform for General Agents [J].

Bellemare, Marc G. ;

Naddaf, Yavar ;

Veness, Joel ;

Bowling, Michael .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 :253-279

[8]

D'Eramo C, 2024, Arxiv, DOI arXiv:2401.09561

[9] A Neuro-Evolution Approach to Shepherding Swarm Guidance in the Face of Uncertainty [J].

Debie, Essam ;

Singh, Hemant ;

Elsayed, Saber ;

Perry, Anthony ;

Hunjet, Robert ;

Abbass, Hussein .

2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, :2634-2641

[10]

Fujioka K, 2016, PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), P3179, DOI 10.1109/TENCON.2016.7848636

← 1 2 3 4 →