Consensus Based Reinforcement Learning Method for Differentiated Formation Control of UAVs

被引:1
作者
He, Xingyu [1 ]
Luo, Dongxu [1 ]
Chen, Zekun [1 ]
Yang, Guisong [1 ]
Liu, Yunhuai [2 ,3 ]
机构
[1] Univ Shanghai Sci & Technol, Dept Comp Sci & Engn, Shanghai 200093, Peoples R China
[2] Peking Univ, Dept Comp Sci & Engn, Beijing 100871, Peoples R China
[3] Peking Univ, Chongqing Res Inst Big Data, Chongqing 400039, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Collaboration; Reinforcement learning; Autonomous aerial vehicles; Formation control; Vectors; Routing protocols; Consensus protocol; Maintenance; Delays; Collision avoidance; Consensus mechanism; formation control; obstacle avoidance consistency; obstacle avoidance flexibility; reinforcement learning;
D O I
10.1109/TVT.2024.3473969
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Obstacle avoidance is the crux of formation control for a UAV swarm, no matter in urban or wild application environments. But there is a dilemma to promote both of obstacle avoidance flexibility and consistency in UAV formation control, which has not been solved effectively by existing researches. In view of this, this paper proposes a consensus based reinforcement learning method for differentiated formation control (DFC) of UAVs, to balance the obstacle avoidance flexibility and consistency. In this method, to promote obstacle avoidance flexibility of a UAV swarm, a consensus mechanism with differentiated formation control strategies is designed, it allows each UAV in a swarm changes its formation control strategy among aggregation, formation keeping and obstacle avoidance according to its local environment, and calculates its own current subgoal based on the selected strategy. Further, to improve the flight efficiency of the UAV swarm, a reinforcement learning model is provided to generate the optimal offset vector for each UAV according to its current subgoal. Moreover, to enhance the obstacle avoidance consistency of the UAV swarm, a collaborative obstacle avoidance algorithm is designed in the obstacle avoidance strategy, it requires UAVs to share their obstacle information and obstacle avoidance actions, and provides obstacle avoidance consensus rules to help UAVs to choose consistent obstacle avoidance directions. The experiment results show that the proposed method can combine obstacle avoidance flexibility and consistency of UAVs, thereby achieving higher flight efficiency and maintaining stable network connectivity.
引用
收藏
页码:2429 / 2442
页数:14
相关论文
共 39 条
[11]   Leader-follower Optimal Selection Method for Distributed Control System in Active Distribution Networks [J].
Le, Jian ;
Zhao, Liangang ;
Wang, Cao ;
Zhou, Qian ;
Wang, Yang .
CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2024, 10 (01) :314-323
[12]   Leader-follower formation of light-weight UAVs with novel active disturbance rejection control [J].
Li, Jiacheng ;
Liu, Junmin ;
Huangfu, Shuaiqi ;
Cao, Guoyan ;
Yu, Dengxiu .
APPLIED MATHEMATICAL MODELLING, 2023, 117 :577-591
[13]   Multi-Agent Reinforcement Learning-Based Coordinated Dynamic Task Allocation for Heterogenous UAVs [J].
Liu, Da ;
Dou, Liqian ;
Zhang, Ruilong ;
Zhang, Xiuyun ;
Zong, Qun .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (04) :4372-4383
[14]   A Comprehensive Review of Shepherding as a Bio-Inspired Swarm-Robotics Guidance Approach [J].
Long, Nathan K. ;
Sammut, Karl ;
Sgarioto, Daniel ;
Garratt, Matthew ;
Abbass, Hussein A. .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2020, 4 (04) :523-537
[15]   GrpAvoid: Multigroup Collision-Avoidance Control and Optimization for UAV Swarm [J].
Luo, Linbo ;
Wang, Xinyu ;
Ma, Jianfeng ;
Ong, Yew-Soon .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) :1776-1789
[16]   Multivehicle Flocking With Collision Avoidance via Distributed Model Predictive Control [J].
Lyu, Yang ;
Hu, Jinwen ;
Chen, Ben M. ;
Zhao, Chunhui ;
Pan, Quan .
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (05) :2651-2662
[17]   A survey of UAV-based data collection: Challenges, solutions and future perspectives [J].
Messaoudi, Kaddour ;
Oubbati, Omar Sami ;
Rachedi, Abderrezak ;
Lakas, Abderrahmane ;
Bendouma, Tahar ;
Chaib, Noureddine .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2023, 216
[18]   Formation Control of Multi-quadrotors Based on Deep Q-learning [J].
Mousavifard, Roujin ;
Alipour, Khalil ;
Najafqolian, Mohammad Amin ;
Zarafshan, Payam .
2022 10TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2022, :172-177
[19]   Multi-UAV Assisted Network Coverage Optimization for Rescue Operations using Reinforcement Learning [J].
Oubbati, Omar Sami ;
Badis, Hakim ;
Rachedi, Abderrezak ;
Lakas, Abderrahmane ;
Lorenz, Pascal .
2023 IEEE 20TH CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2023,
[20]   Applications of Unmanned Aerial Vehicles in Mining from Exploration to Reclamation: A Review [J].
Park, Sebeom ;
Choi, Yosoon .
MINERALS, 2020, 10 (08) :1-32