multiple UAVs;
deep reinforcement learning;
PPO;
curriculum learning;
Ray;
NAVIGATION;
D O I:
10.3390/drones6070166
中图分类号:
TP7 [遥感技术];
学科分类号:
081102 ;
0816 ;
081602 ;
083002 ;
1404 ;
摘要:
Distributed multi-agent collaborative decision-making technology is the key to general artificial intelligence. This paper takes the self-developed Unity3D collaborative combat environment as the test scenario, setting a task that requires heterogeneous unmanned aerial vehicles (UAVs) to perform a distributed decision-making and complete cooperation task. Aiming at the problem of the traditional proximal policy optimization (PPO) algorithm's poor performance in the field of complex multi-agent collaboration scenarios based on the distributed training framework Ray, the Critic network in the PPO algorithm is improved to learn a centralized value function, and the muti-agent proximal policy optimization (MAPPO) algorithm is proposed. At the same time, the inheritance training method based on course learning is adopted to improve the generalization performance of the algorithm. In the experiment, MAPPO can obtain the highest average accumulate reward compared with other algorithms and can complete the task goal with the fewest steps after convergence, which fully demonstrates that the MAPPO algorithm outperforms the state-of-the-art.
机构:
Univ Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USAUniv Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USA
Jafari, Mohammad
Xu, Hao
论文数: 0引用数: 0
h-index: 0
机构:
Univ Nevada, Dept Elect & Biomed Engn, Reno, NV 89557 USAUniv Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USA
Xu, Hao
Carrillo, Luis Rodolfo Garcia
论文数: 0引用数: 0
h-index: 0
机构:
Texas A&M Univ, Sch Engn & Comp Sci, Dept Elect Engn, 6300 Ocean Dr,Unit 5797, Corpus Christi, TX 78412 USAUniv Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USA
机构:
Univ Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USAUniv Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USA
Jafari, Mohammad
Xu, Hao
论文数: 0引用数: 0
h-index: 0
机构:
Univ Nevada, Dept Elect & Biomed Engn, Reno, NV 89557 USAUniv Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USA
Xu, Hao
Carrillo, Luis Rodolfo Garcia
论文数: 0引用数: 0
h-index: 0
机构:
Texas A&M Univ, Sch Engn & Comp Sci, Dept Elect Engn, 6300 Ocean Dr,Unit 5797, Corpus Christi, TX 78412 USAUniv Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USA