Design of Autonomous UAV Guidance System Using Monte Carlo Tree Search

被引:0
作者
Vasutapituks, Apichart [1 ]
Chong, Edwin K. P. [1 ]
机构
[1] Colorado State Univ, Dept Elect & Comp Engn, Ft Collins, CO 80523 USA
来源
2022 7TH INTERNATIONAL CONFERENCE ON BUSINESS AND INDUSTRIAL RESEARCH (ICBIR2022) | 2022年
关键词
Path planning; Partially observable Markov decision processes; Monte Carlo tree search; An online path planning algorithm; Unmanned aerial vehicles (UAVs);
D O I
10.1109/ICBIR54589.2022.9786433
中图分类号
F [经济];
学科分类号
02 ;
摘要
In this paper, we propose an online path-planning algorithm using a variation of Monte Carlo tree search (MCTS) for navigating intelligently unmanned aerial vehicles (UAVs) to track mobile ground targets, called the P-UAV algorithm. The proposed algorithm employs a non-myopic method applied to a partially observable Markov decision process (POMDP) model, accounting for long-term decision making. Our algorithm integrates a heuristic technique to efficiently generate paths. The algorithm yields to parallel processing methods to significantly enhance its computational performance, making it suitable for realtime implementation. Simulation experiments show that our path-planning algorithm is efficient and achieves good exploration-exploitation tradeoff in finding a near-optimal solution despite the very large search space.
引用
收藏
页码:677 / 682
页数:6
相关论文
共 14 条
[1]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[2]  
Carmo A. R., 2020, 9 INT C PRESTIGIOUS
[3]   Partially Observable Markov Decision Process Approximations for Adaptive Sensing [J].
Chong, Edwin K. P. ;
Kreucher, Christopher M. ;
Hero, Alfred O., III .
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2009, 19 (03) :377-422
[4]  
Coulom R, 2007, LECT NOTES COMPUT SC, V4630, P72
[5]   Monte-Carlo tree search and rapid action value estimation in computer Go [J].
Gelly, Sylvain ;
Silver, David .
ARTIFICIAL INTELLIGENCE, 2011, 175 (11) :1856-1875
[6]  
Katt S, 2017, PR MACH LEARN RES, V70
[7]  
Li JK, 2016, 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), P5701, DOI 10.1109/IROS.2016.7759839
[8]   A POMDP Framework for Coordinated Guidance of Autonomous UAVs for Multitarget Tracking [J].
Miller, Scott A. ;
Harris, Zachary A. ;
Chong, Edwin K. P. .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2009,
[9]   UAV Path Planning in a Dynamic Environment via Partially Observable Markov Decision Process [J].
Ragi, Shankarachary ;
Chong, Edwin K. P. .
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2013, 49 (04) :2397-2412
[10]  
Ragi S, 2012, P AMER CONTR CONF, P3845