A Cyclic Hyper-parameter Selection Approach for Reinforcement Learning-based UAV Path Planning

Cited by: 0

Authors:

Jones, Michael R. [1]

Djahel, Soufiene [2]

Welsh, Kristopher [1]

Affiliations:

[1] Manchester Metropolitan Univ, Dept Comp & Math, Manchester, Lancs, England

[2] Univ Huddersfield, Sch Comp & Engn, Huddersfield, W Yorkshire, England

Source:

2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC | 2024

Keywords:

UAV; unmanned aerial vehicles; reinforcement learning; q-learning; hyper-parameters; path planning; self-tuning

DOI:

10.1109/CCNC51664.2024.10454801

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Unmanned Aerial Vehicles (UAVs) offer new ways to deliver a variety of urban transportation and service solutions. The ability to plan and re-plan paths across a complex urban environment remains a significant unsolved problem. New Q-learning approaches have the potential to address this problem; however, they must first learn complex environment spaces. A long-standing challenge in this field is the selection of suitable learning hyper-parameters that help a Q-learning algorithm reach an optimal learning policy. Testing and evaluating multiple hyper-parameter combinations is known to be computationally expensive. This paper therefore proposes a new method for hyper-parameter self-tuning that cyclically assigns hyper-parameters within a single learning process, eliminating the need to search experimentally for optimal hyper-parameter value combinations. Evaluation of the captured results shows that training with cyclical hyper-parameter exploration, instead of fixed values, achieves improved path generation while reducing the cumulative learning time required. Although this approach is centred on a Multi Q-table Path Planning solution, the work presents a practical tool applicable to Reinforcement Learning techniques generally.
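The abstract does not give the cycling schedule or the Multi Q-table design, so the following is only a minimal illustrative sketch of the general idea: a single tabular Q-learning run in which the learning rate and exploration rate are reassigned cyclically per episode instead of being fixed. The triangular schedule, the cycle periods and bounds, the fixed discount factor, the toy grid world, and every name used (`triangular_cycle`, `train`, `greedy_path`) are assumptions made for illustration and are not taken from the paper.

```python
import random


def triangular_cycle(step, period, low, high):
    """Hypothetical triangular schedule: low -> high -> low once per period."""
    phase = (step % period) / period       # position in the cycle, in [0, 1)
    frac = 1.0 - abs(2.0 * phase - 1.0)    # rises 0 -> 1, then falls back to 0
    return low + (high - low) * frac


ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]


def step_env(state, action, size):
    """Toy grid world (an assumption): moves are clipped at the borders."""
    dx, dy = ACTIONS[action]
    nxt = (min(max(state[0] + dx, 0), size - 1),
           min(max(state[1] + dy, 0), size - 1))
    goal = (size - 1, size - 1)
    reward = 10.0 if nxt == goal else -1.0
    return nxt, reward, nxt == goal


def train(episodes=300, size=5, seed=0):
    """One learning process; alpha and epsilon are cycled, gamma stays fixed."""
    rng = random.Random(seed)
    gamma = 0.95                           # assumed fixed discount factor
    q = {}                                 # (state, action) -> estimated value
    for ep in range(episodes):
        # Cyclic assignment per episode; periods and bounds are assumed values.
        alpha = triangular_cycle(ep, 50, 0.1, 0.9)
        epsilon = triangular_cycle(ep, 30, 0.05, 0.5)
        state = (0, 0)
        for _ in range(4 * size * size):   # cap on episode length
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))
            else:
                action = max(range(len(ACTIONS)),
                             key=lambda a: q.get((state, a), 0.0))
            nxt, reward, done = step_env(state, action, size)
            best_next = 0.0 if done else max(
                q.get((nxt, a), 0.0) for a in range(len(ACTIONS)))
            old = q.get((state, action), 0.0)
            # Standard Q-learning update using the current cyclic alpha.
            q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
            state = nxt
            if done:
                break
    return q


def greedy_path(q, size=5):
    """Follow the greedy policy from the start until the goal or a step cap."""
    state, path = (0, 0), [(0, 0)]
    for _ in range(4 * size * size):
        action = max(range(len(ACTIONS)), key=lambda a: q.get((state, a), 0.0))
        state, _, done = step_env(state, action, size)
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

Giving the two schedules different periods means a single run passes through many (alpha, epsilon) pairings, which is one plausible reading of how cycling could stand in for a separate training run per fixed combination.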


Pages: 792-798

Page count: 7
