Self-organized free-flight arrival for urban air mobility

被引：0

作者：

Waltz, Martin ^{[1
]}

Okhrin, Ostap ^{[1
,2
]}

Schultz, Michael ^{[3
]}

机构：

[1] Tech Univ Dresden, Chair Econometr & Stat, esp Transport Sect, Wuerzburger Str 35, D-01062 Dresden, Germany

[2] ScaDS AI, Ctr Scalable Data Analyt & Artificial Intelligence, Dresden Leipzig, Germany

[3] Univ Bundeswehr Munchen, Inst Flight Syst, D-85577 Neubiberg, Germany

来源：

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES | 2024年 / 167卷

关键词：

Deep reinforcement learning; Urban air mobility; eVTOL; REINFORCEMENT; DEMAND; ALGORITHMS; SAFETY; POLICY; EVTOL;

D O I：

10.1016/j.trc.2024.104806

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

Urban air mobility is an innovative mode of transportation in which electric vertical takeoff and landing (eVTOL) vehicles operate between nodes called vertiports. We outline a self-organized vertiport arrival system based on deep reinforcement learning. The airspace around the vertiport is assumed to be circular, and the vehicles can freely operate inside. Each aircraft is considered an individual agent and follows a shared policy, resulting in decentralized actions that are based on local information. We investigate the development of the reinforcement learning policy during training and illustrate how the algorithm moves from suboptimal local holding patterns to a safe and efficient final policy. The latter is validated in simulation-based scenarios, including robustness analyses against sensor noise and a changing distribution of inbound traffic. Lastly, we deploy the final policy on small-scale unmanned aerial vehicles to showcase its real-world usability.

引用

页数：21

共 90 条

[71] TEMPORAL DIFFERENCE LEARNING AND TD-GAMMON
TESAURO, G
[J]. COMMUNICATIONS OF THE ACM, 1995, 38 (03) : 58 - 68
[72] Thin L.N., 2016, Int. J. Comput. Network. Commun., V8, P123, DOI 10.5121/ijcnc.2016.8211
[73] Thipphavong D. P., 2018, P AV TECHN INT OP C, P3676, DOI [10.2514/6.2018-3676, DOI 10.2514/6.2018-3676]
[74] Van Rossum G., 2009, PYTHON 3 REFERENCE M
[75] Waltz M., 2022, Rl dresden algorithm suite
[76] Waltz M, 2024, Arxiv, DOI arXiv:2307.16769
[77] Spatial-temporal recurrent reinforcement learning for autonomous ships
Waltz, Martin
Okhrin, Ostap
[J]. NEURAL NETWORKS, 2023, 165 : 634 - 653
[78] Distributed Reinforcement Learning for Robot Teams: a Review
Yutong Wang
Mehul Damani
Pamela Wang
Yuhong Cao
Guillaume Sartoretti
[J]. Current Robotics Reports, 2022, 3 (4): : 239 - 257
[79] Review of Deep Reinforcement Learning Approaches for Conflict Resolution in Air Traffic Control
Wang, Zhuang
Pan, Weijun
Li, Hui
Wang, Xuan
Zuo, Qinghai
[J]. AEROSPACE, 2022, 9 (06)
[80] WHITLEY D, 1994, STAT COMPUT, V4, P65, DOI 10.1007/BF00175354

← 1 2 3 4 5 6 7 8 9 →