Realizing Stabilized Landing for Computation-Limited Reusable Rockets: A Quantum Reinforcement Learning Approach

被引：8

作者：

Kim, Gyu Seon ^{[1
]}

Chung, Jaehyun ^{[1
]}

Park, Soohyun ^{[2
]}

机构：

[1] Korea Univ, Dept Elect & Comp Engn, Seoul 02841, South Korea

[2] Sookmyung Womens Univ, Dept Comp Sci, Seoul 04310, South Korea

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2024年 / 73卷 / 08期

关键词：

Rockets; Engines; Control systems; Qubit; Quantum computing; Neural networks; Reinforcement learning; Reusable rockets; stabilized control; quantum reinforcement learning;

D O I：

10.1109/TVT.2024.3373901

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The advent of reusable rockets has heralded a new era in space exploration, reducing the costs of launching satellites by a significant factor. Traditional rockets were disposable, but the design of reusable rockets for repeated use has revolutionized the financial dynamics of space missions. The most critical phase of reusable rockets is the landing stage, which involves managing the tremendous speed and altitude for safe recovery. The complexity of this task presents new challenges for control systems, specifically in terms of precision and adaptability. Classical control systems like the model predictive control (MPC) controller lack the flexibility to adapt to dynamic system changes, making the re-design of the controller costly and time consuming. This paper explores the integration of quantum reinforcement learning into the control systems of reusable rockets as a promising alternative. Unlike classical reinforcement learning, quantum reinforcement learning uses quantum bits that can exist in superposition, allowing for more efficient information encoding and reducing the number of parameters required. This leads to increased computational efficiency, reduced memory requirements, and more stable and predictable performance. Due to the nature of reusable rockets, which must be light, heavy computers cannot fit into them. In the reusable rocket scenario, quantum reinforcement learning, which has reduced memory requirements due to fewer parameters, is a good solution.

引用

页码：12252 / 12257

页数：6

共 11 条

[1]

Brockman G, 2016, Arxiv, DOI [arXiv:1606.01540, DOI 10.48550/ARXIV.1606.01540]

[2]

Jones H.W., 2018, 48 INT C ENV SYST

[3] Stabilized Detection Accuracy Maximization Using Adaptive SAR Image Processing in LEO Networks [J].

Kim, Kyeongrok ;

Lee, Jung-Hoon ;

Jung, Soyi ;

Kim, Joongheon ;

Kim, Jae-Hyun .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (05) :5661-5665

[4] CVXGEN: a code generator for embedded convex optimization [J].

Mattingley, Jacob ;

Boyd, Stephen .

OPTIMIZATION AND ENGINEERING, 2012, 13 (01) :1-27

[5]

Mnih V, 2013, Arxiv, DOI [arXiv:1312.5602, DOI 10.48550/ARXIV.1312.5602]

[6]

Nascimento EO, 2017, IEEE LAT AM T, V15, P1556, DOI [10.1109/tla.2017.7994806, 10.1109/TLA.2017.7994806]

[7] Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation [J].

Park, Soohyun ;

Kim, Jae Pyoung ;

Park, Chanyoung ;

Jung, Soyi ;

Kim, Joongheon .

IEEE COMMUNICATIONS MAGAZINE, 2024, 62 (06) :106-112

[8]

Souza Gabriel de Almeida, THRUST VECTORED ROCK, V55

[9]

Su L., 2022, P IEEE INT C UNM SYS, P214

[10]

Sutton RS, 2000, ADV NEUR IN, V12, P1057

← 1 2 →