Generation of Spacecraft Operations Procedures Using Deep Reinforcement Learning

被引:15
作者
Harris, Andrew [1 ]
Valade, Trace [1 ]
Teil, Thibaud [1 ]
Schaub, Hanspeter [2 ]
机构
[1] Univ Colorado, Ann & HJ Smead Dept Aerosp Engn Sci, Boulder, CO 80309 USA
[2] Univ Colorado Boulder, Ann & HJ Smead Dept Aerosp Engn Sci, Colorado Ctr Astrodynam Res, Chair Engn, 431 UCB, Boulder, CO 80309 USA
关键词
D O I
10.2514/1.A35169
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The high cost of space mission operations has motivated several space agencies to prioritize the development of autonomous spacecraft command and control technologies. Deep reinforcement learning (DRL) techniques present one promising domain for the creation of autonomous agents for complex, multifaceted operations problems. This work examines the feasibility of adapting DRL-driven policy generation algorithms to problems in spacecraft decision-making, including strategies for framing spacecraft decision-making problems such as Markov decision processes, avenues for dimensionality reduction, and simplification using expert domain knowledge, sensitivity to hyperparameters, and robustness in the face of mismodeled environmental dynamics. In addition, consideration is given to ensuring the safety of these approaches by hybridizing them with correct-by-construction control techniques in a novel adaptation of shielded deep reinforcement learning. These strategies are demonstrated against a prototypical low-fidelity stationkeeping scenario and a high-fidelity attitude mode management scenario involving flight heritage attitude control and momentum management algorithms. DRL techniques are found to compare favorably to other black-box optimization tools or heuristic solutions for these problems and to require similar network sizes and training durations as widely used testing datasets in the deep learning community.
引用
收藏
页码:611 / 626
页数:16
相关论文
共 36 条
[1]   Information Systems and Renewable Energy in Algeria [J].
Abdelkader, Harrouz ;
Abbes, Meriem ;
Colak, Ilhami ;
Kayisli, Korhan .
PROCEEDINGS OF 2019 ALGERIAN LARGE ELECTRICAL NETWORK CONFERENCE (CAGRE), 2019, :1-5
[2]  
Alibay F., 2017, 31 ANN AIAA USU C SM
[3]  
[Anonymous], 2019, DARPA 2019 STRAT FRA
[4]  
[Anonymous], 2012, EXPT EVALUATION BAYE, DOI DOI 10.2514/6.2012-4542
[5]  
[Anonymous], 2005, JET PROPULSION
[6]   Multiple Lyapunov functions and other analysis tools for switched and hybrid systems [J].
Branicky, MS .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1998, 43 (04) :475-482
[7]  
Chen T, 2013, LECT NOTES COMPUT SC, V7795, P185, DOI 10.1007/978-3-642-36742-7_13
[8]  
Chien S., 2005, J AEROSPACE COMPUTIN, V2, P196, DOI [DOI 10.2514/1.12923, 10.2514/1.12923]
[9]  
Chien SteveA., 2010, P OF THE 20 INT C ON, P34, DOI DOI 10.1609/ICAPS.V20I1.13410
[10]  
Cianciolo A. D., 2013, Autonomous Aerobraking Development Software: Phase 2 Summary, V150