A novel guidance law based on proximal policy optimization

Citations: 0
Authors
Jiang, Yang [1]
Yu, Jianglong [1]
Li, Qingdong [1]
Ren, Zhang [1]
Dong, Xiwang [1,2]
Hua, Yongzhao [1,2]
Affiliations
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
Source
2022 41st Chinese Control Conference (CCC) | 2022
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
reinforcement learning; proximal policy optimization; high-speed maneuvering target; sliding-mode control
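DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract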
This paper proposes a guidance law based on deep reinforcement learning for attacking high-speed maneuvering targets. First, the missile-target kinematic model is established, and the state and action spaces for reinforcement learning are designed. Then, a reward function suited to this scenario is proposed according to the missile strike process, which completes the construction of the guidance law based on proximal policy optimization (PPO). Finally, the strike results from multiple sets of experiments verify the effectiveness of the proposed method.
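For orientation, the standard clipped surrogate objective that PPO maximizes can be sketched as below. This is the generic textbook form, not the authors' implementation: the record does not give the paper's state space, action space, reward function, or hyperparameters, so the function name `ppo_clip_objective`, the clip range of 0.2, and the sample numbers are all assumptions chosen for illustration.

```python
import math


def ppo_clip_objective(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective over a batch (to be maximized).

    new_logp / old_logp: log-probabilities of the taken actions under the
    updated and the data-collecting policy. advantages: advantage estimates.
    clip_eps=0.2 is an assumed, commonly used value, not one from the paper.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(lp_new - lp_old)  # probability ratio r
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Taking the minimum removes the incentive to push r outside the clip range.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


# Hypothetical sample: the new policy doubles the probability of one action.
value = ppo_clip_objective([math.log(2.0)], [0.0], [1.0])
print(value)
```

With a positive advantage, the gain from raising the action probability is capped once the ratio exceeds 1 + clip_eps; with a negative advantage, the unclipped (more pessimistic) term is kept.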
Pages: 3364-3369
Page count: 6
Related papers
25 in total
[1]   APPLICATION OF SLIDING-MODE CONTROL TO AIR-AIR INTERCEPTION PROBLEM [J].
BRIERLEY, SD ;
LONGCHAMP, R .
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1990, 26 (02) :306-325
[2]   Mars atmospheric entry guidance for reference trajectory tracking based on robust nonlinear compound controller [J].
Dai, Juan ;
Gao, Ai ;
Xia, Yuanqing .
ACTA ASTRONAUTICA, 2017, 132 :221-229
[3]   Nonsingular terminal sliding mode control technique for attitude tracking problem of a small satellite with combined energy and attitude control system (CEACS) [J].
Eshghi, Samira ;
Varatharajoo, Renuganth .
AEROSPACE SCIENCE AND TECHNOLOGY, 2018, 76 :14-26
[4]   Study on Reinforcement Learning-Based Missile Guidance Law [J].
Hong, Daseon ;
Kim, Minjeong ;
Park, Sungsu .
APPLIED SCIENCES-BASEL, 2020, 10 (18)
[5]  
Kim H G, 2018, IEEE T AERO ELEC SYS, V55, P82
[6]  
Lillicrap TP., 2015, ARXIV
[7]   Missile guidance law design using adaptive cerebellar model articulation controller [J].
Lin, CM ;
Peng, YF .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (03) :636-644
[8]  
Mnih V., 2013, CoRR abs/1312.5602
[9]   WHY MODERN CONTROLLERS CAN GO UNSTABLE IN PRACTICE [J].
NESLINE, FW ;
ZARCHAN, P .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1984, 7 (04) :495-500
[10]   Composite Nonsingular Terminal Sliding Mode Attitude Controller for Spacecraft With Actuator Dynamics Under Matched and Mismatched Disturbances [J].
Qiao, Jianzhong ;
Li, Zhenxing ;
Xu, Jianwei ;
Yu, Xiang .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (02) :1153-1162