A novel guidance law based on proximal policy optimization

被引:0
作者
Jiang, Yang [1 ]
Yu, Jianglong [1 ]
Li, Qingdong [1 ]
Ren, Zhang [1 ]
Done, Xiwang [1 ,2 ]
Hua, Yongzhao [1 ,2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
来源
2022 41ST CHINESE CONTROL CONFERENCE (CCC) | 2022年
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
reinforcement learning; proximal policy optimization; high-speed maneuvering target; SLIDING-MODE CONTROL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new guidance law based on deep reinforcement learning is proposed for the high-speed maneuvering target attack problem. Firstly, the missile-target kinematic model is established, and the action space and state space of reinforcement learning are designed. Then, according to the missile strike process, the reward function suitable for this scenario is proposed. The proximal policy optimization (PPO) based guidance law construction is completed. Finally, the strike effect in multiple sets of experiments verifies the effectiveness of the method proposed in this paper.
引用
收藏
页码:3364 / 3369
页数:6
相关论文
共 50 条
[31]   Combustion optimization study of pulverized coal boiler based on proximal policy optimization algorithm [J].
Wu, Xuecheng ;
Zhang, Hongnan ;
Chen, Huafeng ;
Wang, Shifeng ;
Gong, Lingling .
APPLIED THERMAL ENGINEERING, 2024, 254
[32]   Proximal Policy Optimization with Entropy Regularization [J].
Shen, Yuqing .
2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS, ICCCR 2024, 2024, :380-383
[33]   Authentic Boundary Proximal Policy Optimization [J].
Cheng, Yuhu ;
Huang, Longyang ;
Wang, Xuesong .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) :9428-9438
[34]   Proximal policy optimization learning based control of congested freeway traffic [J].
Mo, Shurong ;
Wu, Nailong ;
Qi, Jie ;
Pan, Anqi ;
Feng, Zhiguang ;
Yan, Huaicheng ;
Wang, Yueying .
OPTIMAL CONTROL APPLICATIONS & METHODS, 2024, 45 (02) :719-736
[35]   Task Offloading and Resource Allocation Strategies Based on Proximal Policy Optimization [J].
Liu, Kai ;
Yang, Wujun .
2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, :693-698
[36]   Traffic Signal Control Method Based on Modified Proximal Policy Optimization [J].
An, Yaohui ;
Zhang, Jing .
2022 10TH INTERNATIONAL CONFERENCE ON TRAFFIC AND LOGISTIC ENGINEERING (ICTLE 2022), 2022, :83-88
[37]   Laser vision seam tracking system based on proximal policy optimization [J].
Zou, Yanbiao ;
Zhou, Hengchang .
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2022, 49 (04) :770-778
[38]   Robust solar sail trajectories using proximal policy optimization [J].
Bianchi, Christian ;
Niccolai, Lorenzo ;
Mengali, Giovanni .
ACTA ASTRONAUTICA, 2025, 226 :702-715
[39]   Automatic Management of Cloud Applications with Use of Proximal Policy Optimization [J].
Funika, Wlodzimierz ;
Koperek, Pawel ;
Kitowski, Jacek .
COMPUTATIONAL SCIENCE - ICCS 2020, PT I, 2020, 12137 :73-87
[40]   Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning [J].
Lopes, Guilherme Cano ;
Ferreira, Murillo ;
Simoes, Alexandre da Silva ;
Colombini, Esther Luna .
15TH LATIN AMERICAN ROBOTICS SYMPOSIUM 6TH BRAZILIAN ROBOTICS SYMPOSIUM 9TH WORKSHOP ON ROBOTICS IN EDUCATION (LARS/SBR/WRE 2018), 2018, :503-508