A novel guidance law based on proximal policy optimization

被引:0
作者
Jiang, Yang [1 ]
Yu, Jianglong [1 ]
Li, Qingdong [1 ]
Ren, Zhang [1 ]
Done, Xiwang [1 ,2 ]
Hua, Yongzhao [1 ,2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
来源
2022 41ST CHINESE CONTROL CONFERENCE (CCC) | 2022年
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
reinforcement learning; proximal policy optimization; high-speed maneuvering target; SLIDING-MODE CONTROL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new guidance law based on deep reinforcement learning is proposed for the high-speed maneuvering target attack problem. Firstly, the missile-target kinematic model is established, and the action space and state space of reinforcement learning are designed. Then, according to the missile strike process, the reward function suitable for this scenario is proposed. The proximal policy optimization (PPO) based guidance law construction is completed. Finally, the strike effect in multiple sets of experiments verifies the effectiveness of the method proposed in this paper.
引用
收藏
页码:3364 / 3369
页数:6
相关论文
共 50 条
  • [21] Decision Planning for Autonomous Driving Based on Proximal Policy Optimization
    Li, Shuang
    Liu, Chunsheng
    Nie, Zhaoying
    [J]. PROCEEDINGS OF THE 2024 3RD INTERNATIONAL SYMPOSIUM ON INTELLIGENT UNMANNED SYSTEMS AND ARTIFICIAL INTELLIGENCE, SIUSAI 2024, 2024, : 145 - 148
  • [22] Proximal policy optimization with an integral compensator for quadrotor control
    Hu, Huan
    Wang, Qing-ling
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (05) : 777 - 795
  • [23] Proximal policy optimization with an integral compensator for quadrotor control
    Huan Hu
    Qing-ling Wang
    [J]. Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 777 - 795
  • [24] Proximal Policy Optimization-Based Power Grid Structure Optimization for Reliable Splitting
    Sun, Xinwei
    Han, Shuangteng
    Wang, Yuhong
    Shi, Yunxiang
    Liao, Jianquan
    Zheng, Zongsheng
    Wang, Xi
    Shi, Peng
    [J]. ENERGIES, 2024, 17 (04)
  • [25] Combustion optimization study of pulverized coal boiler based on proximal policy optimization algorithm
    Wu, Xuecheng
    Zhang, Hongnan
    Chen, Huafeng
    Wang, Shifeng
    Gong, Lingling
    [J]. APPLIED THERMAL ENGINEERING, 2024, 254
  • [26] Proximal Policy Optimization with Entropy Regularization
    Shen, Yuqing
    [J]. 2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS, ICCCR 2024, 2024, : 380 - 383
  • [27] Authentic Boundary Proximal Policy Optimization
    Cheng, Yuhu
    Huang, Longyang
    Wang, Xuesong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 9428 - 9438
  • [28] Proximal policy optimization learning based control of congested freeway traffic
    Mo, Shurong
    Wu, Nailong
    Qi, Jie
    Pan, Anqi
    Feng, Zhiguang
    Yan, Huaicheng
    Wang, Yueying
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2024, 45 (02) : 719 - 736
  • [29] Task Offloading and Resource Allocation Strategies Based on Proximal Policy Optimization
    Liu, Kai
    Yang, Wujun
    [J]. 2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 693 - 698
  • [30] Laser vision seam tracking system based on proximal policy optimization
    Zou, Yanbiao
    Zhou, Hengchang
    [J]. INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2022, 49 (04): : 770 - 778