A Reinforcement Learning based End-to-End Algorithm for Confrontation Problem

被引:0
|
作者
Wang, Siqiang [1 ]
Yao, Haodi [1 ]
Yao, Yu [1 ]
He, Fenghua [1 ]
机构
[1] Harbin Inst Technol, Harbin 150001, Peoples R China
来源
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC) | 2019年
关键词
Reinforcement Learning; Confrontation; Incomplete Information; Finite State Machine; GAME;
D O I
10.23919/chicc.2019.8866337
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a confrontation problem between two agents is investigated in which one agent is required to find then eliminate the other agent through projecting bullets. A reinforcement learning based algorithm is designed to realize an end-to-end strategy for one agent to confront the other. Self-play method is used to enhance the algorithm. Simulation results show the effectiveness of the proposed algorithm in confrontation with FSM based strategy and human player.
引用
收藏
页码:2594 / 2598
页数:5
相关论文
共 50 条
  • [1] Improvement of End-to-end Automatic Driving Algorithm Based on Reinforcement Learning
    Tang, Jianlin
    Li, Lingyun
    Ai, Yunfeng
    Zhao, Bin
    Ren, Liangcai
    Tian, Bin
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 5086 - 5091
  • [2] ORACLE: End-to-End Model Based Reinforcement Learning
    Andersen, Per-Arne
    Goodwin, Morten
    Granmo, Ole-Christoffer
    ARTIFICIAL INTELLIGENCE XXXVIII, 2021, 13101 : 44 - 57
  • [3] End-to-end Deep Reinforcement Learning Based Coreference Resolution
    Fei, Hongliang
    Li, Xu
    Li, Dingcheng
    Li, Ping
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 660 - 665
  • [4] Deep Reinforcement Learning based Handoff Algorithm in End-to-End Network Slicing Enabling HetNets
    Yang, Deng
    Wu, Wenjun
    Wang, Xiaoxi
    Zhang, Yanhua
    Si, Pengbo
    2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
  • [5] An End-to-End Deep Reinforcement Learning Framework for Electric Vehicle Routing Problem
    Wang, Mengqin
    Wei, Yanling
    Huang, Xueliang
    Gao, Shan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (20): : 33671 - 33682
  • [6] End-to-End Autonomous Driving Decision Based on Deep Reinforcement Learning
    Huang, Zhiqing
    Zhang, Ji
    Tian, Rui
    Zhang, Yanxin
    CONFERENCE PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2019, : 658 - 662
  • [7] End-to-End Deep Reinforcement Learning based Recommendation with Supervised Embedding
    Liu, Feng
    Guo, Huifeng
    Li, Xutao
    Tang, Ruiming
    Ye, Yunming
    He, Xiuqiang
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 384 - 392
  • [8] End-to-End Reinforcement Learning for Torque Based Variable Height Hopping
    Soni, Raghav
    Harnack, Daniel
    Isermann, Hauke
    Fushimi, Sotaro
    Kumar, Shivesh
    Kirchner, Frank
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7531 - 7538
  • [9] Reinforcement Learning Based VNF Scheduling with End-to-End Delay Guarantee
    Li, Junling
    Shi, Weisen
    Zhang, Ning
    Shen, Xuemin Sherman
    2019 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2019,
  • [10] End-to-End Autonomous Driving Decision Based on Deep Reinforcement Learning
    Huang Z.-Q.
    Qu Z.-W.
    Zhang J.
    Zhang Y.-X.
    Tian R.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (09): : 1711 - 1719