Deep Reinforcement Learning Policy in Hex Game System

Cited by: 0
Authors
Lu, Mengxuan [1]
Li, Xuejun [1]
Affiliations
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
Source
PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC) | 2018
Keywords
Computer Game; Hex Game; Deep Reinforcement Learning; Actor-Critic A3C; GO;
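DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract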
Hex is a zero-sum board game with a large solution space on an 11 x 11 board. In recent years, deep reinforcement learning-based Go systems such as AlphaGo and AlphaGo Zero have achieved great success. In this paper, we design a self-learning method and system structure for Hex. We design a policy network and a value network modeled on the residual network, and train both with the asynchronous advantage actor-critic (A3C) algorithm. A comparison between the deep reinforcement learning-based policy network and a fixed strategy shows that self-learning performs better.
Pages: 6623 - 6626
Number of pages: 4
Related papers
50 in total
  • [41] Design and Implementation of Surakarta Game System Based on Reinforcement Learning
    Li, ShuQin
    Qi, YiZhong
    Bo, JianBo
    Fu, Yao
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 6326 - 6329
  • [42] A Study on the Game System of Dots and Boxes Based on Reinforcement Learning
    Zhang, Yajun
    Li, Shuqin
    Xiong, Xiaojun
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 6319 - 6322
  • [43] Development of a Computer Player for Seejeh (AKA Seega, Siga, Kharbga) Board Game with Deep Reinforcement Learning
    Aljaafreh, Ahmad
    Al-Oudat, Naeem
    10TH INT CONF ON EMERGING UBIQUITOUS SYST AND PERVAS NETWORKS (EUSPN-2019) / THE 9TH INT CONF ON CURRENT AND FUTURE TRENDS OF INFORMAT AND COMMUN TECHNOLOGIES IN HEALTHCARE (ICTH-2019) / AFFILIATED WORKOPS, 2019, 160 : 241 - 247
  • [44] Selective Catalytic Reduction System Ammonia Injection Control Based on Deep Deterministic Policy Reinforcement Learning
    Xie, Peiran
    Zhang, Guangming
    Niu, Yuguang
    Sun, Tianshu
    FRONTIERS IN ENERGY RESEARCH, 2021, 9
  • [45] Utilizing RBC system for taxation policy evaluation: An adaptive interaction framework based on deep reinforcement learning
    Luo, Shuang
    Liu, Shunyu
    Cai, Tianrun
    Wu, Chao
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 268
  • [46] URNAI: A Multi-Game Toolkit for Experimenting Deep Reinforcement Learning Algorithms
    Araujo, Marco A. S.
    Alves, Luiz P. C.
    Madeira, Charles A. G.
    Nobrega, Marcos M.
    2020 19TH BRAZILIAN SYMPOSIUM ON COMPUTER GAMES AND DIGITAL ENTERTAINMENT (SBGAMES 2020), 2020, : 178 - 187
  • [47] A Generalized Circle Agent Based on the Deep Reinforcement Learning for the Game of Geometry Friends
    Sahin, Safa Onur
    Yucesoy, Veysel
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [48] DEEP REINFORCEMENT LEARNING BASED GAME DECISION ALGORITHM FOR DIGITAL MEDIA EDUCATION
    Li, Zu-Ning
    Ping-Kuang
    Zhang, Ting
    Yan, Hua-Rui
    Gu, Xiao-Feng
    2019 16TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICWAMTIP), 2019, : 139 - 142
  • [49] Efficiently Mastering the Game of NoGo with Deep Reinforcement Learning Supported by Domain Knowledge
    Gao, Yifan
    Wu, Lezhou
    ELECTRONICS, 2021, 10 (13)
  • [50] Deep Reinforcement Learning with a Classifier System - First Steps
    Schoenberner, Connor
    Tomforde, Sven
    ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2022, 2022, 13642 : 256 - 270