WagerWin: An Efficient Reinforcement Learning Framework for Gambling Games

被引:1
|
作者
Wang, Haoli [1 ]
Wu, Hejun [1 ]
Lai, Guoming [2 ]
机构
[1] Sun Yat Sen Univ, Dept Comp Sci & Engn, Guangzhou 510275, Peoples R China
[2] Huizhou Univ, Sch Comp Sci & Engn, Huizhou 516007, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Artificial intelligence; Training; Reinforcement learning; Training data; Monte Carlo methods; Law; Gambling games; game AI; reinforcement learning (RL); NETWORKS; POKER; GO;
D O I
10.1109/TG.2022.3226526
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although reinforcement learning (RL) has achieved great success in diverse scenarios, complex gambling games still pose great challenges for RL. Common deep RL methods have difficulties maintaining stability and efficiency in such games. By theoretical analysis, we find that the return distribution of a gambling game is an intrinsic factor of this problem. Such return distribution of gambling games is partitioned into two parts, depending on the win/lose outcome. These two parts represent the gain and loss. They repel each other because the player keeps "raising," i.e., making a wager. However, common deep RL methods directly approximate the expectation of the return, without considering the particularity of the distribution. This way causes a redundant loss term in the objective function and a subsequent high variance. In this work, we propose WagerWin, a new framework for gambling games. WagerWin introduces probability and value factorization to construct a more effective value function. Our framework removes the redundant loss term of the objective function in training. In addition, WagerWin supports customized policy adaptation, which can tune the pretrained policy for different inclinations. We conduct extensive experiments on DouDizhu and SmallDou, a reduced version of DouDizhu. The results demonstrate that WagerWin outperforms the original state-of-the-art RL model in both training efficiency and stability.
引用
收藏
页码:483 / 491
页数:9
相关论文
共 50 条
  • [41] Adaptive and Efficient Qubit Allocation Using Reinforcement Learning in Quantum Networks
    Gao, Yanan
    Yang, Song
    Li, Fan
    Fu, Xiaoming
    IEEE NETWORK, 2022, 36 (05): : 48 - 55
  • [42] Hybrid-Pursuit Strategies in Multiple Pursuer-Evader Games Using Reinforcement Learning
    Guan, Yacun
    Xu, Wang
    Liu, Guohua
    IEEE ACCESS, 2024, 12 : 187709 - 187721
  • [43] Reinforcement Learning for Efficient Network Penetration Testing
    Ghanem, Mohamed C.
    Chen, Thomas M.
    INFORMATION, 2020, 11 (01)
  • [44] Efficient Halftoning via Deep Reinforcement Learning
    Jiang, Haitian
    Xiong, Dongliang
    Jiang, Xiaowen
    Ding, Li
    Chen, Liang
    Huang, Kai
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5494 - 5508
  • [45] TransMap: An Efficient CGRA Mapping Framework via Transformer and Deep Reinforcement Learning
    Li, Jingyuan
    Dai, Yuan
    Hu, Yihan
    Li, Jiangnan
    Yin, Wenbo
    Tao, Jun
    Wang, Lingli
    2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 626 - 633
  • [46] Deep reinforcement learning for conservation decisions
    Lapeyrolerie, Marcus
    Chapman, Melissa S.
    Norman, Kari E. A.
    Boettiger, Carl
    METHODS IN ECOLOGY AND EVOLUTION, 2022, 13 (11): : 2649 - 2662
  • [47] Modeling Decisions in Games Using Reinforcement Learning
    Singal, Himanshu
    Aggarwal, Palvi
    Dutt, Varun
    2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA SCIENCE (MLDS 2017), 2017, : 98 - 105
  • [48] Reinforcement Learning From Hierarchical Critics
    Cao, Zehong
    Lin, Chin-Teng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) : 1066 - 1073
  • [49] Inverse Reinforcement Learning for Adversarial Apprentice Games
    Lian, Bosen
    Xue, Wenqian
    Lewis, Frank L.
    Chai, Tianyou
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4596 - 4609
  • [50] Reinforcement Learning in First Person Shooter Games
    McPartland, Michelle
    Gallagher, Marcus
    IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2011, 3 (01) : 43 - 56