WagerWin: An Efficient Reinforcement Learning Framework for Gambling Games

Cited by: 1
Authors
Wang, Haoli [1]
Wu, Hejun [1]
Lai, Guoming [2]
Affiliations
[1] Sun Yat Sen Univ, Dept Comp Sci & Engn, Guangzhou 510275, Peoples R China
[2] Huizhou Univ, Sch Comp Sci & Engn, Huizhou 516007, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Games; Artificial intelligence; Training; Reinforcement learning; Training data; Monte Carlo methods; Law; Gambling games; game AI; reinforcement learning (RL); NETWORKS; POKER; GO;
DOI
10.1109/TG.2022.3226526
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Although reinforcement learning (RL) has achieved great success in diverse scenarios, complex gambling games still pose great challenges for RL. Common deep RL methods have difficulty maintaining stability and efficiency in such games. Through theoretical analysis, we find that the return distribution of a gambling game is an intrinsic cause of this problem. The return distribution is partitioned into two parts, depending on the win/lose outcome, which represent the gain and the loss. The two parts repel each other because the player keeps "raising," i.e., making a wager. However, common deep RL methods directly approximate the expectation of the return without considering this particular structure of the distribution. This introduces a redundant loss term in the objective function and, consequently, high variance. In this work, we propose WagerWin, a new framework for gambling games. WagerWin introduces probability and value factorization to construct a more effective value function, which removes the redundant loss term from the training objective. In addition, WagerWin supports customized policy adaptation, which can tune the pretrained policy for different inclinations. We conduct extensive experiments on DouDizhu and on SmallDou, a reduced version of DouDizhu. The results demonstrate that WagerWin outperforms the original state-of-the-art RL model in both training efficiency and stability.
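The abstract does not give the factorization formulas, so the following is only an illustrative sketch of the general idea, not the authors' implementation. It assumes the factorization follows the law of total expectation, E[R] = P(win) * E[R | win] + (1 - P(win)) * E[R | lose], and it treats any positive return as a win. The function name `factorized_value` and the sample returns are hypothetical.

```python
from statistics import fmean


def factorized_value(returns):
    """Split sampled returns into a win part and a lose part (assumption:
    a return > 0 counts as a win) and recombine them with the law of
    total expectation."""
    wins = [r for r in returns if r > 0]
    losses = [r for r in returns if r <= 0]

    p_win = len(wins) / len(returns)           # estimated win probability
    gain = fmean(wins) if wins else 0.0        # mean return given a win
    loss = fmean(losses) if losses else 0.0    # mean return given a loss

    # E[R] = P(win) * E[R | win] + (1 - P(win)) * E[R | lose]
    value = p_win * gain + (1.0 - p_win) * loss
    return p_win, gain, loss, value


# Hypothetical returns from eight games; stakes grow as players "raise",
# so gains and losses sit far apart on either side of zero.
returns = [2.0, 4.0, 8.0, -2.0, -4.0, -2.0, 4.0, -8.0]
p_win, gain, loss, value = factorized_value(returns)

# The recombined value matches the plain sample mean of the returns.
direct = fmean(returns)
print(p_win, gain, loss, value, direct)
```

In this sketch the three factors (win probability, conditional gain, conditional loss) are estimated separately, which is the kind of structure a factorized value function could expose; how WagerWin actually parameterizes and trains these terms is described in the paper itself.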
Pages: 483 - 491
Page count: 9
Related papers
50 in total
  • [21] A Unifying Framework for Reinforcement Learning and Planning
    Moerland, Thomas M.
    Broekens, Joost
    Plaat, Aske
    Jonker, Catholijn M.
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [22] Leveraging Joint-Action Embedding in Multiagent Reinforcement Learning for Cooperative Games
    Lou, Xingzhou
    Zhang, Junge
    Du, Yali
    Yu, Chao
    He, Zhaofeng
    Huang, Kaiqi
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (02) : 470 - 482
  • [23] Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games
    Weichao Mao
    Tamer Başar
    Dynamic Games and Applications, 2023, 13 : 165 - 186
  • [24] Spreeze: High-Throughput Parallel Reinforcement Learning Framework
    Hou, Jing
    Chen, Guang
    Zhang, Ruiqi
    Li, Zhijun
    Gu, Shangding
    Jiang, Changjun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 36 (02) : 282 - 292
  • [25] On Passivity, Reinforcement Learning, and Higher Order Learning in Multiagent Finite Games
    Gao, Bolin
    Pavel, Lacra
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (01) : 121 - 136
  • [26] Distributed Reinforcement Learning for Flexible and Efficient UAV Swarm Control
    Venturini, Federico
    Mason, Federico
    Pase, Francesco
    Chiariotti, Federico
    Testolin, Alberto
    Zanella, Andrea
    Zorzi, Michele
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (03) : 955 - 969
  • [27] Evaluating reinforcement learning algorithms in first-person shooter games using VizDoom
    Adil Khan
    Muhammad Naeem
    Multimedia Tools and Applications, 2025, 84 (15) : 15053 - 15075
  • [28] Altruism and Selfishness in Believable Game Agents: Deep Reinforcement Learning in Modified Dictator Games
    Daylamani-Zad, Damon
    Angelides, Marios C.
    IEEE TRANSACTIONS ON GAMES, 2021, 13 (03) : 229 - 238
  • [29] Learning in Games via Reinforcement and Regularization
    Mertikopoulos, Panayotis
    Sandholm, William H.
    MATHEMATICS OF OPERATIONS RESEARCH, 2016, 41 (04) : 1297 - 1324
  • [30] Deep Reinforcement Learning and Influenced Games
    Brady, C.
    Gonen, R.
    Rabinovich, G.
    IEEE ACCESS, 2024, 12 : 114086 - 114099