WagerWin: An Efficient Reinforcement Learning Framework for Gambling Games

被引:1
|
作者
Wang, Haoli [1 ]
Wu, Hejun [1 ]
Lai, Guoming [2 ]
机构
[1] Sun Yat Sen Univ, Dept Comp Sci & Engn, Guangzhou 510275, Peoples R China
[2] Huizhou Univ, Sch Comp Sci & Engn, Huizhou 516007, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Artificial intelligence; Training; Reinforcement learning; Training data; Monte Carlo methods; Law; Gambling games; game AI; reinforcement learning (RL); NETWORKS; POKER; GO;
D O I
10.1109/TG.2022.3226526
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although reinforcement learning (RL) has achieved great success in diverse scenarios, complex gambling games still pose great challenges for RL. Common deep RL methods have difficulties maintaining stability and efficiency in such games. By theoretical analysis, we find that the return distribution of a gambling game is an intrinsic factor of this problem. Such return distribution of gambling games is partitioned into two parts, depending on the win/lose outcome. These two parts represent the gain and loss. They repel each other because the player keeps "raising," i.e., making a wager. However, common deep RL methods directly approximate the expectation of the return, without considering the particularity of the distribution. This way causes a redundant loss term in the objective function and a subsequent high variance. In this work, we propose WagerWin, a new framework for gambling games. WagerWin introduces probability and value factorization to construct a more effective value function. Our framework removes the redundant loss term of the objective function in training. In addition, WagerWin supports customized policy adaptation, which can tune the pretrained policy for different inclinations. We conduct extensive experiments on DouDizhu and SmallDou, a reduced version of DouDizhu. The results demonstrate that WagerWin outperforms the original state-of-the-art RL model in both training efficiency and stability.
引用
收藏
页码:483 / 491
页数:9
相关论文
共 50 条
  • [31] Baselines for Reinforcement Learning in Text Games
    Zelinka, Mikulas
    2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, : 320 - 327
  • [32] A Transfer Reinforcement Learning Framework for Smart Home Energy Management Systems
    Khan, Murad
    Silva, Bhagya Nathali
    Khattab, Omar
    Alothman, Basil
    Joumaa, Chibli
    IEEE SENSORS JOURNAL, 2023, 23 (04) : 4060 - 4068
  • [33] Reinforcement Learning in Video Games Using Nearest Neighbor Interpolation and Metric Learning
    Emigh, Matthew S.
    Kriminger, Evan G.
    Brockmeier, Austin J.
    Principe, Jose C.
    Pardalos, Panos M.
    IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2016, 8 (01) : 56 - 66
  • [34] Optimizing Reinforcement Learning Agents in Games Using Curriculum Learning and Reward Shaping
    Khan, Adil
    Muhammad, Muhammad
    Naeem, Muhammad
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2025, 36 (01)
  • [35] Reinforcement Learning Approach for a Cognitive Framework for Classification
    Barth, K.
    Brueggenwirth, S.
    2023 IEEE RADAR CONFERENCE, RADARCONF23, 2023,
  • [36] Interactive preference analysis: A reinforcement learning framework
    Hu, Xiao
    Kang, Siqin
    Ren, Long
    Zhu, Shaokeng
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 319 (03) : 983 - 998
  • [37] A Stable Deep Reinforcement Learning Framework for Recommendation
    Liu, Ruochen
    Jiang, Dawei
    Zhang, Xilong
    IEEE INTELLIGENT SYSTEMS, 2022, 37 (03) : 76 - 84
  • [38] Gradient Monitored Reinforcement Learning
    Abdul Hameed, Mohammed Sharafath
    Chadha, Gavneet Singh
    Schwung, Andreas
    Ding, Steven X.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4106 - 4119
  • [39] Exploring Deep Reinforcement Learning for Battling in Collectible Card Games
    Vieira, Ronaldo e Silva
    Tavares, Anderson Rocha
    Chaimowicz, Luiz
    2022 21ST BRAZILIAN SYMPOSIUM ON COMPUTER GAMES AND DIGITAL ENTERTAINMENT (SBGAMES), 2022, : 49 - 54
  • [40] Evaluating Critical Reinforcement Learning Framework in the Field
    Ju, Song
    Zhou, Guojing
    Abdelshiheed, Mark
    Barnes, Tiffany
    Chi, Min
    ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2021), PT I, 2021, 12748 : 215 - 227