Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

被引:2
|
作者
Qiu, Zizhang [1 ]
Wang, Shouguang [1 ]
You, Dan [1 ]
Zhou, MengChu [1 ]
机构
[1] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Peoples R China
关键词
Bridges; Monte Carlo methods; Supervised learning; Interference; Games; Deep reinforcement learning; Software; Contract Bridge; reinforcement learning; search; GO; ALGORITHM; GAME;
D O I
10.1109/JAS.2024.124488
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Contract Bridge, a four-player imperfect information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding presents a challenging aspect due to the need for information exchange with partners and interference with communication of opponents. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge software that has won multiple world championships, by a performance of 0.98 IMPs (international match points) per deal over 10 000 deals, with a much cost-effective approach. The performance significantly surpasses previous state-of-the-art (0.85 IMPs per deal). Note 0.1 IMPs per deal is a significant improvement in Bridge bidding.
引用
收藏
页码:2111 / 2122
页数:12
相关论文
共 50 条
  • [31] Learning Relevant Questions for Conversational Product Search using Deep Reinforcement Learning
    Montazeralghaem, Ali
    Allan, James
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 746 - 754
  • [32] Deep Inverse Reinforcement Learning for Objective Function Identification in Bidding Models
    Guo, Hongye
    Chen, Qixin
    Xia, Qing
    Kang, Chongqing
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2021, 36 (06) : 5684 - 5696
  • [33] Autonomous Earthquake Location via Deep Reinforcement Learning
    Kuang, Wenhuan
    Yuan, Congcong
    Zou, Zhihui
    Zhang, Jie
    Zhang, Wei
    SEISMOLOGICAL RESEARCH LETTERS, 2024, 95 (01) : 367 - 377
  • [34] Road Planning for Slums via Deep Reinforcement Learning
    Zheng, Yu
    Su, Hongyuan
    Ding, Jingtao
    Jin, Depeng
    Li, Yong
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5695 - 5706
  • [35] Deep Reinforcement Learning for Strategic Bidding in Electricity Markets
    Ye, Yujian
    Qiu, Dawei
    Sun, Mingyang
    Papadaskalopoulos, Dimitrios
    Strbac, Goran
    IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (02) : 1343 - 1355
  • [36] A Deep Reinforcement Learning Bidding Algorithm on Electricity Market
    Jia, Shuai
    Gan, Zhongxue
    Xi, Yugeng
    Li, Dewei
    Xue, Shibei
    Wang, Limin
    JOURNAL OF THERMAL SCIENCE, 2020, 29 (05) : 1125 - 1134
  • [37] Beyond Trial and Error: Lane Keeping with Monte Carlo Tree Search-Driven Optimization of Reinforcement Learning
    Kovari, Balint
    Pelenczei, Balint
    Knab, Istvan Gellert
    Becsi, Tamas
    ELECTRONICS, 2024, 13 (11)
  • [38] A Deep Reinforcement Learning Bidding Algorithm on Electricity Market
    Shuai Jia
    Zhongxue Gan
    Yugeng Xi
    Dewei Li
    Shibei Xue
    Limin Wang
    Journal of Thermal Science, 2020, 29 : 1125 - 1134
  • [39] Bin Packing Optimization via Deep Reinforcement Learning
    Wang, Baoying
    Lin, Zhaohui
    Kong, Weijie
    Dong, Huixu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2542 - 2549
  • [40] Adaptive Design of Alloys for CO2 Activation and Methanation via Reinforcement Learning Monte Carlo Tree Search Algorithm
    Song, Zhilong
    Zhou, Qionghua
    Lu, Shuaihua
    Dieb, Sae
    Ling, Chongyi
    Wang, Jinlan
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2023, 14 (14) : 3594 - 3601