Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

被引:2
|
作者
Qiu, Zizhang [1 ]
Wang, Shouguang [1 ]
You, Dan [1 ]
Zhou, MengChu [1 ]
机构
[1] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Peoples R China
关键词
Bridges; Monte Carlo methods; Supervised learning; Interference; Games; Deep reinforcement learning; Software; Contract Bridge; reinforcement learning; search; GO; ALGORITHM; GAME;
D O I
10.1109/JAS.2024.124488
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Contract Bridge, a four-player imperfect information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding presents a challenging aspect due to the need for information exchange with partners and interference with communication of opponents. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge software that has won multiple world championships, by a performance of 0.98 IMPs (international match points) per deal over 10 000 deals, with a much cost-effective approach. The performance significantly surpasses previous state-of-the-art (0.85 IMPs per deal). Note 0.1 IMPs per deal is a significant improvement in Bridge bidding.
引用
收藏
页码:2111 / 2122
页数:12
相关论文
共 50 条
  • [1] Deep Reinforcement Learning Using Optimized Monte Carlo Tree Search in EWN
    Zhang, Yixian
    Li, Zhuoxuan
    Cao, Yiding
    Zhao, Xuan
    Cao, Jinde
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (03) : 544 - 555
  • [2] On Monte Carlo Tree Search and Reinforcement Learning
    Vodopivec, Tom
    Samothrakis, Spyridon
    Ster, Branko
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2017, 60 : 881 - 936
  • [3] Automatic Bridge Bidding Using Deep Reinforcement Learning
    Yeh, Chih-Kuan
    Hsieh, Cheng-Yu
    Lin, Hsuan-Tien
    IEEE TRANSACTIONS ON GAMES, 2018, 10 (04) : 365 - 377
  • [4] MetroZero: Deep Reinforcement Learning and Monte Carlo Tree Search for Optimized Metro Network Expansion
    Alkilane, Khaled
    Lee, Der-Horng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 810 - 823
  • [5] Monte Carlo Tree Search With Reinforcement Learning for Motion Planning
    Weingertner, Philippe
    Ho, Minnie
    Timofeev, Andrey
    Aubert, Sebastien
    Pita-Gil, Guillermo
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [6] Learning the Fastest RNA Folding Path Based on Reinforcement Learning and Monte Carlo Tree Search
    Mao, Kangkun
    Xiao, Yi
    MOLECULES, 2021, 26 (15):
  • [7] DeepMCTS: Deep Reinforcement Learning Assisted Monte Carlo Tree Search for MIMO Detection
    Mo, Tz-Wei
    Chang, Ronald Y.
    Kan, Te-Yi
    2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
  • [8] Reinforcement Learning - A Bridge Between Numerical Methods and Monte Carlo
    Borkar, Vivek S.
    PERSPECTIVES IN MATHEMATICAL SCIENCES I: PROBABILITY AND STATISTICS, 2009, 7 : 71 - 91
  • [9] Tensor Implementation of Monte-Carlo Tree Search for Model-Based Reinforcement Learning
    Balaz, Marek
    Tarabek, Peter
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [10] MCTSteg: A Monte Carlo Tree Search-Based Reinforcement Learning Framework for Universal Non-Additive Steganography
    Mo, Xianbo
    Tan, Shunquan
    Li, Bin
    Huang, Jiwu
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 4306 - 4320